Skip to main content

Showing 1–6 of 6 results for author: Fruchter, S

  1. arXiv:2407.02489  [pdf, other

    cs.CV cs.AI cs.GR cs.HC cs.LG

    Magic Insert: Style-Aware Drag-and-Drop

    Authors: Nataniel Ruiz, Yuanzhen Li, Neal Wadhwa, Yael Pritch, Michael Rubinstein, David E. Jacobs, Shlomi Fruchter

    Abstract: We present Magic Insert, a method for dragging-and-dropping subjects from a user-provided image into a target image of a different style in a physically plausible manner while matching the style of the target image. This work formalizes the problem of style-aware drag-and-drop and presents a method for tackling it by addressing two sub-problems: style-aware personalization and realistic object ins… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Project page: https://magicinsert.github.io/

  2. arXiv:2403.18818  [pdf, other

    cs.CV

    ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion

    Authors: Daniel Winter, Matan Cohen, Shlomi Fruchter, Yael Pritch, Alex Rav-Acha, Yedid Hoshen

    Abstract: Diffusion models have revolutionized image editing but often generate images that violate physical laws, particularly the effects of objects on the scene, e.g., occlusions, shadows, and reflections. By analyzing the limitations of self-supervised approaches, we propose a practical solution centered on a \q{counterfactual} dataset. Our method involves capturing a scene before and after removing a s… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  3. arXiv:2401.06105  [pdf, other

    cs.CV cs.CL cs.GR cs.LG

    PALP: Prompt Aligned Personalization of Text-to-Image Models

    Authors: Moab Arar, Andrey Voynov, Amir Hertz, Omri Avrahami, Shlomi Fruchter, Yael Pritch, Daniel Cohen-Or, Ariel Shamir

    Abstract: Content creators often aim to create personalized images using personal subjects that go beyond the capabilities of conventional text-to-image models. Additionally, they may want the resulting image to encompass a specific location, style, ambiance, and more. Existing personalization methods may compromise personalization ability or the alignment to complex textual prompts. This trade-off can impe… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Project page available at https://prompt-aligned.github.io/

  4. arXiv:2312.02133  [pdf, other

    cs.CV cs.GR cs.LG

    Style Aligned Image Generation via Shared Attention

    Authors: Amir Hertz, Andrey Voynov, Shlomi Fruchter, Daniel Cohen-Or

    Abstract: Large-scale Text-to-Image (T2I) models have rapidly gained prominence across creative fields, generating visually compelling outputs from textual prompts. However, controlling these models to ensure consistent style remains challenging, with existing methods necessitating fine-tuning and manual intervention to disentangle content and style. In this paper, we introduce StyleAligned, a novel techniq… ▽ More

    Submitted 11 January, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Project page at style-aligned-gen.github.io

  5. arXiv:2311.17609  [pdf, other

    cs.CV cs.GR cs.LG

    Curved Diffusion: A Generative Model With Optical Geometry Control

    Authors: Andrey Voynov, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen-Or

    Abstract: State-of-the-art diffusion models can generate highly realistic images based on various conditioning like text, segmentation, and depth. However, an essential aspect often overlooked is the specific camera geometry used during image capture. The influence of different optical systems on the final scene appearance is frequently overlooked. This study introduces a framework that intimately integrate… ▽ More

    Submitted 15 July, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Project page at https://anylens-diffusion.github.io/

  6. arXiv:2311.10093  [pdf, other

    cs.CV cs.GR cs.LG

    The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

    Authors: Omri Avrahami, Amir Hertz, Yael Vinker, Moab Arar, Shlomi Fruchter, Ohad Fried, Daniel Cohen-Or, Dani Lischinski

    Abstract: Recent advances in text-to-image generation models have unlocked vast potential for visual creativity. However, the users that use these models struggle with the generation of consistent characters, a crucial aspect for numerous real-world applications such as story visualization, game development, asset design, advertising, and more. Current methods typically rely on multiple pre-existing images… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to SIGGRAPH 2024. Project page is available at https://omriavrahami.com/the-chosen-one/