Skip to main content

Showing 1–8 of 8 results for author: Karnewar, A

  1. arXiv:2407.02599  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Meta 3D Gen

    Authors: Raphael Bensadoun, Tom Monnier, Yanir Kleiman, Filippos Kokkinos, Yawar Siddiqui, Mahendra Kariya, Omri Harosh, Roman Shapovalov, Benjamin Graham, Emilien Garreau, Animesh Karnewar, Ang Cao, Idan Azuri, Iurii Makarov, Eric-Tuan Le, Antoine Toisoul, David Novotny, Oran Gafni, Natalia Neverova, Andrea Vedaldi

    Abstract: We introduce Meta 3D Gen (3DGen), a new state-of-the-art, fast pipeline for text-to-3D asset generation. 3DGen offers 3D asset creation with high prompt fidelity and high-quality 3D shapes and textures in under a minute. It supports physically-based rendering (PBR), necessary for 3D asset relighting in real-world applications. Additionally, 3DGen supports generative retexturing of previously gener… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2312.08744  [pdf, other

    cs.CV cs.GR

    GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning

    Authors: Animesh Karnewar, Roman Shapovalov, Tom Monnier, Andrea Vedaldi, Niloy J. Mitra, David Novotny

    Abstract: Encoding information from 2D views of an object into a 3D representation is crucial for generalized 3D feature extraction. Such features can then enable 3D reconstruction, 3D generation, and other applications. We propose GOEmbed (Gradient Origin Embeddings) that encodes input 2D images into any 3D representation, without requiring a pre-trained image feature extractor; unlike typical prior approa… ▽ More

    Submitted 15 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ECCV 2024 conference; project page at: https://holodiffusion.github.io/goembed/

  3. arXiv:2308.14244  [pdf, other

    cs.CV cs.GR

    HoloFusion: Towards Photo-realistic 3D Generative Modeling

    Authors: Animesh Karnewar, Niloy J. Mitra, Andrea Vedaldi, David Novotny

    Abstract: Diffusion-based image generators can now produce high-quality and diverse samples, but their success has yet to fully translate to 3D generation: existing diffusion methods can either generate low-resolution but 3D consistent outputs, or detailed 2D views of 3D objects but with potential structural defects and lacking view consistency or realism. We present HoloFusion, a method that combines the b… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: ICCV 2023 conference; project page at: https://holodiffusion.github.io/holofusion

  4. arXiv:2303.16509  [pdf, other

    cs.CV cs.GR

    HoloDiffusion: Training a 3D Diffusion Model using 2D Images

    Authors: Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy Mitra

    Abstract: Diffusion models have emerged as the best approach for generative modeling of 2D images. Part of their success is due to the possibility of training them on millions if not billions of images with a stable learning objective. However, extending these models to 3D remains difficult for two reasons. First, finding a large quantity of 3D training data is much more complex than for 2D images. Second,… ▽ More

    Submitted 21 May, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 conference; project page at: https://holodiffusion.github.io/

  5. arXiv:2211.14902  [pdf, other

    cs.CV cs.GR

    3inGAN: Learning a 3D Generative Model from Images of a Self-similar Scene

    Authors: Animesh Karnewar, Oliver Wang, Tobias Ritschel, Niloy Mitra

    Abstract: We introduce 3inGAN, an unconditional 3D generative model trained from 2D images of a single self-similar 3D scene. Such a model can be used to produce 3D "remixes" of a given scene, by mapping spatial latent codes into a 3D volumetric representation, which can subsequently be rendered from arbitrary views using physically based volume rendering. By construction, the generated scenes remain view-c… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Conference accept at 3DV 2022

  6. ReLU Fields: The Little Non-linearity That Could

    Authors: Animesh Karnewar, Tobias Ritschel, Oliver Wang, Niloy J. Mitra

    Abstract: In many recent works, multi-layer perceptions (MLPs) have been shown to be suitable for modeling complex spatially-varying functions including images and 3D scenes. Although the MLPs are able to represent complex scenes with unprecedented quality and memory footprint, this expressive power of the MLPs, however, comes at the cost of long training and inference times. On the other hand, bilinear/tri… ▽ More

    Submitted 2 July, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Published at SIGGRAPH 2022

  7. arXiv:2011.14398  [pdf, other

    cs.CV cs.GR

    RGBD-Net: Predicting color and depth images for novel views synthesis

    Authors: Phong Nguyen-Ha, Animesh Karnewar, Lam Huynh, Esa Rahtu, Jiri Matas, Janne Heikkila

    Abstract: We propose a new cascaded architecture for novel view synthesis, called RGBD-Net, which consists of two core components: a hierarchical depth regression network and a depth-aware generator network. The former one predicts depth maps of the target views by using adaptive depth scaling, while the latter one leverages the predicted depths and renders spatially and temporally consistent target images.… ▽ More

    Submitted 9 July, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: 19 pages, 15 figures. Code will be available at: https://github.com/phongnhhn92/RGBDNet

  8. arXiv:1903.06048  [pdf, other

    cs.CV cs.LG stat.ML

    MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks

    Authors: Animesh Karnewar, Oliver Wang

    Abstract: While Generative Adversarial Networks (GANs) have seen huge successes in image synthesis tasks, they are notoriously difficult to adapt to different datasets, in part due to instability during training and sensitivity to hyperparameters. One commonly accepted reason for this instability is that gradients passing from the discriminator to the generator become uninformative when there isn't enough o… ▽ More

    Submitted 12 June, 2020; v1 submitted 14 March, 2019; originally announced March 2019.

    Comments: CVPR 2020 (Main Conference). Work sponsored by TomTom and Adobe. Code repository: https://github.com/akanimax/msg-stylegan-tf