Skip to main content

Showing 1–13 of 13 results for author: Siddiqui, Y

  1. arXiv:2407.02599  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Meta 3D Gen

    Authors: Raphael Bensadoun, Tom Monnier, Yanir Kleiman, Filippos Kokkinos, Yawar Siddiqui, Mahendra Kariya, Omri Harosh, Roman Shapovalov, Benjamin Graham, Emilien Garreau, Animesh Karnewar, Ang Cao, Idan Azuri, Iurii Makarov, Eric-Tuan Le, Antoine Toisoul, David Novotny, Oran Gafni, Natalia Neverova, Andrea Vedaldi

    Abstract: We introduce Meta 3D Gen (3DGen), a new state-of-the-art, fast pipeline for text-to-3D asset generation. 3DGen offers 3D asset creation with high prompt fidelity and high-quality 3D shapes and textures in under a minute. It supports physically-based rendering (PBR), necessary for 3D asset relighting in real-world applications. Additionally, 3DGen supports generative retexturing of previously gener… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.02445  [pdf, other

    cs.CV cs.AI cs.GR

    Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials

    Authors: Yawar Siddiqui, Tom Monnier, Filippos Kokkinos, Mahendra Kariya, Yanir Kleiman, Emilien Garreau, Oran Gafni, Natalia Neverova, Andrea Vedaldi, Roman Shapovalov, David Novotny

    Abstract: We present Meta 3D AssetGen (AssetGen), a significant advancement in text-to-3D generation which produces faithful, high-quality meshes with texture and material control. Compared to works that bake shading in the 3D object's appearance, AssetGen outputs physically-based rendering (PBR) materials, supporting realistic relighting. AssetGen generates first several views of the object with factored s… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Project Page: https://assetgen.github.io

  3. arXiv:2406.13303  [pdf

    cs.AI cs.CY cs.MM cs.SI

    Integration of Policy and Reputation based Trust Mechanisms in e-Commerce Industry

    Authors: Muhammad Yasir Siddiqui, Alam Gir

    Abstract: The e-commerce systems are being tackled from commerce behavior and internet technologies. Therefore, trust aspect between buyer-seller transactions is a potential element which needs to be addressed in competitive e-commerce industry. The e-commerce industry is currently handling two different trust approaches. First approach consists on centralized mechanism where digital credentials/set of rule… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2312.11417  [pdf, other

    cs.CV

    PolyDiff: Generating 3D Polygonal Meshes with Diffusion Models

    Authors: Antonio Alliegro, Yawar Siddiqui, Tatiana Tommasi, Matthias Nießner

    Abstract: We introduce PolyDiff, the first diffusion-based approach capable of directly generating realistic and diverse 3D polygonal meshes. In contrast to methods that use alternate 3D shape representations (e.g. implicit representations), our approach is a discrete denoising diffusion probabilistic model that operates natively on the polygonal mesh data structure. This enables learning of both the geomet… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  5. arXiv:2311.15475  [pdf, other

    cs.CV cs.LG

    MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers

    Authors: Yawar Siddiqui, Antonio Alliegro, Alexey Artemov, Tatiana Tommasi, Daniele Sirigatti, Vladislav Rosov, Angela Dai, Matthias Nießner

    Abstract: We introduce MeshGPT, a new approach for generating triangle meshes that reflects the compactness typical of artist-created meshes, in contrast to dense triangle meshes extracted by iso-surfacing methods from neural fields. Inspired by recent advances in powerful large language models, we adopt a sequence-based approach to autoregressively generate triangle meshes as sequences of triangles. We fir… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: Project Page: https://nihalsid.github.io/mesh-gpt/, Video: https://youtu.be/UV90O1_69_o

  6. arXiv:2303.11396  [pdf, other

    cs.CV

    Text2Tex: Text-driven Texture Synthesis via Diffusion Models

    Authors: Dave Zhenyu Chen, Yawar Siddiqui, Hsin-Ying Lee, Sergey Tulyakov, Matthias Nießner

    Abstract: We present Text2Tex, a novel method for generating high-quality textures for 3D meshes from the given text prompts. Our method incorporates inpainting into a pre-trained depth-aware image diffusion model to progressively synthesize high resolution partial textures from multiple viewpoints. To avoid accumulating inconsistent and stretched artifacts across views, we dynamically segment the rendered… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Project page: https://daveredrum.github.io/Text2Tex/

  7. arXiv:2212.09802  [pdf, other

    cs.CV cs.LG

    Panoptic Lifting for 3D Scene Understanding with Neural Fields

    Authors: Yawar Siddiqui, Lorenzo Porzi, Samuel Rota Buló, Norman Müller, Matthias Nießner, Angela Dai, Peter Kontschieder

    Abstract: We propose Panoptic Lifting, a novel approach for learning panoptic 3D volumetric representations from images of in-the-wild scenes. Once trained, our model can render color images together with 3D-consistent panoptic segmentation from novel viewpoints. Unlike existing approaches which use 3D input directly or indirectly, our method requires only machine-generated 2D panoptic segmentation masks… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: Project Page: https://nihalsid.github.io/panoptic-lifting/, Video: https://youtu.be/QtsiL-6rSuM

  8. arXiv:2212.01206  [pdf, other

    cs.CV

    DiffRF: Rendering-Guided 3D Radiance Field Diffusion

    Authors: Norman Müller, Yawar Siddiqui, Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder, Matthias Nießner

    Abstract: We introduce DiffRF, a novel approach for 3D radiance field synthesis based on denoising diffusion probabilistic models. While existing diffusion-based methods operate on images, latent codes, or point cloud data, we are the first to directly generate volumetric radiance fields. To this end, we propose a 3D denoising model which directly operates on an explicit voxel grid representation. However,… ▽ More

    Submitted 27 March, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Project page: https://sirwyver.github.io/DiffRF/ Video: https://youtu.be/qETBcLu8SUk - CVPR 2023 Highlight - updated evaluations after fixing initial data mapping error on all methods

  9. arXiv:2204.02411  [pdf, other

    cs.CV cs.GR

    Texturify: Generating Textures on 3D Shape Surfaces

    Authors: Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai

    Abstract: Texture cues on 3D objects are key to compelling visual representations, with the possibility to create high visual fidelity with inherent spatial consistency across different views. Since the availability of textured 3D shapes remains very limited, learning a 3D-supervised data-driven method that predicts a texture based on the 3D input is very challenging. We thus propose Texturify, a GAN-based… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: Project Page: https://nihalsid.github.io/texturify

  10. arXiv:2104.00024  [pdf, other

    cs.CV

    RetrievalFuse: Neural 3D Scene Reconstruction with a Database

    Authors: Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai

    Abstract: 3D reconstruction of large scenes is a challenging problem due to the high-complexity nature of the solution space, in particular for generative neural networks. In contrast to traditional generative learned models which encode the full generative process into a neural network and can struggle with maintaining local details at the scene level, we introduce a new method that directly leverages scen… ▽ More

    Submitted 10 August, 2021; v1 submitted 31 March, 2021; originally announced April 2021.

    Comments: Project Page: https://nihalsid.github.io/retrieval-fuse/

  11. arXiv:2006.14660  [pdf, other

    cs.CV

    SPSG: Self-Supervised Photometric Scene Generation from RGB-D Scans

    Authors: Angela Dai, Yawar Siddiqui, Justus Thies, Julien Valentin, Matthias Nießner

    Abstract: We present SPSG, a novel approach to generate high-quality, colored 3D models of scenes from RGB-D scan observations by learning to infer unobserved scene geometry and color in a self-supervised fashion. Our self-supervised approach learns to jointly inpaint geometry and color by correlating an incomplete RGB-D scan with a more complete version of that scan. Notably, rather than relying on 3D reco… ▽ More

    Submitted 28 April, 2021; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: Video: https://youtu.be/1cj962m9zqo

  12. arXiv:1911.11789  [pdf, other

    cs.CV cs.LG

    ViewAL: Active Learning with Viewpoint Entropy for Semantic Segmentation

    Authors: Yawar Siddiqui, Julien Valentin, Matthias Nießner

    Abstract: We propose ViewAL, a novel active learning strategy for semantic segmentation that exploits viewpoint consistency in multi-view datasets. Our core idea is that inconsistencies in model predictions across viewpoints provide a very reliable measure of uncertainty and encourage the model to perform well irrespective of the viewpoint under which objects are observed. To incorporate this uncertainty me… ▽ More

    Submitted 18 March, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: CVPR2020, Video: https://youtu.be/tAGdx2j-X_g

  13. arXiv:1801.07648  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Clustering with Deep Learning: Taxonomy and New Methods

    Authors: Elie Aljalbout, Vladimir Golkov, Yawar Siddiqui, Maximilian Strobel, Daniel Cremers

    Abstract: Clustering methods based on deep neural networks have proven promising for clustering real-world data because of their high representational power. In this paper, we propose a systematic taxonomy of clustering methods that utilize deep neural networks. We base our taxonomy on a comprehensive review of recent work and validate the taxonomy in a case study. In this case study, we show that the taxon… ▽ More

    Submitted 13 September, 2018; v1 submitted 23 January, 2018; originally announced January 2018.

    MSC Class: 62H30; 62M45; 91C20 ACM Class: H.3.3; I.2.6; I.5; I.5.3; I.5.4