Showing 1–21 of 21 results for author: Williams, F

  1. arXiv:2407.01781  [pdf, other]

    cs.CV cs.GR cs.LG

    fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence

    Authors: Francis Williams, Jiahui Huang, Jonathan Swartz, Gergely Klár, Vijay Thakkar, Matthew Cong, Xuanchi Ren, Ruilong Li, Clement Fuji-Tsang, Sanja Fidler, Eftychios Sifakis, Ken Museth

    Abstract: We present fVDB, a novel GPU-optimized framework for deep learning on large-scale 3D data. fVDB provides a complete set of differentiable primitives to build deep learning architectures for common tasks in 3D learning such as convolution, pooling, attention, ray-tracing, meshing, etc. fVDB simultaneously provides a much larger feature set (primitives and operators) than established frameworks wi…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.00911  [pdf, other]

    cs.CV cs.CL cs.LG

    Deep Image-to-Recipe Translation

    Authors: Jiangqin Ma, Bilal Mawji, Franz Williams

    Abstract: The modern saying, "You Are What You Eat" resonates on a profound level, reflecting the intricate connection between our identities and the food we consume. Our project, Deep Image-to-Recipe Translation, is an intersection of computer vision and natural language generation that aims to bridge the gap between cherished food memories and the art of culinary creation. Our primary objective involves p…

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.08292  [pdf, other]

    cs.CV

    Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata

    Authors: Dongsu Zhang, Francis Williams, Zan Gojcic, Karsten Kreis, Sanja Fidler, Young Min Kim, Amlan Kar

    Abstract: We aim to generate fine-grained 3D geometry from large-scale sparse LiDAR scans, abundantly captured by autonomous vehicles (AV). Contrary to prior work on AV scene completion, we aim to extrapolate fine geometry from unlabeled and beyond spatial limits of LiDAR scans, taking a step towards generating realistic, high-resolution simulation-ready 3D street environments. We propose hierarchical Gener…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR 2024 as highlight

  4. arXiv:2404.16221  [pdf, other]

    cs.CV cs.DC cs.GR

    NeRF-XL: Scaling NeRFs with Multiple GPUs

    Authors: Ruilong Li, Sanja Fidler, Angjoo Kanazawa, Francis Williams

    Abstract: We present NeRF-XL, a principled method for distributing Neural Radiance Fields (NeRFs) across multiple GPUs, thus enabling the training and rendering of NeRFs with an arbitrarily large capacity. We begin by revisiting existing multi-GPU approaches, which decompose large scenes into multiple independently trained NeRFs, and identify several fundamental issues with these methods that hinder improve…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Webpage: https://research.nvidia.com/labs/toronto-ai/nerfxl/

  5. arXiv:2402.08529  [pdf, other]

    cs.LG cs.CV

    Approximately Piecewise E(3) Equivariant Point Networks

    Authors: Matan Atzmon, Jiahui Huang, Francis Williams, Or Litany

    Abstract: Integrating a notion of symmetry into point cloud neural networks is a provably effective way to improve their generalization capability. Of particular interest are $E(3)$ equivariant point cloud networks where Euclidean transformations applied to the inputs are preserved in the outputs. Recent efforts aim to extend networks that are $E(3)$ equivariant, to accommodate inputs made of multiple parts…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  6. arXiv:2312.03806  [pdf, other]

    cs.CV cs.GR cs.LG

    XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

    Authors: Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams

    Abstract: We present XCube (abbreviated as $\mathcal{X}^3$), a novel generative model for high-resolution sparse 3D voxel grids with arbitrary attributes. Our model can generate millions of voxels with a finest effective resolution of up to $1024^3$ in a feed-forward fashion without time-consuming test-time optimization. To achieve this, we employ a hierarchical voxel latent diffusion model which generates…

    Submitted 25 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Highlight. Code: https://github.com/nv-tlabs/XCube/ Website: https://research.nvidia.com/labs/toronto-ai/xcube/

  7. arXiv:2305.19590  [pdf, other]

    cs.CV

    Neural Kernel Surface Reconstruction

    Authors: Jiahui Huang, Zan Gojcic, Matan Atzmon, Or Litany, Sanja Fidler, Francis Williams

    Abstract: We present a novel method for reconstructing a 3D implicit surface from a large-scale, sparse, and noisy point cloud. Our approach builds upon the recently introduced Neural Kernel Fields (NKF) representation. It enjoys similar generalization capabilities to NKF, while simultaneously addressing its main limitations: (a) We can scale to large scenes through compactly supported kernel functions, whi…

    Submitted 9 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: CVPR 2023

  8. arXiv:2305.01643  [pdf, other]

    cs.CV

    Neural LiDAR Fields for Novel View Synthesis

    Authors: Shengyu Huang, Zan Gojcic, Zian Wang, Francis Williams, Yoni Kasten, Sanja Fidler, Konrad Schindler, Or Litany

    Abstract: We present Neural Fields for LiDAR (NFL), a method to optimise a neural field scene representation from LiDAR measurements, with the goal of synthesizing realistic LiDAR scans from novel viewpoints. NFL combines the rendering power of neural fields with a detailed, physically motivated model of the LiDAR sensing process, thus enabling it to accurately reproduce key sensor behaviors like beam diver…

    Submitted 13 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ICCV 2023 - camera ready. Project page: https://research.nvidia.com/labs/toronto-ai/nfl/

  9. arXiv:2210.06978  [pdf, other]

    cs.CV cs.LG stat.ML

    LION: Latent Point Diffusion Models for 3D Shape Generation

    Authors: Xiaohui Zeng, Arash Vahdat, Francis Williams, Zan Gojcic, Or Litany, Sanja Fidler, Karsten Kreis

    Abstract: Denoising diffusion models (DDMs) have shown promising results in 3D point cloud synthesis. To advance 3D DDMs and make them useful for digital artists, we require (i) high generation quality, (ii) flexibility for manipulation and applications such as conditional synthesis and shape interpolation, and (iii) the ability to output smooth surfaces or meshes. To this end, we introduce the hierarchical…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  10. arXiv:2209.11125  [pdf, other]

    cs.LG cs.IR

    A Bibliographic View on Constrained Clustering

    Authors: Ludmila Kuncheva, Francis Williams, Samuel Hennessey

    Abstract: A keyword search on constrained clustering on Web-of-Science returned just under 3,000 documents. We ran automatic analyses of those, and compiled our own bibliography of 183 papers which we analysed in more detail based on their topic and experimental study, if any. This paper presents general trends of the area and its sub-topics by Pareto analysis, using citation count and year of publication.…

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 18 pages, 11 figures, 177 references

    MSC Class: 62H30

    ACM Class: I.5.3; I.5.0

  11. arXiv:2202.08345  [pdf, other]

    cs.CV cs.GR

    Learning Smooth Neural Functions via Lipschitz Regularization

    Authors: Hsueh-Ti Derek Liu, Francis Williams, Alec Jacobson, Sanja Fidler, Or Litany

    Abstract: Neural implicit fields have recently emerged as a useful representation for 3D shapes. These fields are commonly represented as neural networks which map latent descriptors and 3D coordinates to implicit function values. The latent descriptor of a neural field acts as a deformation handle for the 3D shape it represents. Thus, smoothness with respect to this descriptor is paramount for performing s…

    Submitted 10 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  12. arXiv:2111.13674  [pdf, other]

    cs.CV cs.GR cs.LG

    Neural Fields as Learnable Kernels for 3D Reconstruction

    Authors: Francis Williams, Zan Gojcic, Sameh Khamis, Denis Zorin, Joan Bruna, Sanja Fidler, Or Litany

    Abstract: We present Neural Kernel Fields: a novel method for reconstructing implicit 3D shapes based on a learned kernel ridge regression. Our technique achieves state-of-the-art results when reconstructing 3D objects and large scenes from sparse oriented points, and can reconstruct shape categories outside the training set with almost no drop in accuracy. The core insight of our approach is that kernel me…

    Submitted 26 November, 2021; originally announced November 2021.

  13. arXiv:2103.06234  [pdf, other]

    math.OC cs.LG

    Symmetry Breaking in Symmetric Tensor Decomposition

    Authors: Yossi Arjevani, Joan Bruna, Michael Field, Joe Kileel, Matthew Trager, Francis Williams

    Abstract: In this note, we consider the highly nonconvex optimization problem associated with computing the rank decomposition of symmetric tensors. We formulate the invariance properties of the loss function and show that critical points detected by standard gradient based methods are \emph{symmetry breaking} with respect to the target tensor. The phenomena, seen for different choices of target tensors and…

    Submitted 28 December, 2023; v1 submitted 10 March, 2021; originally announced March 2021.

  14. arXiv:2012.10518  [pdf, other]

    cs.CV

    Human 3D keypoints via spatial uncertainty modeling

    Authors: Francis Williams, Or Litany, Avneesh Sud, Kevin Swersky, Andrea Tagliasacchi

    Abstract: We introduce a technique for 3D human keypoint estimation that directly models the notion of spatial uncertainty of a keypoint. Our technique employs a principled approach to modelling spatial uncertainty inspired from techniques in robust statistics. Furthermore, our pipeline requires no 3D ground truth labels, relying instead on (possibly noisy) 2D image-level keypoints. Our method achieves near…

    Submitted 18 December, 2020; originally announced December 2020.

  15. arXiv:2006.13782  [pdf, other]

    cs.CV cs.GR

    Neural Splines: Fitting 3D Surfaces with Infinitely-Wide Neural Networks

    Authors: Francis Williams, Matthew Trager, Joan Bruna, Denis Zorin

    Abstract: We present Neural Splines, a technique for 3D surface reconstruction that is based on random feature kernels arising from infinitely-wide shallow ReLU networks. Our method achieves state-of-the-art results, outperforming recent neural network-based techniques and widely used Poisson Surface Reconstruction (which, as we demonstrate, can also be viewed as a type of kernel method). Because our approa…

    Submitted 27 May, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  16. arXiv:1912.03629  [pdf, other]

    cs.CV cs.GR cs.LG

    VoronoiNet: General Functional Approximators with Local Support

    Authors: Francis Williams, Daniele Panozzo, Kwang Moo Yi, Andrea Tagliasacchi

    Abstract: Voronoi diagrams are highly compact representations that are used in various Graphics applications. In this work, we show how to embed a differentiable version of it -- via a novel deep architecture -- into a generative deep network. By doing so, we achieve a highly compact latent embedding that is able to provide much more detailed reconstructions, both in 2D and 3D, for various shapes. In this t…

    Submitted 8 December, 2019; originally announced December 2019.

  17. arXiv:1906.07842  [pdf, other]

    cs.LG stat.ML

    Gradient Dynamics of Shallow Univariate ReLU Networks

    Authors: Francis Williams, Matthew Trager, Claudio Silva, Daniele Panozzo, Denis Zorin, Joan Bruna

    Abstract: We present a theoretical and empirical study of the gradient dynamics of overparameterized shallow ReLU networks with one-dimensional input, solving least-squares interpolation. We show that the gradient dynamics of such networks are determined by the gradient flow in a non-redundant parameterization of the network function. We examine the principal qualitative features of this gradient flow. In p…

    Submitted 18 June, 2019; originally announced June 2019.

  18. Unwind: Interactive Fish Straightening

    Authors: Francis Williams, Alexander Bock, Harish Doraiswamy, Cassandra Donatelli, Kayla Hall, Adam Summers, Daniele Panozzo, Cláudio T. Silva

    Abstract: The ScanAllFish project is a large-scale effort to scan all the world's 33,100 known species of fishes. It has already generated thousands of volumetric CT scans of fish species which are available on open access platforms such as the Open Science Framework. To achieve a scanning rate required for a project of this magnitude, many specimens are grouped together into a single tube and scanned all a…

    Submitted 5 February, 2020; v1 submitted 9 April, 2019; originally announced April 2019.

  19. arXiv:1812.06216  [pdf, other]

    cs.GR cs.CG cs.CV cs.LG

    ABC: A Big CAD Model Dataset For Geometric Deep Learning

    Authors: Sebastian Koch, Albert Matveev, Zhongshi Jiang, Francis Williams, Alexey Artemov, Evgeny Burnaev, Marc Alexa, Denis Zorin, Daniele Panozzo

    Abstract: We introduce ABC-Dataset, a collection of one million Computer-Aided Design (CAD) models for research of geometric deep learning methods and applications. Each model is a collection of explicitly parametrized curves and surfaces, providing ground truth for differential quantities, patch segmentation, geometric feature detection, and shape reconstruction. Sampling the parametric descriptions of sur…

    Submitted 30 April, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: 15 pages

  20. arXiv:1811.10943  [pdf, other]

    cs.CV cs.GR cs.LG

    Deep Geometric Prior for Surface Reconstruction

    Authors: Francis Williams, Teseo Schneider, Claudio Silva, Denis Zorin, Joan Bruna, Daniele Panozzo

    Abstract: The reconstruction of a discrete surface from a point cloud is a fundamental geometry processing problem that has been studied for decades, with many methods developed. We propose the use of a deep neural network as a geometric prior for surface reconstruction. Specifically, we overfit a neural network representing a local chart parameterization to part of an input point cloud using the Wasserstei…

    Submitted 4 April, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

  21. Parking Sensing and Information System: Sensors, Deployment, and Evaluation

    Authors: Xiao Chen, Zhen Qian, Ram Rajagopal, Todd Stiers, Christopher Flores, Robert Kavaler, Floyd Williams III

    Abstract: This paper describes a smart parking sensing and information system that disseminates the parking availability information for public users in a cost-effective and efficient manner. The hardware framework of the system is built on advanced wireless sensor networks and cloud service over the Internet, and the system is highly scalable. The parking information provided to the users is set in the for…

    Submitted 7 December, 2017; originally announced December 2017.

    Journal ref: Chen, Xiao, et al. "Parking Sensing and Information System: Sensors, Deployment, and Evaluation." Transportation Research Record: Journal of the Transportation Research Board 2559 (2016): 81-89