Skip to main content

Showing 1–50 of 55 results for author: Rolnick, D

  1. arXiv:2407.08313  [pdf, other

    cs.LG

    Improving Molecular Modeling with Geometric GNNs: an Empirical Study

    Authors: Ali Ramlaoui, Théo Saulus, Basile Terver, Victor Schmidt, David Rolnick, Fragkiskos D. Malliaros, Alexandre Duval

    Abstract: Rapid advancements in machine learning (ML) are transforming materials science by significantly speeding up material property calculations. However, the proliferation of ML approaches has made it challenging for scientists to keep up with the most promising techniques. This paper presents an empirical study on Geometric Graph Neural Networks for 3D atomic systems, focusing on the impact of differe… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2406.13031  [pdf, other

    cs.CV

    A machine learning pipeline for automated insect monitoring

    Authors: Aditya Jain, Fagner Cunha, Michael Bunsen, Léonard Pasi, Anna Viklund, Maxim Larrivée, David Rolnick

    Abstract: Climate change and other anthropogenic factors have led to a catastrophic decline in insects, endangering both biodiversity and the ecosystem services on which human society depends. Data on insect abundance, however, remains woefully inadequate. Camera traps, conventionally used for monitoring terrestrial vertebrates, are now being modified for insects, especially moths. We describe a complete, o… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning

  3. arXiv:2406.12452  [pdf, other

    cs.CV cs.AI cs.LG

    Insect Identification in the Wild: The AMI Dataset

    Authors: Aditya Jain, Fagner Cunha, Michael James Bunsen, Juan Sebastián Cañas, Léonard Pasi, Nathan Pinoy, Flemming Helsing, JoAnne Russo, Marc Botham, Michael Sabourin, Jonathan Fréchette, Alexandre Anctil, Yacksecari Lopez, Eduardo Navarro, Filonila Perez Pimentel, Ana Cecilia Zamora, José Alejandro Ramirez Silva, Jonathan Gagnon, Tom August, Kim Bjerge, Alba Gomez Segura, Marc Bélisle, Yves Basset, Kent P. McFarland, David Roy , et al. (3 additional authors not shown)

    Abstract: Insects represent half of all global biodiversity, yet many of the world's insects are disappearing, with severe implications for ecosystems and agriculture. Despite this crisis, data on insect diversity and abundance remain woefully inadequate, due to the scarcity of human experts and the lack of scalable tools for monitoring. Ecologists have started to adopt camera traps to record and study inse… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2405.20719  [pdf, other

    cs.AI cs.CV physics.ao-ph

    Climate Variable Downscaling with Conditional Normalizing Flows

    Authors: Christina Winkler, Paula Harder, David Rolnick

    Abstract: Predictions of global climate models typically operate on coarse spatial scales due to the large computational costs of climate simulations. This has led to a considerable interest in methods for statistical downscaling, a similar process to super-resolution in the computer vision context, to provide more local and regional climate information. In this work, we apply conditional normalizing flows… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  5. arXiv:2404.06498  [pdf, other

    cs.LG stat.ML

    Simultaneous linear connectivity of neural networks modulo permutation

    Authors: Ekansh Sharma, Devin Kwok, Tom Denton, Daniel M. Roy, David Rolnick, Gintare Karolina Dziugaite

    Abstract: Neural networks typically exhibit permutation symmetries which contribute to the non-convexity of the networks' loss landscapes, since linearly interpolating between two permuted versions of a trained network tends to encounter a high loss barrier. Recent work has argued that permutation symmetries are the only sources of non-convexity, meaning there are essentially no such barriers between traine… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 11 pages, 6 figures

  6. arXiv:2403.18028  [pdf, other

    cs.LG cs.AI cs.CV q-bio.PE

    Predicting Species Occurrence Patterns from Partial Observations

    Authors: Hager Radi Abdelwahed, Mélisande Teng, David Rolnick

    Abstract: To address the interlinked biodiversity and climate crises, we need an understanding of where species occur and how these patterns are changing. However, observational data on most species remains very limited, and the amount of data available varies greatly between taxonomic groups. We introduce the problem of predicting species occurrence patterns given (a) satellite imagery, and (b) known infor… ▽ More

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Tackling Climate Change with Machine Learning workshop at ICLR 2024

  7. arXiv:2403.17381  [pdf, other

    cs.LG cs.AI

    Application-Driven Innovation in Machine Learning

    Authors: David Rolnick, Alan Aspuru-Guzik, Sara Beery, Bistra Dilkina, Priya L. Donti, Marzyeh Ghassemi, Hannah Kerner, Claire Monteleoni, Esther Rolf, Milind Tambe, Adam White

    Abstract: As applications of machine learning proliferate, innovative algorithms inspired by specific real-world challenges have become increasingly important. Such work offers the potential for significant impact not merely in domains of application but also in machine learning itself. In this paper, we describe the paradigm of application-driven research in machine learning, contrasting it with the more s… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 3 figures

  8. arXiv:2403.06634  [pdf, other

    cs.CR

    Stealing Part of a Production Language Model

    Authors: Nicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Itay Yona, Eric Wallace, David Rolnick, Florian Tramèr

    Abstract: We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the embedding projection layer (up to symmetries) of a transformer model, given typical API access. For under \… ▽ More

    Submitted 9 July, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  9. arXiv:2401.01867  [pdf, other

    cs.LG

    Dataset Difficulty and the Role of Inductive Bias

    Authors: Devin Kwok, Nikhil Anand, Jonathan Frankle, Gintare Karolina Dziugaite, David Rolnick

    Abstract: Motivated by the goals of dataset pruning and defect identification, a growing body of methods have been developed to score individual examples within a dataset. These methods, which we call "example difficulty scores", are typically used to rank or categorize examples, but the consistency of rankings between different training runs, scoring methods, and model architectures is generally unknown. T… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 10 pages, 6 figures

  10. arXiv:2312.10114  [pdf, other

    cs.CV

    FoMo-Bench: a multi-modal, multi-scale and multi-task Forest Monitoring Benchmark for remote sensing foundation models

    Authors: Nikolaos Ioannis Bountos, Arthur Ouaknine, David Rolnick

    Abstract: Forests are an essential part of Earth's ecosystems and natural systems, as well as providing services on which humanity depends, yet they are rapidly changing as a result of land use decisions and climate change. Understanding and mitigating negative effects requires parsing data on forests at global scale from a broad array of sensory modalities, and recently many such problems have been approac… ▽ More

    Submitted 27 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 26 pages

  11. arXiv:2312.02858  [pdf, other

    cs.LG cs.AI physics.ao-ph stat.ME

    Towards Causal Representations of Climate Model Data

    Authors: Julien Boussard, Chandni Nagda, Julia Kaltenborn, Charlotte Emilie Elektra Lange, Philippe Brouillard, Yaniv Gurwicz, Peer Nowack, David Rolnick

    Abstract: Climate models, such as Earth system models (ESMs), are crucial for simulating future climate change based on projected Shared Socioeconomic Pathways (SSP) greenhouse gas emissions scenarios. While ESMs are sophisticated and invaluable, machine learning-based emulators trained on existing simulation data can project additional climate scenarios much faster and are computationally efficient. Howeve… ▽ More

    Submitted 6 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  12. arXiv:2311.06958  [pdf, other

    cs.LG cs.AI

    Towards Climate Variable Prediction with Conditioned Spatio-Temporal Normalizing Flows

    Authors: Christina Winkler, David Rolnick

    Abstract: This study investigates how conditional normalizing flows can be applied to remote sensing data products in climate science for spatio-temporal prediction. The method is chosen due to its desired properties such as exact likelihood computation, predictive uncertainty estimation and efficient inference and sampling which facilitates faster exploration of climate scenarios. Experimental findings rev… ▽ More

    Submitted 31 May, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: 5 pages

  13. arXiv:2311.03721  [pdf, other

    cs.LG cs.AI cs.CE physics.ao-ph

    ClimateSet: A Large-Scale Climate Model Dataset for Machine Learning

    Authors: Julia Kaltenborn, Charlotte E. E. Lange, Venkatesh Ramesh, Philippe Brouillard, Yaniv Gurwicz, Chandni Nagda, Jakob Runge, Peer Nowack, David Rolnick

    Abstract: Climate models have been key for assessing the impact of climate change and simulating future climate scenarios. The machine learning (ML) community has taken an increased interest in supporting climate scientists' efforts on various tasks such as climate model emulation, downscaling, and prediction tasks. Many of those tasks have been addressed on datasets created with single climate models. Howe… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: To be published in the 37th Conference on Neural Information Processing Systems (NeurIPS 2023): Track on Datasets and Benchmarks. Project website: https://climateset.github.io/

  14. arXiv:2311.00936  [pdf, other

    cs.LG cs.CV q-bio.PE

    SatBird: Bird Species Distribution Modeling with Remote Sensing and Citizen Science Data

    Authors: Mélisande Teng, Amna Elmustafa, Benjamin Akera, Yoshua Bengio, Hager Radi Abdelwahed, Hugo Larochelle, David Rolnick

    Abstract: Biodiversity is declining at an unprecedented rate, impacting ecosystem services necessary to ensure food, water, and human health and well-being. Understanding the distribution of species and their habitats is crucial for conservation policy planning. However, traditional methods in ecology for species distribution models (SDMs) generally focus either on narrow sets of species or narrow geographi… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks

  15. arXiv:2311.00277  [pdf, other

    cs.CV

    OpenForest: A data catalogue for machine learning in forest monitoring

    Authors: Arthur Ouaknine, Teja Kattenborn, Etienne Laliberté, David Rolnick

    Abstract: Forests play a crucial role in Earth's system processes and provide a suite of social and economic ecosystem services, but are significantly impacted by human activities, leading to a pronounced disruption of the equilibrium within ecosystems. Advancing forest monitoring worldwide offers advantages in mitigating human impacts and enhancing our comprehension of forest composition, alongside the eff… ▽ More

    Submitted 1 November, 2023; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 43 pages, 3 figures, 9 tables. Preprint under review. The OpenForest catalogue is available at https://github.com/RolnickLab/OpenForest.git

  16. arXiv:2310.06682  [pdf, other

    cs.LG

    On the importance of catalyst-adsorbate 3D interactions for relaxed energy predictions

    Authors: Alvaro Carbonero, Alexandre Duval, Victor Schmidt, Santiago Miret, Alex Hernandez-Garcia, Yoshua Bengio, David Rolnick

    Abstract: The use of machine learning for material property prediction and discovery has traditionally centered on graph neural networks that incorporate the geometric configuration of all atoms. However, in practice not all this information may be readily available, e.g.~when evaluating the potentially unknown binding of adsorbates to catalyst. In this paper, we investigate whether it is possible to predic… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  17. arXiv:2308.01868  [pdf, other

    physics.ao-ph cs.LG

    Multi-variable Hard Physical Constraints for Climate Model Downscaling

    Authors: Jose González-Abad, Álex Hernández-García, Paula Harder, David Rolnick, José Manuel Gutiérrez

    Abstract: Global Climate Models (GCMs) are the primary tool to simulate climate evolution and assess the impacts of climate change. However, they often operate at a coarse spatial resolution that limits their accuracy in reproducing local-scale phenomena. Statistical downscaling methods leveraging deep learning offer a solution to this problem by approximating local-scale climate fields from coarse variable… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  18. arXiv:2306.06179  [pdf, other

    cs.LG math.CO math.GT

    Hidden symmetries of ReLU networks

    Authors: J. Elisenda Grigsby, Kathryn Lindsey, David Rolnick

    Abstract: The parameter space for any fixed architecture of feedforward ReLU neural networks serves as a proxy during training for the associated class of functions - but how faithful is this representation? It is known that many different parameter settings can determine the same function. Moreover, the degree of this redundancy is inhomogeneous: for some networks, the only symmetries are permutation of ne… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 27 pages, 11 figures, ICML 2023

    MSC Class: 57R70; 57Q99; 52B70; 52C35 ACM Class: I.2.6

  19. arXiv:2306.04226  [pdf, other

    cs.LG cs.CV

    Normalization Layers Are All That Sharpness-Aware Minimization Needs

    Authors: Maximilian Mueller, Tiffany Vlaar, David Rolnick, Matthias Hein

    Abstract: Sharpness-aware minimization (SAM) was proposed to reduce sharpness of minima and has been shown to enhance generalization performance in various settings. In this work we show that perturbing only the affine normalization parameters (typically comprising 0.1% of the total parameters) in the adversarial step of SAM can outperform perturbing all of the parameters.This finding generalizes to differe… ▽ More

    Submitted 17 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: camera ready version

  20. arXiv:2305.14452  [pdf, other

    cs.LG physics.ao-ph

    Fourier Neural Operators for Arbitrary Resolution Climate Data Downscaling

    Authors: Qidong Yang, Alex Hernandez-Garcia, Paula Harder, Venkatesh Ramesh, Prasanna Sattegeri, Daniela Szwarcman, Campbell D. Watson, David Rolnick

    Abstract: Climate simulations are essential in guiding our understanding of climate change and responding to its effects. However, it is computationally expensive to resolve complex climate processes at high spatial resolution. As one way to speed up climate simulations, neural networks have been used to downscale climate variables from fast-running low-resolution simulations, but high-resolution training d… ▽ More

    Submitted 30 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Presented at the ICLR 2023 workshop on "Tackling Climate Change with Machine Learning"

  21. arXiv:2305.05577  [pdf, other

    cs.LG

    FAENet: Frame Averaging Equivariant GNN for Materials Modeling

    Authors: Alexandre Duval, Victor Schmidt, Alex Hernandez Garcia, Santiago Miret, Fragkiskos D. Malliaros, Yoshua Bengio, David Rolnick

    Abstract: Applications of machine learning techniques for materials modeling typically involve functions known to be equivariant or invariant to specific symmetries. While graph neural networks (GNNs) have proven successful in such tasks, they enforce symmetries via the model architecture, which often reduces their expressivity, scalability and comprehensibility. In this paper, we introduce (1) a flexible f… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: Accepted at ICML 2023

  22. arXiv:2305.01079  [pdf, other

    cs.CV

    Bird Distribution Modelling using Remote Sensing and Citizen Science data

    Authors: Mélisande Teng, Amna Elmustafa, Benjamin Akera, Hugo Larochelle, David Rolnick

    Abstract: Climate change is a major driver of biodiversity loss, changing the geographic range and abundance of many species. However, there remain significant knowledge gaps about the distribution of species, due principally to the amount of effort and expertise required for traditional field monitoring. We propose an approach leveraging computer vision to improve species distribution modelling, combining… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Journal ref: Tackling Climate Change with Machine Learning Workshop, 11th International Conference on Learning Representations (ICLR 2023), Kigali, Rwanda

  23. arXiv:2304.14065  [pdf, other

    cs.CV cs.AI

    Lightweight, Pre-trained Transformers for Remote Sensing Timeseries

    Authors: Gabriel Tseng, Ruben Cartuyvels, Ivan Zvonkov, Mirali Purohit, David Rolnick, Hannah Kerner

    Abstract: Machine learning methods for satellite data have a range of societally relevant applications, but labels used to train models can be difficult or impossible to acquire. Self-supervision is a natural solution in settings with limited labeled data, but current self-supervised models for satellite data fail to take advantage of the characteristics of that data, including the temporal dimension (which… ▽ More

    Submitted 4 February, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

  24. arXiv:2212.07295  [pdf, other

    stat.ML cs.LG

    Maximal Initial Learning Rates in Deep ReLU Networks

    Authors: Gaurav Iyer, Boris Hanin, David Rolnick

    Abstract: Training a neural network requires choosing a suitable learning rate, which involves a trade-off between speed and effectiveness of convergence. While there has been considerable theoretical and empirical analysis of how large the learning rate can be, most prior work focuses only on late-stage training. In this work, we introduce the maximal initial learning rate $η^{\ast}$ - the largest learning… ▽ More

    Submitted 25 May, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: International Conference on Machine Learning (ICML) 2023

  25. arXiv:2211.12020  [pdf, other

    cs.LG physics.comp-ph

    PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

    Authors: Alexandre Duval, Victor Schmidt, Santiago Miret, Yoshua Bengio, Alex Hernández-García, David Rolnick

    Abstract: Mitigating the climate crisis requires a rapid transition towards lower-carbon energy. Catalyst materials play a crucial role in the electrochemical reactions involved in numerous industrial processes key to this transition, such as renewable energy storage and electrofuel synthesis. To reduce the energy spent on such activities, we must quickly discover more efficient catalysts to drive electroch… ▽ More

    Submitted 11 March, 2024; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Journal of Machine Learning Research (JMLR)

  26. arXiv:2210.13611  [pdf, other

    cs.LG cs.AI

    Understanding the Evolution of Linear Regions in Deep Reinforcement Learning

    Authors: Setareh Cohan, Nam Hee Kim, David Rolnick, Michiel van de Panne

    Abstract: Policies produced by deep reinforcement learning are typically characterised by their learning curves, but they remain poorly understood in many other respects. ReLU-based policies result in a partitioning of the input space into piecewise linear regions. We seek to understand how observed region counts and their densities evolve during deep reinforcement learning using empirical results that sp… ▽ More

    Submitted 7 November, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 camera ready

  27. arXiv:2208.11695  [pdf, other

    cs.CV cs.CY cs.LG

    Bugs in the Data: How ImageNet Misrepresents Biodiversity

    Authors: Alexandra Sasha Luccioni, David Rolnick

    Abstract: ImageNet-1k is a dataset often used for benchmarking machine learning (ML) models and evaluating tasks such as image recognition and object detection. Wild animals make up 27% of ImageNet-1k but, unlike classes representing people and objects, these data have not been closely scrutinized. In the current paper, we analyze the 13,450 images from 269 classes that represent wild animals in the ImageNe… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  28. arXiv:2208.05424  [pdf, other

    physics.ao-ph cs.LG

    Hard-Constrained Deep Learning for Climate Downscaling

    Authors: Paula Harder, Alex Hernandez-Garcia, Venkatesh Ramesh, Qidong Yang, Prasanna Sattigeri, Daniela Szwarcman, Campbell Watson, David Rolnick

    Abstract: The availability of reliable, high-resolution climate and weather data is important to inform long-term decisions on climate adaptation and mitigation and to guide rapid responses to extreme events. Forecasting models are limited by computational costs and, therefore, often generate coarse-resolution predictions. Statistical downscaling, including super-resolution methods from deep learning, can p… ▽ More

    Submitted 29 February, 2024; v1 submitted 8 August, 2022; originally announced August 2022.

  29. arXiv:2206.10999  [pdf, other

    cs.LG cs.NE

    Neural Networks as Paths through the Space of Representations

    Authors: Richard D. Lange, Devin Kwok, Jordan Matelsky, Xinyue Wang, David S. Rolnick, Konrad P. Kording

    Abstract: Deep neural networks implement a sequence of layer-by-layer operations that are each relatively easy to understand, but the resulting overall computation is generally difficult to understand. We consider a simple hypothesis for interpreting the layer-by-layer construction of useful representations: perhaps the role of each layer is to reformat information to reduce the "distance" to the desired ou… ▽ More

    Submitted 27 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 10 pages, submitted to ICLR 2023

  30. arXiv:2206.05056  [pdf, other

    cs.NE cs.AI cs.LG

    On Neural Architecture Inductive Biases for Relational Tasks

    Authors: Giancarlo Kerg, Sarthak Mittal, David Rolnick, Yoshua Bengio, Blake Richards, Guillaume Lajoie

    Abstract: Current deep learning approaches have shown good in-distribution generalization performance, but struggle with out-of-distribution generalization. This is especially true in the case of tasks involving abstract relations like recognizing rules in sequences, as we find in many intelligence tests. Recent work has explored how forcing relational representations to remain distinct from sensory represe… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  31. arXiv:2203.11815  [pdf, other

    cs.LG cs.NE stat.ML

    Clustering units in neural networks: upstream vs downstream information

    Authors: Richard D. Lange, David S. Rolnick, Konrad P. Kording

    Abstract: It has been hypothesized that some form of "modular" structure in artificial neural networks should be useful for learning, compositionality, and generalization. However, defining and quantifying modularity remains an open problem. We cast the problem of detecting functional modules into the problem of detecting clusters of similar-functioning units. This begs the question of what makes two units… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 12 main text pages, 4 main figures, 5 supplemental figures. Will be submitted to TMLR

    Journal ref: TMLR June (2022)

  32. arXiv:2202.02124  [pdf, other

    cs.LG

    TIML: Task-Informed Meta-Learning for Agriculture

    Authors: Gabriel Tseng, Hannah Kerner, David Rolnick

    Abstract: Labeled datasets for agriculture are extremely spatially imbalanced. When developing algorithms for data-sparse regions, a natural approach is to use transfer learning from data-rich regions. While standard transfer learning approaches typically leverage only direct inputs and outputs, geospatial imagery and agricultural data are rich in metadata that can inform transfer learning algorithms, such… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: 12 pages, 4 figures

  33. arXiv:2111.14671  [pdf, other

    cs.LG physics.ao-ph stat.ML

    ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models

    Authors: Salva Rühling Cachay, Venkatesh Ramesh, Jason N. S. Cole, Howard Barker, David Rolnick

    Abstract: Numerical simulations of Earth's weather and climate require substantial amounts of computation. This has led to a growing interest in replacing subroutines that explicitly compute physical processes with approximate machine learning (ML) methods that are fast at inference time. Within weather and climate models, atmospheric radiative transfer (RT) calculations are especially expensive. This has m… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  34. arXiv:2106.11072  [pdf, other

    cs.AI cs.LG stat.ML

    Techniques for Symbol Grounding with SATNet

    Authors: Sever Topan, David Rolnick, Xujie Si

    Abstract: Many experts argue that the future of artificial intelligence is limited by the field's ability to integrate symbolic logical reasoning into deep learning architectures. The recently proposed differentiable MAXSAT solver, SATNet, was a breakthrough in its capacity to integrate with a traditional neural network and solve visual reasoning problems. For instance, it can learn the rules of Sudoku pure… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Code available at https://github.com/SeverTopan/SATNet

  35. arXiv:2104.12225  [pdf, other

    cs.LG math.OC stat.ML

    DC3: A learning method for optimization with hard constraints

    Authors: Priya L. Donti, David Rolnick, J. Zico Kolter

    Abstract: Large optimization problems with hard constraints arise in many settings, yet classical solvers are often prohibitively slow, motivating the use of deep networks as cheap "approximate solvers." Unfortunately, naive deep learning approaches typically cannot enforce the hard constraints of such problems, leading to infeasible solutions. In this work, we present Deep Constraint Completion and Correct… ▽ More

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: In ICLR 2021. Code available at https://github.com/locuslab/DC3

    Journal ref: International Conference on Learning Representations 2021

  36. arXiv:2103.11285  [pdf, other

    cs.CV cs.AI cs.LG

    Geo-Spatiotemporal Features and Shape-Based Prior Knowledge for Fine-grained Imbalanced Data Classification

    Authors: Charles A. Kantor, Marta Skreta, Brice Rauby, Léonard Boussioux, Emmanuel Jehanno, Alexandra Luccioni, David Rolnick, Hugues Talbot

    Abstract: Fine-grained classification aims at distinguishing between items with similar global perception and patterns, but that differ by minute details. Our primary challenges come from both small inter-class variations and large intra-class variations. In this article, we propose to combine several innovations to improve fine-grained classification within the use-case of wildlife, which is of practical i… ▽ More

    Submitted 20 March, 2021; originally announced March 2021.

    Comments: Copyright by the authors. All rights reserved to authors only. Correspondence to: ckantor (at) stanford [dot] edu

    Journal ref: Proc. IJCAI 2021, Workshop on AI for Social Good, Harvard University (2021)

  37. arXiv:2102.10492  [pdf, other

    stat.ML cs.LG

    Deep ReLU Networks Preserve Expected Length

    Authors: Boris Hanin, Ryan Jeong, David Rolnick

    Abstract: Assessing the complexity of functions computed by a neural network helps us understand how the network will learn and generalize. One natural measure of complexity is how the network distorts length - if the network takes a unit-length curve as input, what is the length of the resulting curve of outputs? It has been widely believed that this length grows exponentially in network depth. We prove th… ▽ More

    Submitted 22 June, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: 18 pages, 4 figures

  38. arXiv:1910.00744  [pdf, other

    cs.LG stat.ML

    Reverse-Engineering Deep ReLU Networks

    Authors: David Rolnick, Konrad P. Kording

    Abstract: It has been widely assumed that a neural network cannot be recovered from its outputs, as the network depends on its parameters in a highly nonlinear way. Here, we prove that in fact it is often possible to identify the architecture, weights, and biases of an unknown deep ReLU network by observing only its output. Every ReLU network defines a piecewise linear function, where the boundaries between… ▽ More

    Submitted 22 February, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 15 pages, 4 figures

  39. arXiv:1906.05433  [pdf, other

    cs.CY cs.AI cs.LG stat.ML

    Tackling Climate Change with Machine Learning

    Authors: David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

    Abstract: Climate change is one of the greatest challenges facing humanity, and we, as machine learning experts, may wonder how we can help. Here we describe how machine learning can be a powerful tool in reducing greenhouse gas emissions and helping society adapt to a changing climate. From smart grids to disaster management, we identify high impact problems where existing gaps can be filled by machine lea… ▽ More

    Submitted 5 November, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: For additional resources, please visit the website that accompanies this paper: https://www.climatechange.ai/

  40. arXiv:1906.00904  [pdf, other

    stat.ML cs.LG math.ST

    Deep ReLU Networks Have Surprisingly Few Activation Patterns

    Authors: Boris Hanin, David Rolnick

    Abstract: The success of deep networks has been attributed in part to their expressivity: per parameter, deep networks can approximate a richer class of functions than shallow networks. In ReLU networks, the number of activation patterns is one measure of expressivity; and the maximum number of patterns grows exponentially with the depth. However, recent work has showed that the practical expressivity of de… ▽ More

    Submitted 20 October, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 18 page, 7 figures

    Journal ref: NeurIPS 2019

  41. arXiv:1901.09021  [pdf, other

    stat.ML cs.LG math.PR

    Complexity of Linear Regions in Deep Networks

    Authors: Boris Hanin, David Rolnick

    Abstract: It is well-known that the expressivity of a neural network depends on its architecture, with deeper networks expressing more complex functions. In the case of networks that compute piecewise linear functions, such as those with ReLU activation, the number of distinct linear regions is a natural measure of expressivity. It is possible to construct networks with merely a single region, or for which… ▽ More

    Submitted 11 June, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: ICML 2019

  42. arXiv:1812.01157  [pdf, other

    cs.CV

    Cross-Classification Clustering: An Efficient Multi-Object Tracking Technique for 3-D Instance Segmentation in Connectomics

    Authors: Yaron Meirovitch, Lu Mi, Hayk Saribekyan, Alexander Matveev, David Rolnick, Nir Shavit

    Abstract: Pixel-accurate tracking of objects is a key element in many computer vision applications, often solved by iterated individual object tracking or instance segmentation followed by object matching. Here we introduce cross-classification clustering (3C), a technique that simultaneously tracks complex, interrelated objects in an image stack. The key idea in cross-classification is to efficiently turn… ▽ More

    Submitted 15 June, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: 11 figures

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 8425-8435

  43. arXiv:1811.11682  [pdf, other

    cs.LG cs.AI stat.ML

    Experience Replay for Continual Learning

    Authors: David Rolnick, Arun Ahuja, Jonathan Schwarz, Timothy P. Lillicrap, Greg Wayne

    Abstract: Continual learning is the problem of learning new tasks or knowledge while protecting old knowledge and ideally generalizing from old experience to learn new tasks faster. Neural networks trained by stochastic gradient descent often degrade on old tasks when trained successively on new tasks with different data distributions. This phenomenon, referred to as catastrophic forgetting, is considered a… ▽ More

    Submitted 26 November, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: NeurIPS 2019

  44. arXiv:1805.08289  [pdf, other

    cs.NE cs.LG stat.ML

    Measuring and regularizing networks in function space

    Authors: Ari S. Benjamin, David Rolnick, Konrad Kording

    Abstract: To optimize a neural network one often thinks of optimizing its parameters, but it is ultimately a matter of optimizing the function that maps inputs to outputs. Since a change in the parameters might serve as a poor proxy for the change in the function, it is of some concern that primacy is given to parameters but that the correspondence has not been tested. Here, we show that it is simple and co… ▽ More

    Submitted 26 June, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: Presented at ICLR 2019

    Journal ref: International Conference on Learning Representations, 2019, https://openreview.net/pdf?id=SkMwpiR9Y7

  45. arXiv:1803.01719  [pdf, other

    stat.ML cs.LG

    How to Start Training: The Effect of Initialization and Architecture

    Authors: Boris Hanin, David Rolnick

    Abstract: We identify and study two common failure modes for early training in deep ReLU nets. For each we give a rigorous proof of when it occurs and how to avoid it, for fully connected and residual architectures. The first failure mode, exploding/vanishing mean activation length, can be avoided by initializing weights from a symmetric distribution with variance 2/fan-in and, for ResNets, by correctly wei… ▽ More

    Submitted 13 November, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: Final Version, 16p, Accepted NIPS 2018

  46. arXiv:1705.10882  [pdf, other

    cs.CV cs.AI q-bio.NC stat.ML

    Morphological Error Detection in 3D Segmentations

    Authors: David Rolnick, Yaron Meirovitch, Toufiq Parag, Hanspeter Pfister, Viren Jain, Jeff W. Lichtman, Edward S. Boyden, Nir Shavit

    Abstract: Deep learning algorithms for connectomics rely upon localized classification, rather than overall morphology. This leads to a high incidence of erroneously merged objects. Humans, by contrast, can easily detect such errors by acquiring intuition for the correct morphology of objects. Biological neurons have complicated and variable shapes, which are challenging to learn, and merge errors take a mu… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

    Comments: 13 pages, 6 figures

  47. arXiv:1705.10694  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Deep Learning is Robust to Massive Label Noise

    Authors: David Rolnick, Andreas Veit, Serge Belongie, Nir Shavit

    Abstract: Deep neural networks trained on large supervised datasets have led to impressive results in image classification and other tasks. However, well-annotated datasets can be time-consuming and expensive to collect, lending increased interest to larger but noisy datasets that are more easily obtained. In this paper, we show that deep neural networks are capable of generalizing from training data for wh… ▽ More

    Submitted 26 February, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

  48. arXiv:1705.05502  [pdf, other

    cs.LG cs.NE stat.ML

    The power of deeper networks for expressing natural functions

    Authors: David Rolnick, Max Tegmark

    Abstract: It is well-known that neural networks are universal approximators, but that deeper networks tend in practice to be more powerful than shallower ones. We shed light on this by proving that the total number of neurons $m$ required to approximate natural classes of multivariate polynomials of $n$ variables grows only linearly with $n$ for deep neural networks, but grows exponentially when merely a si… ▽ More

    Submitted 27 April, 2018; v1 submitted 15 May, 2017; originally announced May 2017.

    Comments: Replaced to match version published at ICLR 2018. 14 pages, 2 figs

  49. arXiv:1612.02120  [pdf, other

    q-bio.QM cs.AI q-bio.NC

    A Multi-Pass Approach to Large-Scale Connectomics

    Authors: Yaron Meirovitch, Alexander Matveev, Hayk Saribekyan, David Budden, David Rolnick, Gergely Odor, Seymour Knowles-Barley, Thouis Raymond Jones, Hanspeter Pfister, Jeff William Lichtman, Nir Shavit

    Abstract: The field of connectomics faces unprecedented "big data" challenges. To reconstruct neuronal connectivity, automated pixel-level segmentation is required for petabytes of streaming electron microscopy data. Existing algorithms provide relatively good accuracy but are unacceptably slow, and would require years to extract connectivity graphs from even a single cubic millimeter of neural tissue. Here… ▽ More

    Submitted 7 December, 2016; originally announced December 2016.

    Comments: 18 pages, 10 figures

  50. arXiv:1611.03780  [pdf, other

    cs.SI cs.DS

    Randomized Experimental Design via Geographic Clustering

    Authors: David Rolnick, Kevin Aydin, Jean Pouget-Abadie, Shahab Kamali, Vahab Mirrokni, Amir Najmi

    Abstract: Web-based services often run randomized experiments to improve their products. A popular way to run these experiments is to use geographical regions as units of experimentation, since this does not require tracking of individual users or browser cookies. Since users may issue queries from multiple geographical locations, geo-regions cannot be considered independent and interference may be present… ▽ More

    Submitted 15 February, 2019; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: 10 pages