Showing 1–28 of 28 results for author: Pearlmutter, B A

  1. arXiv:2407.03847  [pdf, other]

    cs.LO

    Comparing Differentiable Logics for Learning with Logical Constraints

    Authors: Thomas Flinkow, Barak A. Pearlmutter, Rosemary Monahan

    Abstract: Extensive research on formal verification of machine learning systems indicates that learning from data alone often fails to capture underlying background knowledge such as specifications implicitly available in the data. Various neural network verifiers have been developed to ensure that a machine-learnt model satisfies correctness and safety properties, however, they typically assume a trained n…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 20 pages, 8 figures. Submitted to Science of Computer Programming

  2. arXiv:2402.06751  [pdf, other]

    cs.LG

    Low-Rank Learning by Design: the Role of Network Architecture and Activation Linearity in Gradient Rank Collapse

    Authors: Bradley T. Baker, Barak A. Pearlmutter, Robyn Miller, Vince D. Calhoun, Sergey M. Plis

    Abstract: Our understanding of learning dynamics of deep neural networks (DNNs) remains incomplete. Recent research has begun to uncover the mathematical principles underlying these networks, including the phenomenon of "Neural Collapse", where linear classifiers within DNNs converge to specific geometrical structures during late-stage training. However, the role of geometric constraints in learning extends…

    Submitted 9 February, 2024; originally announced February 2024.

  3. arXiv:2311.09809  [pdf, other]

    cs.LO cs.AI cs.LG

    Comparing Differentiable Logics for Learning Systems: A Research Preview

    Authors: Thomas Flinkow, Barak A. Pearlmutter, Rosemary Monahan

    Abstract: Extensive research on formal verification of machine learning (ML) systems indicates that learning from data alone often fails to capture underlying background knowledge. A variety of verifiers have been developed to ensure that a machine-learnt model satisfies correctness and safety properties, however, these verifiers typically assume a trained network with fixed weights. ML-enabled autonomous s…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: In Proceedings FMAS 2023, arXiv:2311.08987

    Journal ref: EPTCS 395, 2023, pp. 17-29

  4. Visualization of AI Systems in Virtual Reality: A Comprehensive Review

    Authors: Medet Inkarbekov, Rosemary Monahan, Barak A. Pearlmutter

    Abstract: This study provides a comprehensive review of the utilization of Virtual Reality (VR) for visualizing Artificial Intelligence (AI) systems, drawing on 18 selected studies. The results illuminate a complex interplay of tools, methods, and approaches, notably the prominence of VR engines like Unreal Engine and Unity. However, despite these tools, a universal solution for effective AI visualization r…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 19 pages

  5. arXiv:2202.08587  [pdf, other]

    cs.LG stat.ML

    Gradients without Backpropagation

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Don Syme, Frank Wood, Philip Torr

    Abstract: Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning. Backpropagation, or reverse-mode differentiation, is a special case within the general family of automatic differentiation algorithms that also includes the forward mode. We present a method to compute gradients based solely on the directional derivative that one can comp…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 10 pages, 6 figures

    MSC Class: 68T07 ACM Class: I.2.6; I.2.5

  6. arXiv:2111.00343  [pdf, other]

    cs.LG

    Continuous Convolutional Neural Networks: Coupled Neural PDE and ODE

    Authors: Mansura Habiba, Barak A. Pearlmutter

    Abstract: Recent work in deep learning focuses on solving physical systems expressed as Ordinary Differential Equations or Partial Differential Equations. This work proposes a variant of Convolutional Neural Networks (CNNs) that can learn the hidden dynamics of a physical system using ordinary differential equation systems (ODEs) and partial differential equation systems (PDEs). Instead of considerin…

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Proc. of the International Conference on Electrical, Computer and Energy Technologies (ICECET)

  7. arXiv:2111.00326  [pdf, other]

    cs.LG

    Neural Network based on Automatic Differentiation Transformation of Numeric Iterate-to-Fixedpoint

    Authors: Mansura Habiba, Barak A. Pearlmutter

    Abstract: This work proposes a Neural Network model that can control its depth using an iterate-to-fixed-point operator. The architecture starts with a standard layered Network but with added connections from later to earlier layers, along with a gate to make them inactive under most circumstances. These "temporal wormhole" connections create a shortcut that allows the Neural Network to use the in…

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Proc. of the International Conference on Electrical, Computer and Energy Technologies (ICECET)

  8. arXiv:2111.00314  [pdf, other]

    cs.LG

    ECG synthesis with Neural ODE and GAN models

    Authors: Mansura Habiba, Eoin Brophy, Barak A. Pearlmutter, Tomas Ward

    Abstract: Continuous medical time series data such as ECG is one of the most complex time series due to its dynamic and high dimensional characteristics. In addition, due to its sensitive nature, privacy concerns and legal restrictions, it is often difficult to use actual data for medical research. As a result, generating continuous medical time series is a very critical research area. Several…

    Submitted 6 June, 2022; v1 submitted 30 October, 2021; originally announced November 2021.

    Comments: Proc. of the International Conference on Electrical, Computer and Energy Technologies (ICECET), 9-10 December 2021, Cape Town-South Africa

  9. arXiv:2105.06168  [pdf, other]

    cs.LG cs.NE

    HeunNet: Extending ResNet using Heun's Methods

    Authors: Mehrdad Maleki, Mansura Habiba, Barak A. Pearlmutter

    Abstract: There is an analogy between the ResNet (Residual Network) architecture for deep neural networks and an Euler solver for an ODE. The transformation performed by each layer resembles an Euler step in solving an ODE. We consider the Heun Method, which involves a single predictor-corrector cycle, and complete the analogy, building a predictor-corrector variant of ResNet, which we call a HeunNet. Just…

    Submitted 14 May, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: Irish Signals & Systems Conference 2021

  10. arXiv:2005.10693  [pdf, other]

    cs.LG stat.ML

    Neural ODEs for Informative Missingness in Multivariate Time Series

    Authors: Mansura Habiba, Barak A. Pearlmutter

    Abstract: Informative missingness is unavoidable in the digital processing of continuous time series, where the values for one or more observations at different time points are missing. Such missing observations are one of the major limitations of time series processing using deep learning. Practical applications, e.g., sensor data, healthcare, weather, generate data that is in truth continuous in time, and…

    Submitted 19 May, 2020; originally announced May 2020.

  11. arXiv:2005.09807  [pdf, other]

    cs.LG stat.ML

    Neural Ordinary Differential Equation based Recurrent Neural Network Model

    Authors: Mansura Habiba, Barak A. Pearlmutter

    Abstract: Neural differential equations are a promising new member in the neural network family. They show the potential of differential equations for time series data analysis. In this paper, the strength of the ordinary differential equation (ODE) is explored with a new extension. The main goal of this work is to answer the following questions: (i) can ODE be used to redefine the existing neural network m…

    Submitted 19 May, 2020; originally announced May 2020.

  12. arXiv:1911.03028  [pdf, other]

    cs.DC cs.DS

    Lock-Free Hopscotch Hashing

    Authors: Robert Kelly, Barak A. Pearlmutter, Phil Maguire

    Abstract: In this paper we present a lock-free version of Hopscotch Hashing. Hopscotch Hashing is an open addressing algorithm originally proposed by Herlihy, Shavit, and Tzafrir, which is known for fast performance and excellent cache locality. The algorithm allows users of the table to skip or jump over irrelevant entries, allowing quick search, insertion, and removal of entries. Unlike traditional linear…

    Submitted 7 November, 2019; originally announced November 2019.

    Comments: 15 pages, to appear in APOCS20

  13. arXiv:1809.04339  [pdf, other]

    cs.DC

    Concurrent Robin Hood Hashing

    Authors: Robert Kelly, Barak A. Pearlmutter, Phil Maguire

    Abstract: In this paper we examine the issues involved in adding concurrency to the Robin Hood hash table algorithm. We present a non-blocking obstruction-free K-CAS Robin Hood algorithm which requires only a single word compare-and-swap primitive, thus making it highly portable. The implementation maintains the attractive properties of the original Robin Hood structure, such as a low expected probe length,…

    Submitted 14 November, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: 16 pages, 12 figures

  14. Divide-and-Conquer Checkpointing for Arbitrary Programs with No User Annotation

    Authors: Jeffrey Mark Siskind, Barak A. Pearlmutter

    Abstract: Classical reverse-mode automatic differentiation (AD) imposes only a small constant-factor overhead in operation count over the original computation, but has storage requirements that grow, in the worst case, in proportion to the time consumed by the original computation. This storage blowup can be ameliorated by checkpointing, a process that reorders application of classical reverse-mode AD over…

    Submitted 29 March, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    MSC Class: 68N20; 68N18; 65F50; 65D25; 46G05; 58C20

    Journal ref: Optimization Methods and Software 33(04-06):1288-1330, 2018

  15. arXiv:1611.03777  [pdf, ps, other]

    cs.LG stat.ML

    Tricks from Deep Learning

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: The deep learning community has devised a diverse set of methods to make gradient optimization, using large datasets, of large and highly complex models with deeply cascaded nonlinearities, practical. Taken as a whole, these methods constitute a breakthrough, allowing computational structures which are quite wide, very deep, and with an enormous number and variety of free parameters to be effectiv…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  16. arXiv:1611.03429  [pdf, ps, other]

    cs.PL cs.LO

    Evolving the Incremental λ Calculus into a Model of Forward Automatic Differentiation (AD)

    Authors: Robert Kelly, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: Formal transformations somehow resembling the usual derivative are surprisingly common in computer science, with two notable examples being derivatives of regular expressions and derivatives of types. A newcomer to this list is the incremental λ-calculus, or ILC, a "theory of changes" that deploys a formal apparatus allowing the automatic generation of efficient update functions which perform in…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  17. arXiv:1611.03423  [pdf, ps, other]

    cs.MS cs.LG

    DiffSharp: An AD Library for .NET Languages

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: DiffSharp is an algorithmic differentiation or automatic differentiation (AD) library for the .NET ecosystem, which is targeted by the C# and F# languages, among others. The library has been designed with machine learning applications in mind, allowing very succinct implementations of models and optimization routines. DiffSharp is implemented in F# and exposes forward and reverse AD operators as g…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  18. arXiv:1611.03416  [pdf, ps, other]

    cs.PL cs.MS

    Efficient Implementation of a Higher-Order Language with Built-In AD

    Authors: Jeffrey Mark Siskind, Barak A. Pearlmutter

    Abstract: We show that Automatic Differentiation (AD) operators can be provided in a dynamic language without sacrificing numeric performance. To achieve this, general forward and reverse AD functions are added to a simple high-level dynamic language, and support for them is included in an aggressive optimizing compiler. Novel technical mechanisms are discussed, which have the ability to migrate the AD tran…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  19. arXiv:1611.03410  [pdf, other]

    cs.PL cs.LG cs.MS

    Binomial Checkpointing for Arbitrary Programs with No User Annotation

    Authors: Jeffrey Mark Siskind, Barak A. Pearlmutter

    Abstract: Heretofore, automatic checkpointing at procedure-call boundaries, to reduce the space complexity of reverse mode, has been provided by systems like Tapenade. However, binomial checkpointing, or treeverse, has only been provided in Automatic Differentiation (AD) systems in special cases, e.g., through user-provided pragmas on DO loops in Tapenade, or as the nested taping mechanism in adol-c for tim…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  20. arXiv:1511.07727  [pdf, ps, other]

    cs.MS

    DiffSharp: Automatic Differentiation Library

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: In this paper we introduce DiffSharp, an automatic differentiation (AD) library designed with machine learning in mind. AD is a family of techniques that evaluate derivatives at machine precision with only a small constant factor of overhead, by systematically applying the chain rule of calculus at the elementary operator level. DiffSharp aims to make an extensive array of AD techniques available,…

    Submitted 26 November, 2015; v1 submitted 24 November, 2015; originally announced November 2015.

    Comments: 5 pages, 1 figure, minor fixes, added coauthor

    MSC Class: 68T05; 68W30 ACM Class: I.2.6; G.1.4

  21. arXiv:1502.05767  [pdf, ps, other]

    cs.SC cs.LG stat.ML

    Automatic differentiation in machine learning: a survey

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter, Alexey Andreyevich Radul, Jeffrey Mark Siskind

    Abstract: Derivatives, mostly in the form of gradients and Hessians, are ubiquitous in machine learning. Automatic differentiation (AD), also called algorithmic differentiation or simply "autodiff", is a family of techniques similar to but more general than backpropagation for efficiently and accurately evaluating derivatives of numeric functions expressed as computer programs. AD is a small but established…

    Submitted 5 February, 2018; v1 submitted 19 February, 2015; originally announced February 2015.

    Comments: 43 pages, 5 figures

    MSC Class: 68W30; 65D25; 68T05 ACM Class: G.1.4; I.2.6

    Journal ref: Atilim Gunes Baydin, Barak A. Pearlmutter, Alexey Andreyevich Radul, Jeffrey Mark Siskind. Automatic differentiation in machine learning: a survey. The Journal of Machine Learning Research, 18(153):1--43, 2018

  22. arXiv:1409.7316  [pdf, other]

    cs.DL cs.MS

    An Analysis of Publication Venues for Automatic Differentiation Research

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter

    Abstract: We present the results of our analysis of publication venues for papers on automatic differentiation (AD), covering academic journals and conference proceedings. Our data are collected from the AD publications database maintained by the autodiff.org community website. The database is purpose-built for the AD field and is expanding via submissions by AD researchers. Therefore, it provides a relativ…

    Submitted 25 September, 2014; originally announced September 2014.

    Comments: 6 pages, 3 figures

    MSC Class: 00A15

  23. arXiv:1404.7456  [pdf, other]

    cs.LG cs.SC stat.ML

    Automatic Differentiation of Algorithms for Machine Learning

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter

    Abstract: Automatic differentiation---the mechanical transformation of numeric computer programs to calculate derivatives efficiently and accurately---dates to the origin of the computer age. Reverse mode automatic differentiation both antedates and generalizes the method of backwards propagation of errors used in machine learning. Despite this, practitioners in a variety of fields, including machine learni…

    Submitted 28 April, 2014; originally announced April 2014.

    Comments: 7 pages, 1 figure

    MSC Class: 68W30; 65D25; 68T05 ACM Class: G.1.4; I.2.6

  24. arXiv:1301.3527  [pdf, other]

    cs.LG math.NA

    Block Coordinate Descent for Sparse NMF

    Authors: Vamsi K. Potluru, Sergey M. Plis, Jonathan Le Roux, Barak A. Pearlmutter, Vince D. Calhoun, Thomas P. Hayes

    Abstract: Nonnegative matrix factorization (NMF) has become a ubiquitous tool for data analysis. An important variant is the sparse NMF problem which arises when we explicitly require the learnt features to be sparse. A natural measure of sparsity is the L$_0$ norm, however its optimization is NP-hard. Mixed norms, such as L$_1$/L$_2$ measure, have been shown to model sparsity robustly, based on intuitive a…

    Submitted 18 March, 2013; v1 submitted 15 January, 2013; originally announced January 2013.

  25. arXiv:1211.4892  [pdf, ps, other]

    cs.SC cs.MS math.DG

    Confusion of Tagged Perturbations in Forward Automatic Differentiation of Higher-Order Functions

    Authors: Oleksandr Manzyuk, Barak A. Pearlmutter, Alexey Andreyevich Radul, David R. Rush, Jeffrey Mark Siskind

    Abstract: Forward Automatic Differentiation (AD) is a technique for augmenting programs to compute derivatives. The essence of Forward AD is to attach perturbations to each number, and propagate these through the computation. When derivatives are nested, the distinct derivative calculations, and their associated perturbations, must be distinguished. This is typically accomplished by creating a unique tag fo…

    Submitted 29 June, 2019; v1 submitted 20 November, 2012; originally announced November 2012.

  26. arXiv:1203.1450  [pdf, ps, other]

    cs.PL cs.MS math.NA

    AD in Fortran, Part 2: Implementation via Prepreprocessor

    Authors: Alexey Radul, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: We describe an implementation of the Farfel Fortran AD extensions. These extensions integrate forward and reverse AD directly into the programming model, with attendant benefits to flexibility, modularity, and ease of use. The implementation we describe is a "prepreprocessor" that generates input to existing Fortran-based AD tools. In essence, blocks of code which are targeted for AD by Farfel con…

    Submitted 8 March, 2012; v1 submitted 7 March, 2012; originally announced March 2012.

    Journal ref: Recent Advances in Algorithmic Differentiation, Springer Lecture Notes in Computational Science and Engineering volume 87, 2012, ISBN 978-3-642-30022-6, pages 273-284

  27. arXiv:1203.1448  [pdf, ps, other]

    cs.PL cs.MS math.NA

    AD in Fortran, Part 1: Design

    Authors: Alexey Radul, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: We propose extensions to Fortran which integrate forward and reverse Automatic Differentiation (AD) directly into the programming model. Irrespective of implementation technology, embedding AD constructs directly into the language extends the reach and convenience of AD while allowing abstraction of concepts of interest to scientific-computing practice, such as root finding, optimization, and find…

    Submitted 8 March, 2012; v1 submitted 7 March, 2012; originally announced March 2012.

  28. arXiv:cs/0511088  [pdf, ps, other]

    cs.LG

    Bounds on Query Convergence

    Authors: Barak A. Pearlmutter

    Abstract: The problem of finding an optimum using noisy evaluations of a smooth cost function arises in many contexts, including economics, business, medicine, experiment design, and foraging theory. We derive an asymptotic bound E[ (x_t - x*)^2 ] >= O(1/sqrt(t)) on the rate of convergence of a sequence (x_0, x_1, ...) generated by an unbiased feedback process observing noisy evaluations of an unknown qu…

    Submitted 25 November, 2005; originally announced November 2005.

    Comments: 6 pages, 2 figures

    ACM Class: G.1.6