Skip to main content

Showing 1–13 of 13 results for author: Hagemann, P

  1. arXiv:2403.18705  [pdf, other

    cs.LG math.OC

    Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching

    Authors: Jannis Chemseddine, Paul Hagemann, Gabriele Steidl, Christian Wald

    Abstract: In inverse problems, many conditional generative models approximate the posterior measure by minimizing a distance between the joint measure and its learned approximation. While this approach also controls the distance between the posterior measures in the case of the Kullback--Leibler divergence, this is in general not hold true for the Wasserstein distance. In this paper, we introduce a conditio… ▽ More

    Submitted 5 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: This paper supersedes arXiv:2310.13433

  2. arXiv:2402.02964  [pdf, other

    cs.LG physics.data-an

    Mixed Noise and Posterior Estimation with Conditional DeepGEM

    Authors: Paul Hagemann, Johannes Hertrich, Maren Casfor, Sebastian Heidenreich, Gabriele Steidl

    Abstract: Motivated by indirect measurements and applications from nanometrology with a mixed noise model, we develop a novel algorithm for jointly estimating the posterior and the noise parameters in Bayesian inverse problems. We propose to solve the problem by an expectation maximization (EM) algorithm. Based on the current noise parameters, we learn in the E-step a conditional normalizing flow that appro… ▽ More

    Submitted 5 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Published in Machine Learning: Science and Technology

    Journal ref: Machine Learning: Science and Technology, Volume 5, Number 3, 2024

  3. arXiv:2312.16611  [pdf, other

    cs.CV cs.LG eess.IV math.PR

    Learning from small data sets: Patch-based regularizers in inverse problems for image reconstruction

    Authors: Moritz Piening, Fabian Altekrüger, Johannes Hertrich, Paul Hagemann, Andrea Walther, Gabriele Steidl

    Abstract: The solution of inverse problems is of fundamental interest in medical and astronomical imaging, geophysics as well as engineering and life sciences. Recent advances were made by using methods from machine learning, in particular deep neural networks. Most of these methods require a huge amount of (paired) data and computer capacity to train the networks, which often may not be available. Our pape… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  4. arXiv:2310.13433  [pdf, other

    cs.LG math.ST stat.ML

    Y-Diagonal Couplings: Approximating Posteriors with Conditional Wasserstein Distances

    Authors: Jannis Chemseddine, Paul Hagemann, Christian Wald

    Abstract: In inverse problems, many conditional generative models approximate the posterior measure by minimizing a distance between the joint measure and its learned approximation. While this approach also controls the distance between the posterior measures in the case of the Kullback Leibler divergence, it does not hold true for the Wasserstein distance. We will introduce a conditional Wasserstein distan… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 26 pages, 9 figures

  5. arXiv:2310.03054  [pdf, other

    stat.ML cs.LG math.OC math.PR

    Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel

    Authors: Paul Hagemann, Johannes Hertrich, Fabian Altekrüger, Robert Beinert, Jannis Chemseddine, Gabriele Steidl

    Abstract: We propose conditional flows of the maximum mean discrepancy (MMD) with the negative distance kernel for posterior sampling and conditional generative modeling. This MMD, which is also known as energy distance, has several advantageous properties like efficient computation via slicing and sorting. We approximate the joint distribution of the ground truth and the observations using discrete Wassers… ▽ More

    Submitted 21 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at ICLR 2024

  6. arXiv:2305.11463  [pdf, other

    cs.LG math.PR stat.ML

    Generative Sliced MMD Flows with Riesz Kernels

    Authors: Johannes Hertrich, Christian Wald, Fabian Altekrüger, Paul Hagemann

    Abstract: Maximum mean discrepancy (MMD) flows suffer from high computational costs in large scale computations. In this paper, we show that MMD flows with Riesz kernels $K(x,y) = - \|x-y\|^r$, $r \in (0,2)$ have exceptional properties which allow their efficient computation. We prove that the MMD of Riesz kernels, which is also known as energy distance, coincides with the MMD of their sliced version. As a… ▽ More

    Submitted 20 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at ICLR 2024

  7. arXiv:2303.15845  [pdf, other

    cs.LG math.ST

    Conditional Generative Models are Provably Robust: Pointwise Guarantees for Bayesian Inverse Problems

    Authors: Fabian Altekrüger, Paul Hagemann, Gabriele Steidl

    Abstract: Conditional generative models became a very powerful tool to sample from Bayesian inverse problem posteriors. It is well-known in classical Bayesian literature that posterior measures are quite robust with respect to perturbations of both the prior measure and the negative log-likelihood, which includes perturbations of the observations. However, to the best of our knowledge, the robustness of con… ▽ More

    Submitted 23 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted and published in Transactions on Machine Learning Research (07/2023)

    Journal ref: Transactions on Machine Learning Research (TMLR), 2023

  8. arXiv:2303.04772  [pdf, other

    cs.LG cs.CV math.PR stat.ML

    Multilevel Diffusion: Infinite Dimensional Score-Based Diffusion Models for Image Generation

    Authors: Paul Hagemann, Sophie Mildenberger, Lars Ruthotto, Gabriele Steidl, Nicole Tianjiao Yang

    Abstract: Score-based diffusion models (SBDM) have recently emerged as state-of-the-art approaches for image generation. Existing SBDMs are typically formulated in a finite-dimensional setting, where images are considered as tensors of finite size. This paper develops SBDMs in the infinite-dimensional setting, that is, we model the training data as functions supported on a rectangular domain. Besides the qu… ▽ More

    Submitted 4 November, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    MSC Class: 60H10; 65D18

  9. arXiv:2205.12021  [pdf, other

    cs.LG eess.IV math.PR

    PatchNR: Learning from Very Few Images by Patch Normalizing Flow Regularization

    Authors: Fabian Altekrüger, Alexander Denker, Paul Hagemann, Johannes Hertrich, Peter Maass, Gabriele Steidl

    Abstract: Learning neural networks using only few available information is an important ongoing research topic with tremendous potential for applications. In this paper, we introduce a powerful regularizer for the variational modeling of inverse problems in imaging. Our regularizer, called patch normalizing flow regularizer (patchNR), involves a normalizing flow learned on small patches of very few images.… ▽ More

    Submitted 21 November, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Journal ref: Inverse Problems, Volume 39, Number 6, 2023

  10. Generalized Normalizing Flows via Markov Chains

    Authors: Paul Hagemann, Johannes Hertrich, Gabriele Steidl

    Abstract: Normalizing flows, diffusion normalizing flows and variational autoencoders are powerful generative models. This chapter provides a unified framework to handle these approaches via Markov chains. We consider stochastic normalizing flows as a pair of Markov chains fulfilling some properties and show how many state-of-the-art models for data generation fit into this framework. Indeed numerical simul… ▽ More

    Submitted 20 July, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:2109.11375

  11. arXiv:2109.11375  [pdf, other

    cs.LG math.PR

    Stochastic Normalizing Flows for Inverse Problems: a Markov Chains Viewpoint

    Authors: Paul Hagemann, Johannes Hertrich, Gabriele Steidl

    Abstract: To overcome topological constraints and improve the expressiveness of normalizing flow architectures, Wu, Köhler and Noé introduced stochastic normalizing flows which combine deterministic, learnable flow transformations with stochastic sampling methods. In this paper, we consider stochastic normalizing flows from a Markov chain point of view. In particular, we replace transition densities by gene… ▽ More

    Submitted 7 February, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

    Journal ref: SIAM/ASA Journal on Uncertainty Quantification, vol. 10 (3), pp. 1162-1190, 2022

  12. arXiv:2102.03189  [pdf, other

    cs.LG physics.data-an

    Invertible Neural Networks versus MCMC for Posterior Reconstruction in Grazing Incidence X-Ray Fluorescence

    Authors: Anna Andrle, Nando Farchmin, Paul Hagemann, Sebastian Heidenreich, Victor Soltwisch, Gabriele Steidl

    Abstract: Grazing incidence X-ray fluorescence is a non-destructive technique for analyzing the geometry and compositional parameters of nanostructures appearing e.g. in computer chips. In this paper, we propose to reconstruct the posterior parameter distribution given a noisy measurement generated by the forward model by an appropriately learned invertible neural network. This network resembles the transpo… ▽ More

    Submitted 5 February, 2021; originally announced February 2021.

  13. arXiv:2009.02994  [pdf, other

    cs.LG math.OC stat.ML

    Stabilizing Invertible Neural Networks Using Mixture Models

    Authors: Paul Hagemann, Sebastian Neumayer

    Abstract: In this paper, we analyze the properties of invertible neural networks, which provide a way of solving inverse problems. Our main focus lies on investigating and controlling the Lipschitz constants of the corresponding inverse networks. Without such an control, numerical simulations are prone to errors and not much is gained against traditional approaches. Fortunately, our analysis indicates that… ▽ More

    Submitted 1 February, 2021; v1 submitted 7 September, 2020; originally announced September 2020.