Skip to main content

Showing 1–50 of 116 results for author: Larsen, K G

  1. arXiv:2407.03155  [pdf, ps, other

    cs.LO

    The Complexity of Data-Free Nfer

    Authors: Sean Kauffman, Kim Guldstrand Larsen, Martin Zimmermann

    Abstract: Nfer is a Runtime Verification language for the analysis of event traces that applies rules to create hierarchies of time intervals. This work examines the complexity of the evaluation and satisfiability problems for the data-free fragment of nfer. The evaluation problem asks whether a given interval is generated by applying rules to a known input, while the satisfiability problem asks if an input… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2404.18282  [pdf, other

    cs.FL

    Monitoring Real-Time Systems under Parametric Delay

    Authors: Martin Fränzle, Thomas M. Grosen, Kim G. Larsen, Martin Zimmermann

    Abstract: Online monitoring of embedded real-time systems can be achieved by reduction of an adequate property language, like Metric Interval Temporal Logic, to timed automata and symbolic execution of the resulting automata on the trace observed from the system. This direct construction however only is faithful if observation of the trace is immediate in the sense that the monitor can assign exact time sta… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  3. arXiv:2403.08831  [pdf, ps, other

    stat.ML cs.LG math.ST

    Majority-of-Three: The Simplest Optimal Learner?

    Authors: Ishaq Aden-Ali, Mikael Møller Høgsgaard, Kasper Green Larsen, Nikita Zhivotovskiy

    Abstract: Developing an optimal PAC learning algorithm in the realizable setting, where empirical risk minimization (ERM) is suboptimal, was a major open problem in learning theory for decades. The problem was finally resolved by Hanneke a few years ago. Unfortunately, Hanneke's algorithm is quite complex as it returns the majority vote of many ERM classifiers that are trained on carefully selected subsets… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 22 pages

  4. arXiv:2402.13857  [pdf, ps, other

    cs.LG

    Replicable Learning of Large-Margin Halfspaces

    Authors: Alkis Kalavasis, Amin Karbasi, Kasper Green Larsen, Grigoris Velegkas, Felix Zhou

    Abstract: We provide efficient replicable algorithms for the problem of learning large-margin halfspaces. Our results improve upon the algorithms provided by Impagliazzo, Lei, Pitassi, and Sorrell [STOC, 2022]. We design the first dimension-independent replicable algorithms for this task which runs in polynomial time, is proper, and has strictly improved sample complexity compared to the one achieved by Imp… ▽ More

    Submitted 1 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: to be published in ICML 2024

  5. arXiv:2402.02976  [pdf, ps, other

    cs.LG stat.ML

    Boosting, Voting Classifiers and Randomized Sample Compression Schemes

    Authors: Arthur da Cunha, Kasper Green Larsen, Martin Ritzert

    Abstract: In boosting, we aim to leverage multiple weak learners to produce a strong learner. At the center of this paradigm lies the concept of building the strong learner as a voting classifier, which outputs a weighted majority vote of the weak learners. While many successful boosting algorithms, such as the iconic AdaBoost, produce voting classifiers, their theoretical performance has long remained sub-… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  6. arXiv:2311.10204  [pdf, ps, other

    cs.CC cs.DS cs.FL

    The NFA Acceptance Hypothesis: Non-Combinatorial and Dynamic Lower Bounds

    Authors: Karl Bringmann, Allan Grønlund, Marvin Künnemann, Kasper Green Larsen

    Abstract: We pose the fine-grained hardness hypothesis that the textbook algorithm for the NFA Acceptance problem is optimal up to subpolynomial factors, even for dense NFAs and fixed alphabets. We show that this barrier appears in many variations throughout the algorithmic literature by introducing a framework of Colored Walk problems. These yield fine-grained equivalent formulations of the NFA Acceptanc… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 31 pages, Accepted at ITCS

  7. CGAAL: Distributed On-The-Fly ATL Model Checker with Heuristics

    Authors: Falke B. Ø. Carlsen, Lars Bo P. Frydenskov, Nicolaj Ø. Jensen, Jener Rasmussen, Mathias M. Sørensen, Asger G. Weirsøe, Mathias C. Jensen, Kim G. Larsen

    Abstract: We present CGAAL, our efficient on-the-fly model checker for alternating-time temporal logic (ATL) on concurrent game structures (CGS). We present how our tool encodes ATL as extended dependency graphs with negation edges and employs the distributed on-the-fly algorithm by Dalsgaard et al. Our tool offers multiple novel search strategies for the algorithm, including DHS which is inspired by PageRa… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: In Proceedings GandALF 2023, arXiv:2309.17318

    ACM Class: D.2.1; D.2.4

    Journal ref: EPTCS 390, 2023, pp. 99-114

  8. arXiv:2308.16042  [pdf, ps, other

    cs.DS

    Optimal Non-Adaptive Cell Probe Dictionaries and Hashing

    Authors: Kasper Green Larsen, Rasmus Pagh, Giuseppe Persiano, Toniann Pitassi, Kevin Yeo, Or Zamir

    Abstract: We present a simple and provably optimal non-adaptive cell probe data structure for the static dictionary problem. Our data structure supports storing a set of n key-value pairs from [u]x[u] using s words of space and answering key lookup queries in t = O(lg(u/n)/ lg(s/n)) nonadaptive probes. This generalizes a solution to the membership problem (i.e., where no values are associated with keys) due… ▽ More

    Submitted 19 April, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Appears at ICALP 2024. This paper is a merge and revision of two previous reports [PY20] and [LPPZ23]

  9. arXiv:2308.14424  [pdf, other

    cs.LO cs.AI cs.LG eess.SY

    Shielded Reinforcement Learning for Hybrid Systems

    Authors: Asger Horn Brorholt, Peter Gjøl Jensen, Kim Guldstrand Larsen, Florian Lorber, Christian Schilling

    Abstract: Safe and optimal controller synthesis for switched-controlled hybrid systems, which combine differential equations and discrete changes of the system's state, is known to be intricately hard. Reinforcement learning has been leveraged to construct near-optimal controllers, but their behavior is not guaranteed to be safe, even when it is encouraged by reward engineering. One way of imposing safety t… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Journal ref: AISoLA 2023

  10. arXiv:2308.09510  [pdf, ps, other

    quant-ph cs.ET

    Forward and Backward Constrained Bisimulations for Quantum Circuits using Decision Diagrams

    Authors: Lukas Burgholzer, Antonio Jiménez-Pastor, Kim G. Larsen, Mirco Tribastone, Max Tschaikowski, Robert Wille

    Abstract: Efficient methods for the simulation of quantum circuits on classic computers are crucial for their analysis due to the exponential growth of the problem size with the number of qubits. Here we study lumping methods based on bisimulation, an established class of techniques that has been proven successful for (classic) stochastic and deterministic systems such as Markov chains and ordinary differen… ▽ More

    Submitted 10 May, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 20 pages, 1 algorithm, 2 tables

  11. arXiv:2307.06113  [pdf, ps, other

    cs.DS

    Sublinear Time Shortest Path in Expander Graphs

    Authors: Noga Alon, Allan Grønlund, Søren Fuglede Jørgensen, Kasper Green Larsen

    Abstract: Computing a shortest path between two nodes in an undirected unweighted graph is among the most basic algorithmic tasks. Breadth first search solves this problem in linear time, which is clearly also a lower bound in the worst case. However, several works have shown how to solve this problem in sublinear time in expectation when the input graph is drawn from one of several classes of random graphs… ▽ More

    Submitted 31 July, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  12. arXiv:2306.07583  [pdf, ps, other

    cs.DS cs.CR

    Invertible Bloom Lookup Tables with Less Memory and Randomness

    Authors: Nils Fleischhacker, Kasper Green Larsen, Maciej Obremski, Mark Simkin

    Abstract: In this work we study Invertible Bloom Lookup Tables (IBLTs) with small failure probabilities. IBLTs are highly versatile data structures that have found applications in set reconciliation protocols, error-correcting codes, and even the design of advanced cryptographic primitives. For storing $n$ elements and ensuring correctness with probability at least $1 - δ$, existing IBLT constructions requi… ▽ More

    Submitted 14 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  13. arXiv:2304.08745  [pdf, other

    cs.DS cs.CC

    Super-Logarithmic Lower Bounds for Dynamic Graph Problems

    Authors: Kasper Green Larsen, Huacheng Yu

    Abstract: In this work, we prove a $\tildeΩ(\lg^{3/2} n )$ unconditional lower bound on the maximum of the query time and update time for dynamic data structures supporting reachability queries in $n$-node directed acyclic graphs under edge insertions. This is the first super-logarithmic lower bound for any natural graph problem. In proving the lower bound, we also make novel contributions to the state-of-t… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  14. arXiv:2304.00277  [pdf, other

    cs.NI

    Energy Consumption Optimization in Radio Access Networks (ECO-RAN)

    Authors: Anders Mariegaard, Kim G. Larsen, Marco Muniz, Thomas Dyhre Nielsen

    Abstract: In recent years, mobile network operators are showing interest in reducing energy consumption. Toward this goal, in cooperation with the Danish company 2Operate we have developed a stochastic simulation environment for mobile networks. Our simulator interacts with historical data from 2Operate and allow us to turn on and off network cells, replay traffic loads, etc. We have developed an optimizati… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: Report for Energy Cluster Denmark project of the year. https://www.energycluster.dk/en/eco-ran-wins-innovation-project-of-the-year/

  15. arXiv:2302.08588  [pdf, other

    cs.LG cs.FL

    MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

    Authors: Giovanni Bacci, Anna Ingólfsdóttir, Kim G. Larsen, Raphaël Reynouard

    Abstract: Continuous-time Markov chains (CTMCs) are popular modeling formalism that constitutes the underlying semantics for real-time probabilistic systems such as queuing networks, stochastic process algebras, and calculi for systems biology. Prism and Storm are popular model checking tools that provide a number of powerful analysis techniques for CTMCs. These tools accept models expressed as the parallel… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  16. arXiv:2302.06165  [pdf, ps, other

    cs.DS cs.LG

    Sparse Dimensionality Reduction Revisited

    Authors: Mikael Møller Høgsgaard, Lion Kamma, Kasper Green Larsen, Jelani Nelson, Chris Schwiegelshohn

    Abstract: The sparse Johnson-Lindenstrauss transform is one of the central techniques in dimensionality reduction. It supports embedding a set of $n$ points in $\mathbb{R}^d$ into $m=O(\varepsilon^{-2} \lg n)$ dimensions while preserving all pairwise distances to within $1 \pm \varepsilon$. Each input point $x$ is embedded to $Ax$, where $A$ is an $m \times d$ matrix having $s$ non-zeros per column, allowin… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  17. arXiv:2302.04529  [pdf, other

    cs.FL cs.SE

    Timed I/O Automata: It is never too late to complete your timed specification theory

    Authors: Martijn A. Goorden, Kim G. Larsen, Axel Legay, Florian Lorber, Ulrik Nyman, Andrzej Wasowski

    Abstract: A specification theory combines notions of specifications and implementations with a satisfaction relation, a refinement relation and a set of operators supporting stepwise design. We develop a complete specification framework for real-time systems using Timed I/O Automata as the specification formalism, with the semantics expressed in terms of Timed I/O Transition Systems. We provide constructs f… ▽ More

    Submitted 13 July, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Version submitted for review

  18. arXiv:2301.11571  [pdf, other

    cs.LG cs.CC cs.DS

    AdaBoost is not an Optimal Weak to Strong Learner

    Authors: Mikael Møller Høgsgaard, Kasper Green Larsen, Martin Ritzert

    Abstract: AdaBoost is a classic boosting algorithm for combining multiple inaccurate classifiers produced by a weak learner, to produce a strong learner with arbitrarily high accuracy when given enough training data. Determining the optimal number of samples necessary to obtain a given accuracy of the strong learner, is a basic learning theoretic question. Larsen and Ritzert (NeurIPS'22) recently presented… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  19. arXiv:2301.09627  [pdf, ps, other

    cs.LG cs.CC cs.DS

    The Impossibility of Parallelizing Boosting

    Authors: Amin Karbasi, Kasper Green Larsen

    Abstract: The aim of boosting is to convert a sequence of weak learners into a strong learner. At their heart, these methods are fully sequential. In this paper, we investigate the possibility of parallelizing boosting. Our main contribution is a strong negative result, implying that significant parallelization of boosting requires an exponential blow-up in the total computing resources needed for training.

    Submitted 21 August, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

  20. arXiv:2301.01924  [pdf, other

    math.CO cs.CC cs.DM math.LO

    Diagonalization Games

    Authors: Noga Alon, Olivier Bousquet, Kasper Green Larsen, Shay Moran, Shlomo Moran

    Abstract: We study several variants of a combinatorial game which is based on Cantor's diagonal argument. The game is between two players called Kronecker and Cantor. The names of the players are motivated by the known fact that Leopold Kronecker did not appreciate Georg Cantor's arguments about the infinite, and even referred to him as a "scientific charlatan". In the game Kronecker maintains a list of m… ▽ More

    Submitted 22 January, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: 11 pages, added acknowledgements

  21. arXiv:2212.02264  [pdf, ps, other

    cs.LG cs.DS

    Bagging is an Optimal PAC Learner

    Authors: Kasper Green Larsen

    Abstract: Determining the optimal sample complexity of PAC learning in the realizable setting was a central open problem in learning theory for decades. Finally, the seminal work by Hanneke (2016) gave an algorithm with a provably optimal sample complexity. His algorithm is based on a careful and structured sub-sampling of the training data and then returning a majority vote among hypotheses trained on each… ▽ More

    Submitted 16 June, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

  22. arXiv:2211.08184  [pdf, other

    cs.CG cs.LG

    Improved Coresets for Euclidean $k$-Means

    Authors: Vincent Cohen-Addad, Kasper Green Larsen, David Saulpic, Chris Schwiegelshohn, Omar Ali Sheikh-Omar

    Abstract: Given a set of $n$ points in $d$ dimensions, the Euclidean $k$-means problem (resp. the Euclidean $k$-median problem) consists of finding $k$ centers such that the sum of squared distances (resp. sum of distances) from every point to its closest center is minimized. The arguably most popular way of dealing with this problem in the big data setting is to first compress the data by computing a weigh… ▽ More

    Submitted 16 November, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

  23. arXiv:2210.05403  [pdf, other

    cs.DS cs.CG

    Hierarchical Categories in Colored Searching

    Authors: Peyman Afshani, Rasmus Killman, Kasper Green Larsen

    Abstract: In colored range counting (CRC), the input is a set of points where each point is assigned a ``color'' (or a ``category'') and the goal is to store them in a data structure such that the number of distinct categories inside a given query range can be counted efficiently. CRC has strong motivations as it allows data structure to deal with categorical data. However, colors (i.e., the categories) in… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  24. arXiv:2207.03304  [pdf, ps, other

    cs.DS

    Barriers for Faster Dimensionality Reduction

    Authors: Ora Nova Fandina, Mikael Møller Høgsgaard, Kasper Green Larsen

    Abstract: The Johnson-Lindenstrauss transform allows one to embed a dataset of $n$ points in $\mathbb{R}^d$ into $\mathbb{R}^m,$ while preserving the pairwise distance between any pair of points up to a factor $(1 \pm \varepsilon)$, provided that $m = Ω(\varepsilon^{-2} \lg n)$. The transform has found an overwhelming number of algorithmic applications, allowing to speed up algorithms and reducing memory co… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  25. arXiv:2207.03268  [pdf, ps, other

    cs.DS cs.CG

    Fast Discrepancy Minimization with Hereditary Guarantees

    Authors: Kasper Green Larsen

    Abstract: Efficiently computing low discrepancy colorings of various set systems, has been studied extensively since the breakthrough work by Bansal (FOCS 2010), who gave the first polynomial time algorithms for several important settings, including for general set systems, sparse set systems and for set systems with bounded hereditary discrepancy. The hereditary discrepancy of a set system, is the maximum… ▽ More

    Submitted 24 November, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

  26. arXiv:2206.14590  [pdf, ps, other

    cs.FL cs.LO

    Monitoring Timed Properties (Revisited)

    Authors: Thomas Møller Grosen, Sean Kauffman, Kim Guldstrand Larsen, Martin Zimmermann

    Abstract: In this paper we revisit monitoring real-time systems with respect to properties expressed either in Metric Interval Temporal Logic or as Timed Büchi Automata. We offer efficient symbolic online monitoring algorithms in a number of settings, exploiting so-called zones well-known from efficient model checking of Timed Automata. The settings considered include new, much simplified treatment of time… ▽ More

    Submitted 5 September, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

  27. arXiv:2206.01563  [pdf, ps, other

    cs.LG stat.ML

    Optimal Weak to Strong Learning

    Authors: Kasper Green Larsen, Martin Ritzert

    Abstract: The classic algorithm AdaBoost allows to convert a weak learner, that is an algorithm that produces a hypothesis which is slightly better than chance, into a strong learner, achieving arbitrarily high accuracy when given enough training data. We present a new algorithm that constructs a strong learner from a weak learner but uses less training data than AdaBoost and all other weak to strong learne… ▽ More

    Submitted 25 November, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

  28. arXiv:2204.01800  [pdf, ps, other

    cs.DS cs.LG

    The Fast Johnson-Lindenstrauss Transform is Even Faster

    Authors: Ora Nova Fandina, Mikael Møller Høgsgaard, Kasper Green Larsen

    Abstract: The seminal Fast Johnson-Lindenstrauss (Fast JL) transform by Ailon and Chazelle (SICOMP'09) embeds a set of $n$ points in $d$-dimensional Euclidean space into optimal $k=O(\varepsilon^{-2} \ln n)$ dimensions, while preserving all pairwise distances to within a factor $(1 \pm \varepsilon)$. The Fast JL transform supports computing the embedding of a data point in $O(d \ln d +k \ln^2 n)$ time, wher… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  29. Stronger 3SUM-Indexing Lower Bounds

    Authors: Eldon Chung, Kasper Green Larsen

    Abstract: The $3$SUM-Indexing problem was introduced as a data structure version of the $3$SUM problem, with the goal of proving strong conditional lower bounds for static data structures via reductions. Ideally, the conjectured hardness of $3$SUM-Indexing should be replaced by an unconditional lower bound. Unfortunately, we are far from proving this, with the strongest current lower bound being a logarithm… ▽ More

    Submitted 25 March, 2023; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: SODA 2023

    MSC Class: 68P05 ACM Class: E.1; F.2.0

  30. arXiv:2202.12793  [pdf, other

    cs.DS cs.CG cs.LG

    Towards Optimal Lower Bounds for k-median and k-means Coresets

    Authors: Vincent Cohen-Addad, Kasper Green Larsen, David Saulpic, Chris Schwiegelshohn

    Abstract: Given a set of points in a metric space, the $(k,z)$-clustering problem consists of finding a set of $k$ points called centers, such that the sum of distances raised to the power of $z$ of every data point to its closest center is minimized. Special cases include the famous k-median problem ($z = 1$) and k-means problem ($z = 2$). The $k$-median and $k$-means problems are at the heart of modern da… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  31. arXiv:2107.06626  [pdf, ps, other

    cs.DS cs.CG cs.LG

    Optimality of the Johnson-Lindenstrauss Dimensionality Reduction for Practical Measures

    Authors: Yair Bartal, Ora Nova Fandina, Kasper Green Larsen

    Abstract: It is well known that the Johnson-Lindenstrauss dimensionality reduction method is optimal for worst case distortion. While in practice many other methods and heuristics are used, not much is known in terms of bounds on their performance. The question of whether the JL method is optimal for practical measures of distortion was recently raised in BFN19 (NeurIPS'19). They provided upper bounds on it… ▽ More

    Submitted 15 March, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

    ACM Class: F.0; G.0

  32. arXiv:2106.07989   

    cs.LG

    Compression Implies Generalization

    Authors: Allan Grønlund, Mikael Høgsgaard, Lior Kamma, Kasper Green Larsen

    Abstract: Explaining the surprising generalization performance of deep neural networks is an active and important line of research in theoretical machine learning. Influential work by Arora et al. (ICML'18) showed that, noise stability properties of deep nets occurring in practice can be used to provably compress model representations. They then argued that the small representations of compressed networks i… ▽ More

    Submitted 1 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: There is a bug in the proof and that we are working on fixing it. No replacement paper at this point

  33. arXiv:2106.06453  [pdf, ps, other

    cs.CR cs.CC cs.DS

    Property-Preserving Hash Functions from Standard Assumptions

    Authors: Nils Fleischhacker, Kasper Green Larsen, and Mark Simkin

    Abstract: Property-preserving hash functions allow for compressing long inputs $x_0$ and $x_1$ into short hashes $h(x_0)$ and $h(x_1)$ in a manner that allows for computing a predicate $P(x_0, x_1)$ given only the two hash values without having access to the original data. Such hash functions are said to be adversarially robust if an adversary that gets to pick $x_0$ and $x_1$ after the hash function has be… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    MSC Class: 94A60 (Primary) 68P05 (Secondary)

  34. arXiv:2104.13160  [pdf, other

    cs.LO

    Efficient Local Computation of Differential Bisimulations via Coupling and Up-to Methods

    Authors: Giorgio Bacci, Giovanni Bacci, Kim G. Larsen, Mirco Tribastone, Max Tschaikowski, Andrea Vandin

    Abstract: We introduce polynomial couplings, a generalization of probabilistic couplings, to develop an algorithm for the computation of equivalence relations which can be interpreted as a lifting of probabilistic bisimulation to polynomial differential equations, a ubiquitous model of dynamical systems across science and engineering. The algorithm enjoys polynomial time complexity and complements classical… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  35. arXiv:2102.02193  [pdf, other

    cs.DS cs.LG stat.ML

    CountSketches, Feature Hashing and the Median of Three

    Authors: Kasper Green Larsen, Rasmus Pagh, Jakub Tětek

    Abstract: In this paper, we revisit the classic CountSketch method, which is a sparse, random projection that transforms a (high-dimensional) Euclidean vector $v$ to a vector of dimension $(2t-1) s$, where $t, s > 0$ are integer parameters. It is known that even for $t=1$, a CountSketch allows estimating coordinates of $v$ with variance bounded by $\|v\|_2^2/s$. For $t > 1$, the estimator takes the median o… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

  36. arXiv:2011.04998  [pdf, other

    cs.LG stat.ML

    Margins are Insufficient for Explaining Gradient Boosting

    Authors: Allan Grønlund, Lior Kamma, Kasper Green Larsen

    Abstract: Boosting is one of the most successful ideas in machine learning, achieving great practical performance with little fine-tuning. The success of boosted classifiers is most often attributed to improvements in margins. The focus on margin explanations was pioneered in the seminal work by Schapire et al. (1998) and has culminated in the $k$'th margin generalization bound by Gao and Zhou (2013), which… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

  37. arXiv:2008.05145  [pdf, ps, other

    cs.DS

    Further Unifying the Landscape of Cell Probe Lower Bounds

    Authors: Kasper Green Larsen, Jonathan Lindegaard Starup, Jesper Steensgaard

    Abstract: In a landmark paper, Pǎtraşcu demonstrated how a single lower bound for the static data structure problem of reachability in the butterfly graph, could be used to derive a wealth of new and previous lower bounds via reductions. These lower bounds are tight for numerous static data structure problems. Moreover, he also showed that reachability in the butterfly graph reduces to dynamic marked ancest… ▽ More

    Submitted 2 November, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

  38. arXiv:2007.10539  [pdf, other

    cs.FL cs.LO eess.SY

    Verification and Parameter Synthesis for Real-Time Programs using Refinement of Trace Abstraction

    Authors: Franck Cassez, Peter Gjøl Jensen, Kim Guldstrand Larsen

    Abstract: We address the safety verification and synthesis problems for real-time systems. We introduce real-time programs that are made of instructions that can perform assignments to discrete and real-valued variables. They are general enough to capture interesting classes of timed systems such as timed automata, stopwatch automata, time(d) Petri nets and hybrid automata. We propose a semi-algorithm usi… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  39. arXiv:2006.16688  [pdf, other

    cs.LO cs.LG

    It's Time to Play Safe: Shield Synthesis for Timed Systems

    Authors: Roderick Bloem, Peter Gjøl Jensen, Bettina Könighofer, Kim Guldstrand Larsen, Florian Lorber, Alexander Palmisano

    Abstract: Erroneous behaviour in safety critical real-time systems may inflict serious consequences. In this paper, we show how to synthesize timed shields from timed safety properties given as timed automata. A timed shield enforces the safety of a running system while interfering with the system as little as possible. We present timed post-shields and timed pre-shields. A timed pre-shield is placed before… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Submitted to RV2020

  40. arXiv:2006.14923  [pdf, other

    cs.AI

    Approximating Euclidean by Imprecise Markov Decision Processes

    Authors: Manfred Jaeger, Giorgio Bacci, Giovanni Bacci, Kim Guldstrand Larsen, Peter Gjøl Jensen

    Abstract: Euclidean Markov decision processes are a powerful tool for modeling control problems under uncertainty over continuous domains. Finite state imprecise, Markov decision processes can be used to approximate the behavior of these infinite models. In this paper we address two questions: first, we investigate what kind of approximation guarantees are obtained when the Euclidean process is approximated… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

  41. arXiv:2006.11122  [pdf, other

    cs.LG stat.ML

    A general framework for defining and optimizing robustness

    Authors: Alessandro Tibo, Manfred Jaeger, Kim G. Larsen

    Abstract: Robustness of neural networks has recently attracted a great amount of interest. The many investigations in this area lack a precise common foundation of robustness concepts. Therefore, in this paper, we propose a rigorous and flexible framework for defining different types of robustness properties for classifiers. Our robustness concept is based on postulates that robustness of a classifier shoul… ▽ More

    Submitted 29 May, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

  42. arXiv:2006.02175  [pdf, ps, other

    cs.LG stat.ML

    Near-Tight Margin-Based Generalization Bounds for Support Vector Machines

    Authors: Allan Grønlund, Lior Kamma, Kasper Green Larsen

    Abstract: Support Vector Machines (SVMs) are among the most fundamental tools for binary classification. In its simplest formulation, an SVM produces a hyperplane separating two classes of data using the largest possible margin to the data. The focus on maximizing the margin has been well motivated through numerous generalization bounds. In this paper, we revisit and improve the classic generalization bound… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

  43. Stubborn Set Reduction for Two-Player Reachability Games

    Authors: Frederik Meyer Bønneland, Peter Gjøl Jensen, Kim Guldstrand Larsen, Marco Muñiz, Jiří Srba

    Abstract: Partial order reductions have been successfully applied to model checking of concurrent systems and practical applications of the technique show nontrivial reduction in the size of the explored state space. We present a theory of partial order reduction based on stubborn sets in the game-theoretical setting of 2-player games with reachability objectives. Our stubborn reduction allows us to prune t… ▽ More

    Submitted 17 March, 2021; v1 submitted 20 December, 2019; originally announced December 2019.

    Journal ref: Logical Methods in Computer Science, Volume 17, Issue 1 (March 18, 2021) lmcs:5997

  44. arXiv:1909.12518  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Margin-Based Generalization Lower Bounds for Boosted Classifiers

    Authors: Allan Grønlund, Lior Kamma, Kasper Green Larsen, Alexander Mathiasen, Jelani Nelson

    Abstract: Boosting is one of the most successful ideas in machine learning. The most well-accepted explanations for the low generalization error of boosting algorithms such as AdaBoost stem from margin theory. The study of margins in the context of boosting algorithms was initiated by Schapire, Freund, Bartlett and Lee (1998) and has inspired numerous boosting algorithms and generalization bounds. To date,… ▽ More

    Submitted 7 May, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

  45. arXiv:1909.09912  [pdf, ps, other

    cs.DS cs.DM cs.LG math.CO

    Optimal Learning of Joint Alignments with a Faulty Oracle

    Authors: Kasper Green Larsen, Michael Mitzenmacher, Charalampos E. Tsourakakis

    Abstract: We consider the following problem, which is useful in applications such as joint image and shape alignment. The goal is to recover $n$ discrete variables $g_i \in \{0, \ldots, k-1\}$ (up to some global offset) given noisy observations of a set of their pairwise differences $\{(g_i - g_j) \bmod k\}$; specifically, with probability $\frac{1}{k}+δ$ for some $δ> 0$ one obtains the correct answer, and… ▽ More

    Submitted 21 September, 2019; originally announced September 2019.

    Comments: 10 pages

  46. Computing Probabilistic Bisimilarity Distances for Probabilistic Automata

    Authors: Giorgio Bacci, Giovanni Bacci, Kim G. Larsen, Radu Mardare, Qiyi Tang, Franck van Breugel

    Abstract: The probabilistic bisimilarity distance of Deng et al. has been proposed as a robust quantitative generalization of Segala and Lynch's probabilistic bisimilarity for probabilistic automata. In this paper, we present a characterization of the bisimilarity distance as the solution of a simple stochastic game. The characterization gives us an algorithm to compute the distances by applying Condon's si… ▽ More

    Submitted 1 February, 2021; v1 submitted 3 July, 2019; originally announced July 2019.

    Journal ref: Logical Methods in Computer Science, Volume 17, Issue 1 (February 2, 2021) lmcs:5994

  47. arXiv:1906.12239  [pdf, ps, other

    cs.LG stat.ML

    L*-Based Learning of Markov Decision Processes (Extended Version)

    Authors: Martin Tappler, Bernhard K. Aichernig, Giovanni Bacci, Maria Eichlseder, Kim G. Larsen

    Abstract: Automata learning techniques automatically generate system models from test observations. These techniques usually fall into two categories: passive and active. Passive learning uses a predetermined data set, e.g., system logs. In contrast, active learning actively queries the system under learning, which is considered more efficient. An influential active learning technique is Angluin's L* algo… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

    Comments: an extended version of a conference paper accepted for presentation at FM 2019, the 23rd international symposium on formal methods

  48. SOS: Safe, Optimal and Small Strategies for Hybrid Markov Decision Processes

    Authors: Pranav Ashok, Jan Křetínský, Kim Guldstrand Larsen, Adrien Le Coënt, Jakob Haahr Taankvist, Maximilian Weininger

    Abstract: For hybrid Markov decision processes, UPPAAL Stratego can compute strategies that are safe for a given safety property and (in the limit) optimal for a given cost function. Unfortunately, these strategies cannot be exported easily since they are computed as a very long list. In this paper, we demonstrate methods to learn compact representations of the strategies in the form of decision trees. Thes… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

  49. arXiv:1904.04828  [pdf, ps, other

    cs.DS cs.CR

    Lower Bounds for Oblivious Near-Neighbor Search

    Authors: Kasper Green Larsen, Tal Malkin, Omri Weinstein, Kevin Yeo

    Abstract: We prove an $Ω(d \lg n/ (\lg\lg n)^2)$ lower bound on the dynamic cell-probe complexity of statistically $\mathit{oblivious}$ approximate-near-neighbor search ($\mathsf{ANN}$) over the $d$-dimensional Hamming cube. For the natural setting of $d = Θ(\log n)$, our result implies an $\tildeΩ(\lg^2 n)$ lower bound, which is a quadratic improvement over the highest (non-oblivious) cell-probe lower boun… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: 28 pages

  50. arXiv:1902.10935  [pdf, other

    cs.DS cs.CC

    Lower Bounds for Multiplication via Network Coding

    Authors: Peyman Afshani, Casper Benjamin Freksen, Lior Kamma, Kasper Green Larsen

    Abstract: Multiplication is one of the most fundamental computational problems, yet its true complexity remains elusive. The best known upper bound, by Fürer, shows that two $n$-bit numbers can be multiplied via a boolean circuit of size $O(n \lg n \cdot 4^{\lg^*n})$, where $\lg^*n$ is the very slowly growing iterated logarithm. In this work, we prove that if a central conjecture in the area of network codi… ▽ More

    Submitted 28 February, 2019; originally announced February 2019.