Skip to main content

Showing 1–50 of 173 results for author: Roy, B

  1. arXiv:2407.12288  [pdf, other

    stat.ML cs.AI cs.LG

    Information-Theoretic Foundations for Machine Learning

    Authors: Hong Jun Jeon, Benjamin Van Roy

    Abstract: The staggering progress of machine learning in the past decade has been a sight to behold. In retrospect, it is both remarkable and unsettling that these milestones were achievable with little to no rigorous theory to guide experimentation. Despite this fact, practitioners have been able to guide their future experimentation via observations from previous large-scale empirical investigations. Howe… ▽ More

    Submitted 18 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.12185  [pdf, other

    cs.LG cs.AI

    Satisficing Exploration for Deep Reinforcement Learning

    Authors: Dilip Arumugam, Saurabh Kumar, Ramki Gummadi, Benjamin Van Roy

    Abstract: A default assumption in the design of reinforcement-learning algorithms is that a decision-making agent always explores to learn optimal behavior. In sufficiently complex environments that approach the vastness and scale of the real world, however, attaining optimal performance may in fact be an entirely intractable endeavor and an agent may seldom find itself in a position to complete the requisi… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to the Finding the Frame Workshop at RLC 2024

  3. arXiv:2407.12178  [pdf, other

    cs.LG cs.AI

    Exploration Unbound

    Authors: Dilip Arumugam, Wanqiao Xu, Benjamin Van Roy

    Abstract: A sequential decision-making agent balances between exploring to gain new knowledge about an environment and exploiting current knowledge to maximize immediate reward. For environments studied in the traditional literature, optimal decisions gravitate over time toward exploitation as the agent accumulates sufficient knowledge and the benefits of further exploration vanish. What if, however, the en… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to the Finding the Frame Workshop at RLC 2024

  4. arXiv:2407.10023  [pdf, other

    cs.SE

    Reproducibility of Issues Reported in Stack Overflow Questions: Challenges, Impact & Estimation

    Authors: Saikat Mondal, Banani Roy

    Abstract: Software developers often submit questions to technical Q&A sites like Stack Overflow (SO) to resolve code-level problems. In practice, they include example code snippets with questions to explain the programming issues. Existing research suggests that users attempt to reproduce the reported issues using given code snippets when answering questions. Unfortunately, such code snippets could not alwa… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Accepted in Journal of Systems and Software. arXiv admin note: text overlap with arXiv:2112.10056

  5. arXiv:2407.02177  [pdf, other

    cs.DS cs.DM

    Minsum Problem for Discrete and Weighted Set Flow on Dynamic Path Network

    Authors: Bubai Manna, Bodhayan Roy, Vorapong Suppakitpaisarn

    Abstract: In this research, we examine the minsum flow problem in dynamic path networks where flows are represented as discrete and weighted sets. The minsum flow problem has been widely studied for its relevance in finding evacuation routes during emergencies such as earthquakes. However, previous approaches often assume that individuals are separable and identical, which does not adequately account for th… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  6. arXiv:2407.01456  [pdf, other

    cs.LG cs.AI

    Information-Theoretic Foundations for Neural Scaling Laws

    Authors: Hong Jun Jeon, Benjamin Van Roy

    Abstract: Neural scaling laws aim to characterize how out-of-sample error behaves as a function of model and training dataset size. Such scaling laws guide allocation of a computational resources between model and data processing to minimize error. However, existing theoretical support for neural scaling laws lacks rigor and clarity, entangling the roles of information and optimization. In this work, we dev… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2212.01365

  7. arXiv:2406.16209  [pdf, other

    cs.CG

    Covering Simple Orthogonal Polygons with Rectangles

    Authors: Aniket Basu Roy

    Abstract: We study the problem of Covering Orthogonal Polygons with Rectangles. For polynomial-time algorithms, the best-known approximation factor is $O(\sqrt{\log n})$ when the input polygon may have holes [Kumar and Ramesh, STOC '99, SICOMP '03], and there is a $2$-factor approximation algorithm known when the polygon is hole-free [Franzblau, SIDMA '89]. Arguably, an easier problem is the Boundary Cover… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 29 pages, 19 figures

  8. Attention-Based Learning for Fluid State Interpolation and Editing in a Time-Continuous Framework

    Authors: Bruno Roy

    Abstract: In this work, we introduce FluidsFormer: a transformer-based approach for fluid interpolation within a continuous-time framework. By combining the capabilities of PITT and a residual neural network (RNN), we analytically predict the physical properties of the fluid state. This enables us to interpolate substep frames between simulated keyframes, enhancing the temporal smoothness and sharpness of a… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures, submitted and accepted to SIGGRAPH

  9. arXiv:2404.16329  [pdf, other

    cs.DS cs.CC cs.CG

    On Approximating the Dynamic and Discrete Network Flow Problem

    Authors: Bubai Manna, Bodhayan Roy, Vorapong Suppakitpaisarn

    Abstract: We examine the dynamic network flow problem under the assumption that the flow consists of discrete units. The dynamic network flow problem is commonly addressed in the context of developing evacuation plans, where the flow is typically treated as a continuous quantity. However, real-world scenarios often involve moving groups, such as families, as single units. We demonstrate that solving the dyn… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  10. arXiv:2404.15487  [pdf, other

    cs.CG cs.DS

    Minimum Consistent Subset in Trees and Interval Graphs

    Authors: Aritra Banik, Sayani Das, Anil Maheshwari, Bubai Manna, Subhas C Nandy, Krishna Priya K M, Bodhayan Roy, Sasanka Roy, Abhishek Sahu

    Abstract: In the Minimum Consistent Subset (MCS) problem, we are presented with a connected simple undirected graph $G=(V,E)$, consisting of a vertex set $V$ of size $n$ and an edge set $E$. Each vertex in $V$ is assigned a color from the set $\{1,2,\ldots, c\}$. The objective is to determine a subset $V' \subseteq V$ with minimum possible cardinality, such that for every vertex $v \in V$, at least one of i… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  11. arXiv:2404.15446  [pdf, other

    cs.CR eess.SY

    OffRAMPS: An FPGA-based Intermediary for Analysis and Modification of Additive Manufacturing Control Systems

    Authors: Jason Blocklove, Md Raz, Prithwish Basu Roy, Hammond Pearce, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri

    Abstract: Cybersecurity threats in Additive Manufacturing (AM) are an increasing concern as AM adoption continues to grow. AM is now being used for parts in the aerospace, transportation, and medical domains. Threat vectors which allow for part compromise are particularly concerning, as any failure in these domains would have life-threatening consequences. A major challenge to investigation of AM part-compr… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  12. arXiv:2402.00396  [pdf, other

    cs.LG cs.AI cs.CL stat.ME stat.ML

    Efficient Exploration for LLMs

    Authors: Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy

    Abstract: We present evidence of substantial benefit from efficient exploration in gathering human feedback to improve large language models. In our experiments, an agent sequentially generates queries while fitting a reward model to the feedback received. Our best-performing agent generates queries using double Thompson sampling, with uncertainty represented by an epistemic neural network. Our results demo… ▽ More

    Submitted 4 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  13. arXiv:2401.15530  [pdf, ps, other

    cs.LG cs.IT

    An Information-Theoretic Analysis of In-Context Learning

    Authors: Hong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy

    Abstract: Previous theoretical results pertaining to meta-learning on sequences build on contrived assumptions and are somewhat convoluted. We introduce new information-theoretic tools that lead to an elegant and very general decomposition of error into three components: irreducible error, meta-learning error, and intra-task error. These tools unify analyses across many meta-learning challenges. To illustra… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  14. arXiv:2401.13239  [pdf, other

    cs.LG cs.HC

    Adaptive Crowdsourcing Via Self-Supervised Learning

    Authors: Anmol Kagrecha, Henrik Marklund, Benjamin Van Roy, Hong Jun Jeon, Richard Zeckhauser

    Abstract: Common crowdsourcing systems average estimates of a latent quantity of interest provided by many crowdworkers to produce a group estimate. We develop a new approach -- predict-each-worker -- that leverages self-supervised learning and a novel aggregation scheme. This approach adapts weights assigned to crowdworkers based on estimates they provided for previous quantities. When skills vary across c… ▽ More

    Submitted 1 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 33 pages, 3 figures

  15. arXiv:2401.04318  [pdf, ps, other

    cs.GT

    Contiguous Allocation of Indivisible Items on a Path

    Authors: Yasushi Kawase, Bodhayan Roy, Mohammad Azharuddin Sanpui

    Abstract: We study the problem of allocating indivisible items on a path among agents. The objective is to find a fair and efficient allocation in which each agent's bundle forms a contiguous block on the line. We demonstrate that, even when the valuations are binary additive, deciding whether every item can be allocated to an agent who wants it is NP-complete. Consequently, we provide two fixed-parameter t… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: A preliminary version was accepted at AAMAS 2024 as an extended abstract

  16. arXiv:2312.01057  [pdf, other

    cs.LG cs.AI cs.CL

    RLHF and IIA: Perverse Incentives

    Authors: Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy

    Abstract: Existing algorithms for reinforcement learning from human feedback (RLHF) can incentivize responses at odds with preferences because they are based on models that assume independence of irrelevant alternatives (IIA). The perverse incentives induced by IIA hinder innovations on query formats and learning algorithms.

    Submitted 1 February, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  17. arXiv:2311.04581  [pdf, other

    cs.AR cs.CR

    KiD: A Hardware Design Framework Targeting Unified NTT Multiplication for CRYSTALS-Kyber and CRYSTALS-Dilithium on FPGA

    Authors: Suraj Mandal, Debapriya Basu Roy

    Abstract: Large-degree polynomial multiplication is an integral component of post-quantum secure lattice-based cryptographic algorithms like CRYSTALS-Kyber and Dilithium. The computational complexity of large-degree polynomial multiplication can be reduced significantly through Number Theoretic Transformation (NTT). In this paper, we aim to develop a unified and shared NTT architecture that can support poly… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  18. arXiv:2310.07786  [pdf, other

    cs.LG cs.IR

    Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling

    Authors: Zheqing Zhu, Yueyang Liu, Xu Kuang, Benjamin Van Roy

    Abstract: Real-world applications of contextual bandits often exhibit non-stationarity due to seasonality, serendipity, and evolving social trends. While a number of non-stationary contextual bandit learning algorithms have been proposed in the literature, they excessively explore due to a lack of prioritization for information of enduring value, or are designed in ways that do not scale in modern applicati… ▽ More

    Submitted 14 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  19. arXiv:2309.07291  [pdf

    cs.SE

    Reusability Challenges of Scientific Workflows: A Case Study for Galaxy

    Authors: Khairul Alam, Banani Roy, Alexander Serebrenik

    Abstract: Scientific workflow has become essential in software engineering because it provides a structured approach to designing, executing, and analyzing scientific experiments. Software developers and researchers have developed hundreds of scientific workflow management systems so scientists in various domains can benefit from them by automating repetitive tasks, enhancing collaboration, and ensuring the… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted in APSEC 2023

  20. arXiv:2309.06424  [pdf

    cs.SE cs.AI cs.LG

    Unveiling the potential of large language models in generating semantic and cross-language clones

    Authors: Palash R. Roy, Ajmain I. Alam, Farouq Al-omari, Banani Roy, Chanchal K. Roy, Kevin A. Schneider

    Abstract: Semantic and Cross-language code clone generation may be useful for code reuse, code comprehension, refactoring and benchmarking. OpenAI's GPT model has potential in such clone generation as GPT is used for text generation. When developers copy/paste codes from Stack Overflow (SO) or within a system, there might be inconsistent changes leading to unexpected behaviours. Similarly, if someone posses… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted in IWSC

  21. arXiv:2309.05550  [pdf, other

    cs.CR

    Multiplierless Design of High-Speed Very Large Constant Multiplications

    Authors: Levent Aksoy, Debapriya Basu Roy, Malik Imran, Samuel Pagliarini

    Abstract: In cryptographic algorithms, the constants to be multiplied by a variable can be very large due to security requirements. Thus, the hardware complexity of such algorithms heavily depends on the design architecture handling large constants. In this paper, we introduce an electronic design automation tool, called LEIGER, which can automatically generate the realizations of very large constant multip… ▽ More

    Submitted 12 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

  22. arXiv:2308.13963  [pdf

    cs.SE

    GPTCloneBench: A comprehensive benchmark of semantic clones and cross-language clones using GPT-3 model and SemanticCloneBench

    Authors: Ajmain Inqiad Alam, Palash Ranjan Roy, Farouq Al-omari, Chanchal Kumar Roy, Banani Roy, Kevin Schneider

    Abstract: With the emergence of Machine Learning, there has been a surge in leveraging its capabilities for problem-solving across various domains. In the code clone realm, the identification of type-4 or semantic clones has emerged as a crucial yet challenging task. Researchers aim to utilize Machine Learning to tackle this challenge, often relying on the BigCloneBench dataset. However, it's worth noting t… ▽ More

    Submitted 1 September, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted in 39th IEEE International Conference on Software Maintenance and Evolution(ICSME 2023)

  23. arXiv:2308.11958  [pdf, other

    cs.LG cs.AI

    Maintaining Plasticity in Continual Learning via Regenerative Regularization

    Authors: Saurabh Kumar, Henrik Marklund, Benjamin Van Roy

    Abstract: In continual learning, plasticity refers to the ability of an agent to quickly adapt to new information. Neural networks are known to lose plasticity when processing non-stationary data streams. In this paper, we propose L2 Init, a simple approach for maintaining plasticity by incorporating in the loss function L2 regularization toward initial parameters. This is very similar to standard L2 regula… ▽ More

    Submitted 3 October, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

  24. arXiv:2307.14185  [pdf, other

    cs.LG

    A comparison of machine learning surrogate models of street-scale flooding in Norfolk, Virginia

    Authors: Diana McSpadden, Steven Goldenberg, Binata Roy, Malachi Schram, Jonathan L. Goodall, Heather Richter

    Abstract: Low-lying coastal cities, exemplified by Norfolk, Virginia, face the challenge of street flooding caused by rainfall and tides, which strain transportation and sewer systems and can lead to property damage. While high-fidelity, physics-based simulations provide accurate predictions of urban pluvial flooding, their computational complexity renders them unsuitable for real-time applications. Using d… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 10 pages, 8 figures

  25. arXiv:2307.11046  [pdf, other

    cs.LG cs.AI

    A Definition of Continual Reinforcement Learning

    Authors: David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh

    Abstract: In a standard view of the reinforcement learning problem, an agent's goal is to efficiently identify a policy that maximizes long-term reward. However, this perspective is based on a restricted view of learning as finding a solution, rather than treating learning as endless adaptation. In contrast, continual reinforcement learning refers to the setting in which the best agents never stop learning.… ▽ More

    Submitted 1 December, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  26. arXiv:2307.11044  [pdf, other

    cs.LG cs.AI

    On the Convergence of Bounded Agents

    Authors: David Abel, André Barreto, Hado van Hasselt, Benjamin Van Roy, Doina Precup, Satinder Singh

    Abstract: When has an agent converged? Standard models of the reinforcement learning problem give rise to a straightforward definition of convergence: An agent converges when its behavior or performance in each environment state stops changing. However, as we shift the focus of our learning problem from the environment's state to the agent's state, the concept of an agent's convergence becomes significantly… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  27. arXiv:2307.08593  [pdf, other

    physics.acc-ph cs.LG hep-ex nucl-ex nucl-th

    Artificial Intelligence for the Electron Ion Collider (AI4EIC)

    Authors: C. Allaire, R. Ammendola, E. -C. Aschenauer, M. Balandat, M. Battaglieri, J. Bernauer, M. Bondì, N. Branson, T. Britton, A. Butter, I. Chahrour, P. Chatagnon, E. Cisbani, E. W. Cline, S. Dash, C. Dean, W. Deconinck, A. Deshpande, M. Diefenthaler, R. Ent, C. Fanelli, M. Finger, M. Finger, Jr., E. Fol, S. Furletov , et al. (70 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 27 pages, 11 figures, AI4EIC workshop, tutorials and hackathon

  28. arXiv:2307.04345  [pdf, other

    cs.LG cs.AI

    Continual Learning as Computationally Constrained Reinforcement Learning

    Authors: Saurabh Kumar, Henrik Marklund, Ashish Rao, Yifan Zhu, Hong Jun Jeon, Yueyang Liu, Benjamin Van Roy

    Abstract: An agent that efficiently accumulates knowledge to develop increasingly sophisticated skills over a long lifetime could advance the frontier of artificial intelligence capabilities. The design of such agents, which remains a long-standing challenge of artificial intelligence, is addressed by the subject of continual learning. This monograph clarifies and formalizes concepts of continual learning,… ▽ More

    Submitted 20 August, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  29. arXiv:2306.14834  [pdf, other

    cs.IR cs.AI

    Scalable Neural Contextual Bandit for Recommender Systems

    Authors: Zheqing Zhu, Benjamin Van Roy

    Abstract: High-quality recommender systems ought to deliver both innovative and relevant content through effective and exploratory interactions with users. Yet, supervised learning-based neural networks, which form the backbone of many existing recommender systems, only leverage recognized user interests, falling short when it comes to efficiently uncovering unknown user preferences. While there has been so… ▽ More

    Submitted 18 August, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Journal ref: ACM International Conference on Information and Knowledge Management (CIKM 2023) 32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

  30. Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems

    Authors: Baolin Li, Rohan Basu Roy, Daniel Wang, Siddharth Samsi, Vijay Gadepally, Devesh Tiwari

    Abstract: The rapid growth in demand for HPC systems has led to a rise in carbon footprint, which requires urgent intervention. In this work, we present a comprehensive analysis of the carbon footprint of high-performance computing (HPC) systems, considering the carbon footprint during both the hardware manufacturing and system operational stages. Our work employs HPC hardware component carbon footprint mod… ▽ More

    Submitted 18 November, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  31. Neural ShDF: Reviving an Efficient and Consistent Mesh Segmentation Method

    Authors: Bruno Roy

    Abstract: Partitioning a polygonal mesh into meaningful parts can be challenging. Many applications require decomposing such structures for further processing in computer graphics. In the last decade, several methods were proposed to tackle this problem, at the cost of intensive computational times. Recently, machine learning has proven to be effective for the segmentation task on 3D structures. Nevertheles… ▽ More

    Submitted 31 August, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 9 pages, 13 figures, and 3 tables. Short paper and poster published and presented at SIGGRAPH 2023

  32. arXiv:2305.14321  [pdf, other

    cs.CL

    ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings

    Authors: William Brannon, Wonjune Kang, Suyash Fulay, Hang Jiang, Brandon Roy, Deb Roy, Jad Kabbara

    Abstract: Learning on text-attributed graphs (TAGs), in which nodes are associated with one or more texts, has been the subject of much recent work. However, most approaches tend to make strong assumptions about the downstream task of interest, are reliant on hand-labeled data, or fail to equally balance the importance of both text and graph representations. In this work, we propose Contrastive Graph-Text p… ▽ More

    Submitted 9 July, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: New visualizations, added references, and an application to community detection. To appear at the TextGraphs workshop @ ACL 2024. 21 pages, 5 figures, 13 tables

  33. arXiv:2305.11455  [pdf, other

    cs.CL cs.AI cs.LG

    Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models

    Authors: Wanqiao Xu, Shi Dong, Dilip Arumugam, Benjamin Van Roy

    Abstract: A centerpiece of the ever-popular reinforcement learning from human feedback (RLHF) approach to fine-tuning autoregressive language models is the explicit training of a reward model to emulate human feedback, distinct from the language model itself. This reward model is then coupled with policy-gradient methods to dramatically improve the alignment between language model outputs and desired respon… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  34. arXiv:2305.03263  [pdf, other

    cs.LG cs.AI

    Bayesian Reinforcement Learning with Limited Cognitive Load

    Authors: Dilip Arumugam, Mark K. Ho, Noah D. Goodman, Benjamin Van Roy

    Abstract: All biological and artificial agents must learn and make decisions given limits on their ability to process information. As such, a general theory of adaptive behavior should be able to account for the complex interactions between an agent's learning history, decisions, and capacity constraints. Recent work in computer science has begun to clarify the principles that shape these dynamics by bridgi… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  35. arXiv:2305.03180  [pdf, other

    cs.CG

    On Range Summary Queries

    Authors: Peyman Afshani, Pingan Cheng, Aniket Basu Roy, Zhewei Wei

    Abstract: We study the query version of the approximate heavy hitter and quantile problems. In the former problem, the input is a parameter $\varepsilon$ and a set $P$ of $n$ points in $\mathbb{R}^d$ where each point is assigned a color from a set $C$, and we want to build a structure s.t. given any geometric range $γ$, we can efficiently find a list of approximate heavy hitters in $γ\cap P$, i.e., colors t… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: To appear in ICALP'23

  36. arXiv:2303.02337  [pdf, other

    cs.CG

    Some results on Minimum Consistent Subsets of Trees

    Authors: Bubai Manna, Bodhayan Roy

    Abstract: For a graph G = (V,E) where each vertex is coloured by one of k colours, consider a subset C of V such that for each vertex v in V\C, its set of nearest neighbours in C contains at least one vertex of the same colour as v. Such a C is called a consistent subset (CS). Computing a consistent subset of the minimum size is called the Minimum Consistent Subset problem (MCS). MCS is known to be NP-compl… ▽ More

    Submitted 30 May, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  37. arXiv:2302.12202  [pdf, ps, other

    cs.LG stat.ML

    A Definition of Non-Stationary Bandits

    Authors: Yueyang Liu, Xu Kuang, Benjamin Van Roy

    Abstract: Despite the subject of non-stationary bandit learning having attracted much recent attention, we have yet to identify a formal definition of non-stationarity that can consistently distinguish non-stationary bandits from stationary ones. Prior work has characterized non-stationary bandits as bandits for which the reward distribution changes over time. We demonstrate that this definition can ambiguo… ▽ More

    Submitted 28 July, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  38. arXiv:2302.09205  [pdf, other

    cs.LG cs.AI

    Approximate Thompson Sampling via Epistemic Neural Networks

    Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

    Abstract: Thompson sampling (TS) is a popular heuristic for action selection, but it requires sampling from a posterior distribution. Unfortunately, this can become computationally intractable in complex environments, such as those modeled using neural networks. Approximate posterior samples can produce effective actions, but only if they reasonably approximate joint predictive distributions of outputs acro… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  39. arXiv:2302.03319  [pdf, ps, other

    cs.LG math.ST stat.ML

    Leveraging Demonstrations to Improve Online Learning: Quality Matters

    Authors: Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen

    Abstract: We investigate the extent to which offline demonstration data can improve online learning. It is natural to expect some improvement, but the question is how, and by how much? We show that the degree of improvement must depend on the quality of the demonstration data. To generate portable insights, we focus on Thompson sampling (TS) applied to a multi-armed bandit as a prototypical online learning… ▽ More

    Submitted 17 May, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: Accepted at ICML 2023

  40. arXiv:2301.10336  [pdf, other

    cs.CR

    A survey of Digital Manufacturing Hardware and Software Trojans

    Authors: Prithwish Basu Roy, Mudit Bhargava, Chia-Yun Chang, Ellen Hui, Nikhil Gupta, Ramesh Karri, Hammond Pearce

    Abstract: Digital Manufacturing (DM) refers to the on-going adoption of smarter, more agile manufacturing processes and cyber-physical systems. This includes modern techniques and technologies such as Additive Manufacturing (AM)/3D printing, as well as the Industrial Internet of Things (IIoT) and the broader trend toward Industry 4.0. However, this adoption is not without risks: with a growing complexity an… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 15 pages

  41. arXiv:2212.12633  [pdf, other

    cs.LG cs.AI

    Inclusive Artificial Intelligence

    Authors: Dilip Arumugam, Shi Dong, Benjamin Van Roy

    Abstract: Prevailing methods for assessing and comparing generative AIs incentivize responses that serve a hypothetical representative individual. Evaluating models in these terms presumes homogeneous preferences across the population and engenders selection of agglomerative AIs, which fail to represent the diverse range of interests across individuals. We propose an alternative evaluation method that inste… ▽ More

    Submitted 3 March, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

  42. arXiv:2212.03399  [pdf, other

    cs.SE

    Utilizing Source Code Syntax Patterns to Detect Bug Inducing Commits using Machine Learning Models

    Authors: Md Nadim, Banani Roy

    Abstract: Detecting Bug Inducing Commit (BIC) or Just in Time (JIT) defect prediction using Machine Learning (ML) based models requires tabulated feature values extracted from the source code or historical maintenance data of a software system. Existing studies have utilized meta-data from source code repositories (we named them GitHub Statistics or GS), n-gram-based source code text processing, and develop… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  43. arXiv:2212.01365  [pdf, other

    cs.LG

    An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws

    Authors: Hong Jun Jeon, Benjamin Van Roy

    Abstract: We study the compute-optimal trade-off between model and training data set sizes for large neural networks. Our result suggests a linear relation similar to that supported by the empirical analysis of chinchilla. While that work studies transformer-based large language models trained on the MassiveText corpus gopher, as a starting point for development of a mathematical theory, we focus on a simpl… ▽ More

    Submitted 18 October, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

  44. arXiv:2211.15931  [pdf, other

    cs.LG stat.ML

    Posterior Sampling for Continuing Environments

    Authors: Wanqiao Xu, Shi Dong, Benjamin Van Roy

    Abstract: We develop an extension of posterior sampling for reinforcement learning (PSRL) that is suited for a continuing agent-environment interface and integrates naturally into agent designs that scale to complex environments. The approach, continuing PSRL, maintains a statistically plausible model of the environment and follows a policy that maximizes expected $γ$-discounted return in that model. At eac… ▽ More

    Submitted 1 February, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

  45. arXiv:2211.01568  [pdf, other

    cs.CL cs.AI

    Fine-Tuning Language Models via Epistemic Neural Networks

    Authors: Ian Osband, Seyed Mohammad Asghari, Benjamin Van Roy, Nat McAleese, John Aslanides, Geoffrey Irving

    Abstract: Language models often pre-train on large unsupervised text corpora, then fine-tune on additional task-specific data. However, typical fine-tuning schemes do not prioritize the examples that they tune on. We show that, if you can prioritize informative training data, you can achieve better performance while using fewer labels. To do this we augment a language model with an epinet: a small additiona… ▽ More

    Submitted 10 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  46. arXiv:2210.16877  [pdf, ps, other

    cs.LG cs.AI

    On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning

    Authors: Dilip Arumugam, Mark K. Ho, Noah D. Goodman, Benjamin Van Roy

    Abstract: Throughout the cognitive-science literature, there is widespread agreement that decision-making agents operating in the real world do so under limited information-processing capabilities and without access to unbounded cognitive or computational resources. Prior work has drawn inspiration from this fact and leveraged an information-theoretic model of such behaviors or policies as communication cha… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Accepted to the NeurIPS Workshop on Information-Theoretic Principles in Cognitive Systems (InfoCog) 2022. arXiv admin note: text overlap with arXiv:2206.02072

  47. arXiv:2209.08627  [pdf, other

    cs.LG

    Is Stochastic Gradient Descent Near Optimal?

    Authors: Yifan Zhu, Hong Jun Jeon, Benjamin Van Roy

    Abstract: The success of neural networks over the past decade has established them as effective models for many relevant data generating processes. Statistical theory on neural networks indicates graceful scaling of sample complexity. For example, Joen & Van Roy (arXiv:2203.00246) demonstrate that, when data is generated by a ReLU teacher network with $W$ parameters, an optimal learner needs only… ▽ More

    Submitted 6 October, 2022; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.00246

  48. arXiv:2209.07065  [pdf, other

    cs.SI cs.AI cs.CL

    CommunityLM: Probing Partisan Worldviews from Language Models

    Authors: Hang Jiang, Doug Beeferman, Brandon Roy, Deb Roy

    Abstract: As political attitudes have diverged ideologically in the United States, political speech has diverged lingusitically. The ever-widening polarization between the US political parties is accelerated by an erosion of mutual understanding between them. We aim to make these communities more comprehensible to each other with a framework that probes community-specific responses to the same survey questi… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Paper accepted by COLING 2022

  49. RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances

    Authors: Baolin Li, Rohan Basu Roy, Tirthak Patel, Vijay Gadepally, Karen Gettings, Devesh Tiwari

    Abstract: Deep learning model inference is a key service in many businesses and scientific discovery processes. This paper introduces RIBBON, a novel deep learning inference serving system that meets two competing objectives: quality-of-service (QoS) target and cost-effectiveness. The key idea behind RIBBON is to intelligently employ a diverse set of cloud computing instances (heterogeneous instances) to me… ▽ More

    Submitted 28 July, 2022; v1 submitted 23 July, 2022; originally announced July 2022.

  50. arXiv:2207.00137  [pdf, other

    cs.LG

    Robustness of Epinets against Distributional Shifts

    Authors: Xiuyuan Lu, Ian Osband, Seyed Mohammad Asghari, Sven Gowal, Vikranth Dwaracherla, Zheng Wen, Benjamin Van Roy

    Abstract: Recent work introduced the epinet as a new approach to uncertainty modeling in deep learning. An epinet is a small neural network added to traditional neural networks, which, together, can produce predictive distributions. In particular, using an epinet can greatly improve the quality of joint predictions across multiple inputs, a measure of how well a neural network knows what it does not know. I… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.