Skip to main content

Showing 1–32 of 32 results for author: Ho, A

  1. arXiv:2407.00071  [pdf, other

    cs.AI cs.CL cs.ET cs.LG

    Combinatorial Reasoning: Selecting Reasons in Generative AI Pipelines via Combinatorial Optimization

    Authors: Mert Esencan, Tarun Advaith Kumar, Ata Akbari Asanjan, P. Aaron Lott, Masoud Mohseni, Can Unlu, Davide Venturelli, Alan Ho

    Abstract: Recent Large Language Models (LLMs) have demonstrated impressive capabilities at tasks that require human intelligence and are a significant step towards human-like artificial intelligence (AI). Yet the performance of LLMs at reasoning tasks have been subpar and the reasoning capability of LLMs is a matter of significant debate. While it has been shown that the choice of the prompting technique to… ▽ More

    Submitted 19 June, 2024; originally announced July 2024.

    Comments: 13 pages, 3 figures

  2. arXiv:2403.11863  [pdf, other

    eess.SY cs.RO

    Context-aware LLM-based Safe Control Against Latent Risks

    Authors: Quan Khanh Luu, Xiyu Deng, Anh Van Ho, Yorie Nakahira

    Abstract: It is challenging for autonomous control systems to perform complex tasks in the presence of latent risks. Motivated by this challenge, this paper proposes an integrated framework that involves Large Language Models (LLMs), stochastic gradient descent (SGD), and optimization-based control. In the first phrase, the proposed framework breaks down complex tasks into a sequence of smaller subtasks, wh… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  3. arXiv:2403.05812  [pdf, other

    cs.CL cs.AI

    Algorithmic progress in language models

    Authors: Anson Ho, Tamay Besiroglu, Ege Erdil, David Owen, Robi Rahman, Zifan Carl Guo, David Atkinson, Neil Thompson, Jaime Sevilla

    Abstract: We investigate the rate at which algorithms for pre-training language models have improved since the advent of deep learning. Using a dataset of over 200 language model evaluations on Wikitext and Penn Treebank spanning 2012-2023, we find that the compute required to reach a set performance threshold has halved approximately every 8 months, with a 95% confidence interval of around 5 to 14 months,… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  4. ROSE: Rotation-based Squeezing Robotic Gripper toward Universal Handling of Objects

    Authors: Son Tien Bui, Shinya Kawano, Van Anh Ho

    Abstract: Robotics hand/grippers nowadays are not limited to manufacturing lines; instead, they are widely utilized in cluttered environments, such as restaurants, farms, and warehouses. In such scenarios, they need to deal with high uncertainty of the grasped objects' shapes, postures, surfaces, and material properties, which requires complex integration of sensing and decision-making process. On the other… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: 9 pages, 9 figures, RSS2023 conference

    Journal ref: Robotics: Science and System 2023

  5. arXiv:2402.00760  [pdf, other

    physics.plasm-ph cs.LG

    EuroPED-NN: Uncertainty aware surrogate model

    Authors: A. Panera Alvarez, A. Ho, A. Jarvinen, S. Saarelma, S. Wiesen, JET Contributors

    Abstract: This work successfully generates uncertainty aware surrogate models, via the Bayesian neural network with noise contrastive prior (BNN-NCP) technique, of the EuroPED plasma pedestal model using data from the JET-ILW pedestal database and subsequent model evaluations. All this conform EuroPED-NN. The BNN-NCP technique is proven to be a good fit for uncertainty aware surrogate models, matching the o… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  6. arXiv:2312.11671  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Language-Model Agents on Realistic Autonomous Tasks

    Authors: Megan Kinniment, Lucas Jun Koba Sato, Haoxing Du, Brian Goodrich, Max Hasin, Lawrence Chan, Luke Harold Miles, Tao R. Lin, Hjalmar Wijk, Joel Burget, Aaron Ho, Elizabeth Barnes, Paul Christiano

    Abstract: In this report, we explore the ability of language model agents to acquire resources, create copies of themselves, and adapt to novel challenges they encounter in the wild. We refer to this cluster of capabilities as "autonomous replication and adaptation" or ARA. We believe that systems capable of ARA could have wide-reaching and hard-to-anticipate consequences, and that measuring and forecasting… ▽ More

    Submitted 4 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 14 pages

  7. arXiv:2312.08595  [pdf, other

    cs.ET

    Limits to the Energy Efficiency of CMOS Microprocessors

    Authors: Anson Ho, Ege Erdil, Tamay Besiroglu

    Abstract: CMOS microprocessors have achieved massive energy efficiency gains but may reach limits soon. This paper presents an approach to estimating the limits on the maximum floating point operations per Joule (FLOP/J) for CMOS microprocessors. We analyze the three primary sources of energy dissipation: transistor switching, interconnect capacitances and leakage power. Using first-principles calculations… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  8. arXiv:2309.01656  [pdf, other

    cs.CV

    Building Footprint Extraction in Dense Areas using Super Resolution and Frame Field Learning

    Authors: Vuong Nguyen, Anh Ho, Duc-Anh Vu, Nguyen Thi Ngoc Anh, Tran Ngoc Thang

    Abstract: Despite notable results on standard aerial datasets, current state-of-the-arts fail to produce accurate building footprints in dense areas due to challenging properties posed by these areas and limited data availability. In this paper, we propose a framework to address such issues in polygonal building extraction. First, super resolution is employed to enhance the spatial resolution of aerial imag… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted at The 12th International Conference on Awareness Science and Technology

  9. arXiv:2306.01382  [pdf, other

    cs.CL

    Leveraging Auxiliary Domain Parallel Data in Intermediate Task Fine-tuning for Low-resource Translation

    Authors: Shravan Nayak, Surangika Ranathunga, Sarubi Thillainathan, Rikki Hung, Anthony Rinaldi, Yining Wang, Jonah Mackey, Andrew Ho, En-Shiun Annie Lee

    Abstract: NMT systems trained on Pre-trained Multilingual Sequence-Sequence (PMSS) models flounder when sufficient amounts of parallel data is not available for fine-tuning. This specifically holds for languages missing/under-represented in these models. The problem gets aggravated when the data comes from different domains. In this paper, we show that intermediate-task fine-tuning (ITFT) of PMSS models is… ▽ More

    Submitted 23 September, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted for poster presentation at the Practical Machine Learning for Developing Countries (PML4DC) workshop, ICLR 2023

  10. arXiv:2211.04325  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.CY

    Will we run out of data? Limits of LLM scaling based on human-generated data

    Authors: Pablo Villalobos, Anson Ho, Jaime Sevilla, Tamay Besiroglu, Lennart Heim, Marius Hobbhahn

    Abstract: We investigate the potential constraints on LLM scaling posed by the availability of public human-generated text data. We forecast the growing demand for training data based on current trends and estimate the total stock of public human text data. Our findings indicate that if current LLM development trends continue, models will be trained on datasets roughly equal in size to the available stock o… ▽ More

    Submitted 4 June, 2024; v1 submitted 25 October, 2022; originally announced November 2022.

  11. arXiv:2211.03253  [pdf, other

    cs.RO

    Soft Robotic Link with Controllable Transparency for Vision-based Tactile and Proximity Sensing

    Authors: Quan Khanh Luu, Dinh Quang Nguyen, Nhan Huu Nguyen, Van Anh Ho

    Abstract: Robots have been brought to work close to humans in many scenarios. For coexistence and collaboration, robots should be safe and pleasant for humans to interact with. To this end, the robots could be both physically soft with multimodal sensing/perception, so that the robots could have better awareness of the surrounding environment, as well as to respond properly to humans' action/intention. This… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

    Comments: Submitted to RoboSoft 2023 for review. Final content subjected to change

  12. arXiv:2207.13243  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks

    Authors: Tilman Räuker, Anson Ho, Stephen Casper, Dylan Hadfield-Menell

    Abstract: The last decade of machine learning has seen drastic increases in scale and capabilities. Deep neural networks (DNNs) are increasingly being deployed in the real world. However, they are difficult to analyze, raising concerns about using them without a rigorous understanding of how they function. Effective tools for interpreting them will be important for building more trustworthy AI by helping to… ▽ More

    Submitted 18 August, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

  13. arXiv:2207.02852  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    Machine Learning Model Sizes and the Parameter Gap

    Authors: Pablo Villalobos, Jaime Sevilla, Tamay Besiroglu, Lennart Heim, Anson Ho, Marius Hobbhahn

    Abstract: We study trends in model size of notable machine learning systems over time using a curated dataset. From 1950 to 2018, model size in language models increased steadily by seven orders of magnitude. The trend then accelerated, with model size increasing by another five orders of magnitude in just 4 years from 2018 to 2022. Vision models grew at a more constant pace, totaling 7 orders of magnitude… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  14. arXiv:2204.12928  [pdf

    q-fin.ST cs.AI cs.CE math.NA

    Causal Analysis of Generic Time Series Data Applied for Market Prediction

    Authors: Anton Kolonin, Ali Raheman, Mukul Vishwas, Ikram Ansari, Juan Pinzon, Alice Ho

    Abstract: We explore the applicability of the causal analysis based on temporally shifted (lagged) Pearson correlation applied to diverse time series of different natures in context of the problem of financial market prediction. Theoretical discussion is followed by description of the practical approach for specific environment of time series data with diverse nature and sparsity, as applied for environment… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: 10 pages, 4 figures, submitted to Artificial General Intelligence 2022 conference

  15. arXiv:2202.07177  [pdf, other

    cs.RO

    Tombo Propeller: Bio-Inspired Deformable Structure toward Collision-Accommodated Control for Drones

    Authors: Son Tien Bui, Quan Khanh Luu, Dinh Quang Nguyen, Nhat Dinh Minh Le, Giuseppe Loianno, Van Anh Ho

    Abstract: There is a growing need for vertical take-off and landing vehicles, including drones, which are safe to use and can adapt to collisions. The risks of damage by collision, to humans, obstacles in the environment, and drones themselves, are significant. This has prompted a search into nature for a highly resilient structure that can inform a design of propellers to reduce those risks and enhance saf… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  16. Compute Trends Across Three Eras of Machine Learning

    Authors: Jaime Sevilla, Lennart Heim, Anson Ho, Tamay Besiroglu, Marius Hobbhahn, Pablo Villalobos

    Abstract: Compute, data, and algorithmic advances are the three fundamental factors that guide the progress of modern Machine Learning (ML). In this paper we study trends in the most readily quantified factor - compute. We show that before 2010 training compute grew in line with Moore's law, doubling roughly every 20 months. Since the advent of Deep Learning in the early 2010s, the scaling of training compu… ▽ More

    Submitted 9 March, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Journal ref: 2022 International Joint Conference on Neural Networks (IJCNN), Padua, Italy, 2022, pp. 1-8

  17. arXiv:2111.13628  [pdf, other

    cond-mat.dis-nn cs.LG quant-ph

    Nonequilibrium Monte Carlo for unfreezing variables in hard combinatorial optimization

    Authors: Masoud Mohseni, Daniel Eppens, Johan Strumpfer, Raffaele Marino, Vasil Denchev, Alan K. Ho, Sergei V. Isakov, Sergio Boixo, Federico Ricci-Tersenghi, Hartmut Neven

    Abstract: Optimizing highly complex cost/energy functions over discrete variables is at the heart of many open problems across different scientific disciplines and industries. A major obstacle is the emergence of many-body effects among certain subsets of variables in hard instances leading to critical slowing down or collective freezing for known stochastic local search strategies. An exponential computati… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: 28 pages, 18 figures

  18. arXiv:2111.12559  [pdf, ps, other

    physics.comp-ph cs.DB physics.plasm-ph

    Two step clustering for data reduction combining DBSCAN and k-means clustering

    Authors: Bart J. J. Kremers, Aaron Ho, Jonathan Citrin, Karel L. van de Plassche

    Abstract: A novel combination of two widely-used clustering algorithms is proposed here for the detection and reduction of high data density regions. The Density Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm is used for the detection of high data density regions and the k-means algorithm for reduction. The proposed algorithm iterates while successively decrementing the DBSCAN search… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    ACM Class: I.0

  19. arXiv:2111.09649  [pdf

    cs.CE

    HRnV-Calc: A software package for heart rate n-variability and heart rate variability analysis

    Authors: Chenglin Niu, Dagang Guo, Marcus Eng Hock Ong, Zhi Xiong Koh, Andrew Fu Wah Ho, Zhiping Lin, Chengyu Liu, Gari D. Clifford, Nan Liu

    Abstract: Objective: Heart rate variability (HRV) has been proven to be an important indicator of physiological status for numerous applications. Despite the progress and active developments made in HRV metric research over the last few decades, the representation of the heartbeat sequence upon which HRV is based has received relatively little attention. The recently introduced heart rate n-variability (HRn… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

  20. arXiv:2110.05428  [pdf, other

    stat.ML cs.AI cs.LG

    Learning Temporally Causal Latent Processes from General Temporal Data

    Authors: Weiran Yao, Yuewen Sun, Alex Ho, Changyin Sun, Kun Zhang

    Abstract: Our goal is to recover time-delayed latent causal variables and identify their relations from measured temporal data. Estimating causally-related latent variables from observations is particularly challenging as the latent variables are not uniquely recoverable in the most general case. In this work, we consider both a nonparametric, nonstationary setting and a parametric setting for the latent pr… ▽ More

    Submitted 8 February, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: ICLR 2022: https://openreview.net/forum?id=RDlLMjLJXdq

  21. arXiv:2103.11522  [pdf, other

    cs.RO

    Multi-directional Bicycle Robot for Steel Structure Inspection

    Authors: Son Thanh Nguyen, Hai Nguyen, Son Tien Bui, Van Anh Ho, Hung Manh La

    Abstract: This paper presents a novel design of a multi-directional bicycle robot, which targets inspecting general ferromagnetic structures including complex-shaped structures. The locomotion concept is based on arranging two magnetic wheels in a bicycle-like configuration with two independent steering actuators. This configuration allows the robot to possess multi-directional mobility. An additional free… ▽ More

    Submitted 27 March, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: Under review at IROS 2021

  22. BPActuators: Lightweight and Low-Cost Soft Actuators by Balloons and Plastics

    Authors: Qiukai Qi, Shogo Yoshida, Genki Kakihana, Takuma Torii, Van Anh Ho, Haoran Xie

    Abstract: To increase the awareness and impact, soft robotics needs to go beyond the lab environment and should be readily accessible to those even with no robotic expertise. However, most prevailing manufacturing methodologies require either professional equipment or materials that are not usually available to common people, thereby constraining the accessibility of soft robotics. In this communication, we… ▽ More

    Submitted 4 March, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: Accepted to the 4th IEEE International Conference on Soft Robotics (RoboSoft), IEEE copyright

  23. arXiv:2009.13117  [pdf, other

    cs.CL cs.LG

    Generative latent neural models for automatic word alignment

    Authors: Anh Khoa Ngo Ho, François Yvon

    Abstract: Word alignments identify translational correspondences between words in a parallel sentence pair and are used, for instance, to learn bilingual dictionaries, to train statistical machine translation systems or to perform quality estimation. Variational autoencoders have been recently used in various of natural language processing to learn in an unsupervised way latent representations that are usef… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

    Journal ref: The Association for Machine Translation in the Americas, Oct 2020, Florida, United States

  24. arXiv:2009.13116  [pdf, other

    cs.CL cs.LG

    Neural Baselines for Word Alignment

    Authors: Anh Khoa Ngo Ho, François Yvon

    Abstract: Word alignments identify translational correspondences between words in a parallel sentence pair and is used, for instance, to learn bilingual dictionaries, to train statistical machine translation systems , or to perform quality estimation. In most areas of natural language processing, neural network models nowadays constitute the preferred approach, a situation that might also apply to word alig… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: The 16th International Workshop on Spoken Language Translation, Nov 2019, Hong Kong, Hong Kong SAR China

  25. arXiv:2003.02989  [pdf, other

    quant-ph cond-mat.dis-nn cs.LG cs.PL

    TensorFlow Quantum: A Software Framework for Quantum Machine Learning

    Authors: Michael Broughton, Guillaume Verdon, Trevor McCourt, Antonio J. Martinez, Jae Hyeon Yoo, Sergei V. Isakov, Philip Massey, Ramin Halavati, Murphy Yuezhen Niu, Alexander Zlokapa, Evan Peters, Owen Lockwood, Andrea Skolik, Sofiene Jerbi, Vedran Dunjko, Martin Leib, Michael Streif, David Von Dollen, Hongxiang Chen, Shuxiang Cao, Roeland Wiersema, Hsin-Yuan Huang, Jarrod R. McClean, Ryan Babbush, Sergio Boixo , et al. (4 additional authors not shown)

    Abstract: We introduce TensorFlow Quantum (TFQ), an open source library for the rapid prototyping of hybrid quantum-classical models for classical or quantum data. This framework offers high-level abstractions for the design and training of both discriminative and generative quantum models under TensorFlow and supports high-performance quantum circuit simulators. We provide an overview of the software archi… ▽ More

    Submitted 26 August, 2021; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: 56 pages, 34 figures, many updates throughout the manuscript, several new sections are added

  26. arXiv:1911.09339  [pdf, other

    cs.CL

    Emotion Recognition for Vietnamese Social Media Text

    Authors: Vong Anh Ho, Duong Huynh-Cong Nguyen, Danh Hoang Nguyen, Linh Thi-Van Pham, Duc-Vu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Emotion recognition or emotion prediction is a higher approach or a special case of sentiment analysis. In this task, the result is not produced in terms of either polarity: positive or negative or in the form of rating (from 1 to 5) but of a more detailed level of analysis in which the results are depicted in more expressions like sadness, enjoyment, anger, disgust, fear, and surprise. Emotion re… ▽ More

    Submitted 26 January, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: PACLING 2019

    Journal ref: In Proceeding of PACLING 2019

  27. arXiv:1905.00444  [pdf, other

    quant-ph cs.CC physics.comp-ph

    Establishing the Quantum Supremacy Frontier with a 281 Pflop/s Simulation

    Authors: Benjamin Villalonga, Dmitry Lyakh, Sergio Boixo, Hartmut Neven, Travis S. Humble, Rupak Biswas, Eleanor G. Rieffel, Alan Ho, Salvatore Mandrà

    Abstract: Noisy Intermediate-Scale Quantum (NISQ) computers are entering an era in which they can perform computational tasks beyond the capabilities of the most powerful classical computers, thereby achieving "Quantum Supremacy", a major milestone in quantum computing. NISQ Supremacy requires comparison with a state-of-the-art classical simulator. We report HPC simulations of hard random quantum circuits (… ▽ More

    Submitted 6 May, 2020; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: The paper has been published in Quantum Science and Technology

    Journal ref: Quantum Science and Technology 5, 3 (2020)

  28. Lost in the Digital Wild: Hiding Information in Digital Activities

    Authors: Shujun Li, Anthony T. S. Ho, Zichi Wang, Xinpeng Zhang

    Abstract: This paper presents a new general framework of information hiding, in which the hidden information is embedded into a collection of activities conducted by selected human and computer entities (e.g., a number of online accounts of one or more online social networks) in a selected digital world. Different from other traditional schemes, where the hidden information is embedded into one or more sele… ▽ More

    Submitted 8 September, 2018; originally announced September 2018.

    Comments: 11 pages, 4 figures, accepted to MPS 2018 (2nd International Workshop on Multimedia Privacy and Security)

  29. arXiv:1803.06089  [pdf, other

    cs.DB cs.DC

    Distributed Caching for Complex Querying of Raw Arrays

    Authors: Weijie Zhao, Florin Rusu, Bin Dong, Kesheng Wu, Anna Y. Q. Ho, Peter Nugent

    Abstract: As applications continue to generate multi-dimensional data at exponentially increasing rates, fast analytics to extract meaningful results is becoming extremely important. The database community has developed array databases that alleviate this problem through a series of techniques. In-situ mechanisms provide direct access to raw data in the original format---without loading and partitioning. Pa… ▽ More

    Submitted 16 March, 2018; originally announced March 2018.

  30. arXiv:1609.04214  [pdf, ps, other

    cs.CR cs.AI cs.NI

    "Flow Size Difference" Can Make a Difference: Detecting Malicious TCP Network Flows Based on Benford's Law

    Authors: Aamo Iorliam, Santosh Tirunagari, Anthony T. S. Ho, Shujun Li, Adrian Waller, Norman Poh

    Abstract: Statistical characteristics of network traffic have attracted a significant amount of research for automated network intrusion detection, some of which looked at applications of natural statistical laws such as Zipf's law, Benford's law and the Pareto distribution. In this paper, we present the application of Benford's law to a new network flow metric "flow size difference", which have not been st… ▽ More

    Submitted 20 January, 2017; v1 submitted 14 September, 2016; originally announced September 2016.

    Comments: 13 pages, 3 figures

    ACM Class: C.2; K.6.5

  31. arXiv:1508.05699  [pdf

    cs.CY cs.SI

    Detecting and Preventing "Multiple-Account" Cheating in Massive Open Online Courses

    Authors: Curtis G. Northcutt, Andrew D. Ho, Isaac L. Chuang

    Abstract: We describe a cheating strategy enabled by the features of massive open online courses (MOOCs) and detectable by virtue of the sophisticated data systems that MOOCs provide. The strategy, Copying Answers using Multiple Existences Online (CAMEO), involves a user who gathers solutions to assessment questions using a "harvester" account and then submits correct answers using a separate "master" accou… ▽ More

    Submitted 8 September, 2015; v1 submitted 24 August, 2015; originally announced August 2015.

  32. arXiv:1506.00243  [pdf, other

    cs.MM cs.CR cs.PF

    OR-Benchmark: An Open and Reconfigurable Digital Watermarking Benchmarking Framework

    Authors: Hui Wang, Anthony TS Ho, Shujun Li

    Abstract: Benchmarking digital watermarking algorithms is not an easy task because different applications of digital watermarking often have very different sets of requirements and trade-offs between conflicting requirements. While there have been some general-purpose digital watermarking benchmarking systems available, they normally do not support complicated benchmarking tasks and cannot be easily reconfi… ▽ More

    Submitted 5 June, 2015; v1 submitted 31 May, 2015; originally announced June 2015.