Skip to main content

Showing 1–50 of 64 results for author: Hughes, C

  1. arXiv:2407.03685  [pdf, other

    cs.PL cs.LO

    Verifying Peephole Rewriting In SSA Compiler IRs

    Authors: Siddharth Bhat, Alex Keizer, Chris Hughes, Andrés Goens, Tobias Grosser

    Abstract: There is an increasing need for domain-specific reasoning in modern compilers. This has fueled the use of tailored intermediate representations (IRs) based on static single assignment (SSA), like in the MLIR compiler framework. Interactive theorem provers (ITPs) provide strong guarantees for the end-to-end verification of compilers (e.g., CompCert). However, modern compilers and their IRs evolve a… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: accepted at ITP 2024

  2. arXiv:2405.15583  [pdf, other

    cs.LG

    Transfer Learning with Informative Priors: Simple Baselines Better than Previously Reported

    Authors: Ethan Harvey, Mikhail Petrov, Michael C. Hughes

    Abstract: We pursue transfer learning to improve classifier accuracy on a target task with few labeled examples available for training. Recent work suggests that using a source task to learn a prior distribution over neural net weights, not just an initialization, can boost target task performance. In this study, we carefully compare transfer learning with and without source task informed priors across 5 da… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2403.10658  [pdf, other

    cs.CV cs.LG

    InterLUDE: Interactions between Labeled and Unlabeled Data to Enhance Semi-Supervised Learning

    Authors: Zhe Huang, Xiaowei Yu, Dajiang Zhu, Michael C. Hughes

    Abstract: Semi-supervised learning (SSL) seeks to enhance task performance by training on both labeled and unlabeled data. Mainstream SSL image classification methods mostly optimize a loss that additively combines a supervised classification objective with a regularization term derived solely from unlabeled data. This formulation neglects the potential for interaction between labeled and unlabeled images.… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Semi-supervised Learning; Vision Transformers

  4. arXiv:2403.06024  [pdf, other

    cs.CV cs.ET cs.LG

    Semi-Supervised Multimodal Multi-Instance Learning for Aortic Stenosis Diagnosis

    Authors: Zhe Huang, Xiaowei Yu, Benjamin S. Wessler, Michael C. Hughes

    Abstract: Automated interpretation of ultrasound imaging of the heart (echocardiograms) could improve the detection and treatment of aortic stenosis (AS), a deadly heart disease. However, existing deep learning pipelines for assessing AS from echocardiograms have two key limitations. First, most methods rely on limited 2D cineloops, thereby ignoring widely available Doppler imaging that contains important c… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Echocardiography; Multimodal; Semi-supervised Learning; Multiple-Instance Learning

  5. arXiv:2401.14973  [pdf, other

    stat.ML cs.LG

    Discovering group dynamics in synchronous time series via hierarchical recurrent switching-state models

    Authors: Michael Wojnowicz, Preetish Rath, Eric Miller, Jeffrey Miller, Clifford Hancock, Meghan O'Donovan, Seth Elkin-Frankston, Thaddeus Brunye, Michael C. Hughes

    Abstract: We seek to model a collection of time series arising from multiple entities interacting over the same time period. Recent work focused on modeling individual time series is inadequate for our intended applications, where collective system-level behavior influences the trajectories of individual entities. To address such problems, we present a new hierarchical switching-state model that can be trai… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  6. arXiv:2311.18025  [pdf, other

    cs.LG

    A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data

    Authors: Ethan Harvey, Wansu Chen, David M. Kent, Michael C. Hughes

    Abstract: Practitioners building classifiers often start with a smaller pilot dataset and plan to grow to larger data in the near future. Such projects need a toolkit for extrapolating how much classifier accuracy may improve from a 2x, 10x, or 50x increase in data size. While existing work has focused on finding a single "best-fit" curve using various functional forms like power laws, we argue that modelin… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  7. arXiv:2311.05367  [pdf, other

    cs.IT cs.GT

    Reducing Disorder: An Information-Theory Formulation of MEV

    Authors: Ciaran Hughes

    Abstract: Maximal Extractable Value (MEV) has garnered significant attention in the cryptocurrency community. Such attention is a consequence of the revenue that can be generated from MEV, as well as the risks MEV poses to the fundamental value proposition of the underlying blockchain technology. In this work, we provide an information-theoretic formulation of MEV. With this formulation, we make common stat… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 8 pages

  8. arXiv:2309.14277  [pdf, other

    cs.CV cs.LG

    SINCERE: Supervised Information Noise-Contrastive Estimation REvisited

    Authors: Patrick Feeney, Michael C. Hughes

    Abstract: The information noise-contrastive estimation (InfoNCE) loss function provides the basis of many self-supervised deep learning methods due to its strong empirical results and theoretic motivation. Previous work suggests a supervised contrastive (SupCon) loss to extend InfoNCE to learn from available class labels. This SupCon loss has been widely-used due to reports of good empirical performance. Ho… ▽ More

    Submitted 2 July, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  9. arXiv:2309.08742  [pdf, other

    cs.RO math.OC

    RoSSO: A High-Performance Python Package for Robotic Surveillance Strategy Optimization Using JAX

    Authors: Yohan John, Connor Hughes, Gilberto Diaz-Garcia, Jason R. Marden, Francesco Bullo

    Abstract: To enable the computation of effective randomized patrol routes for single- or multi-robot teams, we present RoSSO, a Python package designed for solving Markov chain optimization problems. We exploit machine-learning techniques such as reverse-mode automatic differentiation and constraint parametrization to achieve superior efficiency compared to general-purpose nonlinear programming solvers. Add… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 7 pages, 4 figures, 3 tables, submitted to the 2024 IEEE International Conference on Robotics and Automation. See https://github.com/conhugh/RoSSO for associated codebase

  10. arXiv:2308.14160  [pdf, other

    cs.CV cs.AI

    A Unified Transformer-based Network for multimodal Emotion Recognition

    Authors: Kamran Ali, Charles E. Hughes

    Abstract: The development of transformer-based models has resulted in significant advances in addressing various vision and NLP-based research challenges. However, the progress made in transformer-based methods has not been effectively applied to biosensing research. This paper presents a novel Unified Biosensor-Vision Multi-modal Transformer-based (UBVMT) method to classify emotions in an arousal-valence s… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: 12 pages

  11. arXiv:2307.08919  [pdf, other

    cs.CV cs.LG

    Systematic comparison of semi-supervised and self-supervised learning for medical image classification

    Authors: Zhe Huang, Ruijie Jiang, Shuchin Aeron, Michael C. Hughes

    Abstract: In typical medical image classification problems, labeled data is scarce while unlabeled data is more available. Semi-supervised learning and self-supervised learning are two different research directions that can improve accuracy by learning from extra unlabeled data. Recent methods from both directions have reported significant gains on traditional benchmarks. Yet past benchmarks do not focus on… ▽ More

    Submitted 29 March, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: CVPR 2024

  12. arXiv:2306.00003  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning

    Authors: Zhe Huang, Benjamin S. Wessler, Michael C. Hughes

    Abstract: Aortic stenosis (AS) is a degenerative valve condition that causes substantial morbidity and mortality. This condition is under-diagnosed and under-treated. In clinical practice, AS is diagnosed with expert review of transthoracic echocardiography, which produces dozens of ultrasound images of the heart. Only some of these views show the aortic valve. To automate screening for AS, deep networks mu… ▽ More

    Submitted 4 April, 2024; v1 submitted 25 May, 2023; originally announced June 2023.

    Comments: Echocardiogram; multiple-instance learning; self-supervised learning; semi-supervised learning; medical imaging

    Journal ref: MLHC 2023

  13. arXiv:2302.08687  [pdf, other

    cs.AR cs.AI cs.LG

    VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs

    Authors: Geonhwa Jeong, Sana Damani, Abhimanyu Rajeshkumar Bambhaniya, Eric Qin, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna

    Abstract: Deep Learning (DL) acceleration support in CPUs has recently gained a lot of traction, with several companies (Arm, Intel, IBM) announcing products with specialized matrix engines accessible via GEMM instructions. CPUs are pervasive and need to handle diverse requirements across DL workloads running in edge/HPC/cloud platforms. Therefore, as DL workloads embrace sparsity to reduce the computations… ▽ More

    Submitted 23 February, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: This paper is accepted to HPCA 2023

  14. Nonparametric and Regularized Dynamical Wasserstein Barycenters for Sequential Observations

    Authors: Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Eric L. Miller

    Abstract: We consider probabilistic models for sequential observations which exhibit gradual transitions among a finite number of states. We are particularly motivated by applications such as human activity analysis where observed accelerometer time series contains segments representing distinct activities, which we call pure states, as well as periods characterized by continuous transition among these pure… ▽ More

    Submitted 21 September, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Journal ref: IEEE Transactions on Signal Processing (2023), volume 71, pages 3164 - 3178

  15. arXiv:2208.11870  [pdf, other

    cs.LG cs.AI cs.CV

    Fix-A-Step: Semi-supervised Learning from Uncurated Unlabeled Data

    Authors: Zhe Huang, Mary-Joy Sidhom, Benjamin S. Wessler, Michael C. Hughes

    Abstract: Semi-supervised learning (SSL) promises improved accuracy compared to training classifiers on small labeled datasets by also training on many unlabeled images. In real applications like medical imaging, unlabeled data will be collected for expediency and thus uncurated: possibly different from the labeled set in classes or features. Unfortunately, modern deep SSL often makes accuracy worse when gi… ▽ More

    Submitted 25 May, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: AISTATS 2023 (Oral)

  16. arXiv:2207.12231  [pdf, other

    cs.AR

    FAT-PIM: Low-Cost Error Detection for Processing-In-Memory

    Authors: Kazi Abu Zubair, Sumit Kumar Jha, David Mohaisen, Clayton Hughes, Amro Awad

    Abstract: Processing In Memory (PIM) accelerators are promising architecture that can provide massive parallelization and high efficiency in various applications. Such architectures can instantaneously provide ultra-fast operation over extensive data, allowing real-time performance in data-intensive workloads. For instance, Resistive Memory (ReRAM) based PIM architectures are widely known for their inherent… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: This paper is currently under submission. We arXiv our paper to establish credit for inventing this work

  17. arXiv:2206.11736  [pdf, other

    cs.CV cs.AI cs.LG

    NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds

    Authors: Patrick Feeney, Sarah Schneider, Panagiotis Lymperopoulos, Li-Ping Liu, Matthias Scheutz, Michael C. Hughes

    Abstract: In order for artificial agents to successfully perform tasks in changing environments, they must be able to both detect and adapt to novelty. However, visual novelty detection research often only evaluates on repurposed datasets such as CIFAR-10 originally intended for object classification, where images focus on one distinct, well-centered object. New benchmarks are needed to represent the challe… ▽ More

    Submitted 28 March, 2023; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (03/2023)

  18. arXiv:2206.00093  [pdf, other

    stat.ML cs.LG stat.CO

    Easy Variational Inference for Categorical Models via an Independent Binary Approximation

    Authors: Michael T. Wojnowicz, Shuchin Aeron, Eric L. Miller, Michael C. Hughes

    Abstract: We pursue tractable Bayesian analysis of generalized linear models (GLMs) for categorical data. Thus far, GLMs are difficult to scale to more than a few dozen categories due to non-conjugacy or strong posterior dependencies when using conjugate auxiliary variable methods. We define a new class of GLMs for categorical data called categorical-from-binary (CB) models. Each CB model has a likelihood t… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: to appear at ICML 2022

  19. arXiv:2205.13066  [pdf, other

    cs.LG

    Semi-supervised Drifted Stream Learning with Short Lookback

    Authors: Weijieying Ren, Pengyang Wang, Xiaolin Li, Charles E. Hughes, Yanjie Fu

    Abstract: In many scenarios, 1) data streams are generated in real time; 2) labeled data are expensive and only limited labels are available in the beginning; 3) real-world data is not always i.i.d. and data drift over time gradually; 4) the storage of historical streams is limited and model updating can only be achieved based on a very short lookback window. This learning setting limits the applicability a… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: To appear in KDD 2022

  20. arXiv:2204.05959  [pdf

    cs.DC cs.PF

    "Smarter" NICs for faster molecular dynamics: a case study

    Authors: Sara Karamati, Clayton Hughes, K. Scott Hemmert, Ryan E. Grant, W. Whit Schonbein, Scott Levy, Thomas M. Conte, Jeffrey Young, Richard W. Vuduc

    Abstract: This work evaluates the benefits of using a "smart" network interface card (SmartNIC) as a compute accelerator for the example of the MiniMD molecular dynamics proxy application. The accelerator is NVIDIA's BlueField-2 card, which includes an 8-core Arm processor along with a small amount of DRAM and storage. We test the networking and data movement performance of these cards compared to a standar… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

  21. arXiv:2203.06550  [pdf, other

    cs.AI cs.LG

    Reinforced Imitative Graph Learning for Mobile User Profiling

    Authors: Dongjie Wang, Pengyang Wang, Yanjie Fu, Kunpeng Liu, Hui Xiong, Charles E. Hughes

    Abstract: Mobile user profiling refers to the efforts of extracting users' characteristics from mobile activities. In order to capture the dynamic varying of user characteristics for generating effective user profiling, we propose an imitation-based mobile user profiling framework. Considering the objective of teaching an autonomous agent to imitate user mobility based on the user's profile, the user profil… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

    Comments: TKDE Under Review

  22. FAIR Data Pipeline: provenance-driven data management for traceable scientific workflows

    Authors: Sonia Natalie Mitchell, Andrew Lahiff, Nathan Cummings, Jonathan Hollocombe, Bram Boskamp, Ryan Field, Dennis Reddyhoff, Kristian Zarebski, Antony Wilson, Bruno Viola, Martin Burke, Blair Archibald, Paul Bessell, Richard Blackwell, Lisa A Boden, Alys Brett, Sam Brett, Ruth Dundas, Jessica Enright, Alejandra N. Gonzalez-Beltran, Claire Harris, Ian Hinder, Christopher David Hughes, Martin Knight, Vino Mano , et al. (13 additional authors not shown)

    Abstract: Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily da… ▽ More

    Submitted 4 May, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

  23. arXiv:2110.06741  [pdf, other

    cs.LG stat.ML

    Dynamical Wasserstein Barycenters for Time-series Modeling

    Authors: Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Eric L. Miller

    Abstract: Many time series can be modeled as a sequence of segments representing high-level discrete states, such as running and walking in a human activity application. Flexible models should describe the system state and observations in stationary "pure-state" periods as well as transition periods between adjacent segments, such as a gradual slowdown between running and walking. However, most prior work a… ▽ More

    Submitted 31 October, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: To appear at Neurips 2021

  24. arXiv:2110.01752  [pdf, other

    cs.AR cs.AI cs.LG

    RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU

    Authors: Geonhwa Jeong, Eric Qin, Ananda Samajdar, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna

    Abstract: As AI-based applications become pervasive, CPU vendors are starting to incorporate matrix engines within the datapath to boost efficiency. Systolic arrays have been the premier architectural choice as matrix engines in offload accelerators. However, we demonstrate that incorporating them inside CPUs can introduce under-utilization and stalls due to limited register storage to amortize the fill and… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: This paper is accepted to DAC 2021

  25. arXiv:2108.00080  [pdf, other

    cs.CV cs.LG

    A New Semi-supervised Learning Benchmark for Classifying View and Diagnosing Aortic Stenosis from Echocardiograms

    Authors: Zhe Huang, Gary Long, Benjamin Wessler, Michael C. Hughes

    Abstract: Semi-supervised image classification has shown substantial progress in learning from limited labeled data, but recent advances remain largely untested for clinical applications. Motivated by the urgent need to improve timely diagnosis of life-threatening heart conditions, especially aortic stenosis, we develop a benchmark dataset to assess semi-supervised approaches to two tasks relevant to cardia… ▽ More

    Submitted 30 July, 2021; originally announced August 2021.

    Comments: To appear in the Proceedings of the Machine Learning for Healthcare (MLHC) conference, 2021. 20 pages (including 7 tables & 3 figures). 13 additional pages of references and supplementary material

  26. arXiv:2107.13379  [pdf, other

    cs.CV cs.LG

    Evaluating the Use of Reconstruction Error for Novelty Localization

    Authors: Patrick Feeney, Michael C. Hughes

    Abstract: The pixelwise reconstruction error of deep autoencoders is often utilized for image novelty detection and localization under the assumption that pixels with high error indicate which parts of the input image are unfamiliar and therefore likely to be novel. This assumed correlation between pixels with high reconstruction error and novel regions of input images has not been verified and may limit th… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  27. arXiv:2106.02206  [pdf, other

    cs.LG cs.AI stat.ML

    Stochastic Iterative Graph Matching

    Authors: Linfeng Liu, Michael C. Hughes, Soha Hassoun, Li-Ping Liu

    Abstract: Recent works leveraging Graph Neural Networks to approach graph matching tasks have shown promising results. Recent progress in learning discrete distributions poses new opportunities for learning graph matching models. In this work, we propose a new model, Stochastic Iterative Graph MAtching (SIGMA), to address the graph matching problem. Our model defines a distribution of matchings for a graph… ▽ More

    Submitted 12 September, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  28. arXiv:2105.00773  [pdf, other

    stat.AP cs.LG stat.ML

    Approximate Bayesian Computation for an Explicit-Duration Hidden Markov Model of COVID-19 Hospital Trajectories

    Authors: Gian Marco Visani, Alexandra Hope Lee, Cuong Nguyen, David M. Kent, John B. Wong, Joshua T. Cohen, Michael C. Hughes

    Abstract: We address the problem of modeling constrained hospital resources in the midst of the COVID-19 pandemic in order to inform decision-makers of future demand and assess the societal value of possible interventions. For broad applicability, we focus on the common yet challenging scenario where patient-level data for a region of interest are not available. Instead, given daily admissions counts, we mo… ▽ More

    Submitted 28 July, 2021; v1 submitted 28 April, 2021; originally announced May 2021.

    Comments: To appear in the Proceedings of the Machine Learning for Healthcare (MLHC) conference, 2021. 20 pages, 7 figures and 1 table. 26 additional pages of supplementary material

  29. Vision-based Driver Assistance Systems: Survey, Taxonomy and Advances

    Authors: Jonathan Horgan, Ciarán Hughes, John McDonald, Senthil Yogamani

    Abstract: Vision-based driver assistance systems is one of the rapidly growing research areas of ITS, due to various factors such as the increased level of safety requirements in automotive, computational power in embedded systems, and desire to get closer to autonomous driving. It is a cross disciplinary area encompassing specialised fields like computer vision, machine learning, robotic navigation, embedd… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Journal ref: 2015 IEEE 18th International Conference on Intelligent Transportation Systems

  30. Computer vision in automated parking systems: Design, implementation and challenges

    Authors: Markus Heimberger, Jonathan Horgan, Ciaran Hughes, John McDonald, Senthil Yogamani

    Abstract: Automated driving is an active area of research in both industry and academia. Automated Parking, which is automated driving in a restricted scenario of parking with low speed manoeuvring, is a key enabling product for fully autonomous driving systems. It is also an important milestone from the perspective of a higher end system built from the previous generation driver assistance systems comprisi… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Journal ref: Image and Vision Computing, Volume 68, December 2017, Pages 88-101

  31. arXiv:2104.09327  [pdf, other

    stat.ML cs.LG

    Forecasting COVID-19 Counts At A Single Hospital: A Hierarchical Bayesian Approach

    Authors: Alexandra Hope Lee, Panagiotis Lymperopoulos, Joshua T. Cohen, John B. Wong, Michael C. Hughes

    Abstract: We consider the problem of forecasting the daily number of hospitalized COVID-19 patients at a single hospital site, in order to help administrators with logistics and planning. We develop several candidate hierarchical Bayesian models which directly capture the count nature of data via a generalized Poisson likelihood, model time-series dependencies via autoregressive and Gaussian process latent… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: In ICLR 2021 Workshop on Machine Learning for Preventing and Combating Pandemics

  32. arXiv:2103.15966  [pdf, other

    cs.LG stat.ML

    Modeling Graph Node Correlations with Neighbor Mixture Models

    Authors: Linfeng Liu, Michael C. Hughes, Li-Ping Liu

    Abstract: We propose a new model, the Neighbor Mixture Model (NMM), for modeling node labels in a graph. This model aims to capture correlations between the labels of nodes in a local neighborhood. We carefully design the model so it could be an alternative to a Markov Random Field but with more affordable computations. In particular, drawing samples and evaluating marginal probabilities of single labels ca… ▽ More

    Submitted 18 April, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

  33. arXiv:2101.02634  [pdf, other

    cs.AI

    Reinforced Imitative Graph Representation Learning for Mobile User Profiling: An Adversarial Training Perspective

    Authors: Dongjie Wang, Pengyang Wang, Kunpeng Liu, Yuanchun Zhou, Charles Hughes, Yanjie Fu

    Abstract: In this paper, we study the problem of mobile user profiling, which is a critical component for quantifying users' characteristics in the human mobility modeling pipeline. Human mobility is a sequential decision-making process dependent on the users' dynamic interests. With accurate user profiles, the predictive model can perfectly reproduce users' mobility trajectories. In the reverse direction,… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: AAAI 2021

  34. arXiv:2012.06718  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Consistent Deep Generative Models from Sparse Data via Prediction Constraints

    Authors: Gabriel Hope, Madina Abdrakhmanova, Xiaoyin Chen, Michael C. Hughes, Michael C. Hughes, Erik B. Sudderth

    Abstract: We develop a new framework for learning variational autoencoders and other deep generative models that balances generative and discriminative goals. Our framework optimizes model parameters to maximize a variational lower bound on the likelihood of observed data, subject to a task-specific prediction constraint that prevents model misspecification from leading to inaccurate predictions. We further… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

  35. arXiv:2008.00171  [pdf, other

    cs.AR

    DeACT: Architecture-Aware Virtual Memory Support for Fabric Attached Memory Systems

    Authors: Vamsee Reddy Kommareddy, Clayton Hughes, Simon David Hammond, Amro Awad

    Abstract: The exponential growth of data has driven technology providers to develop new protocols, such as cache coherent interconnects and memory semantic fabrics, to help users and facilities leverage advances in memory technologies to satisfy these growing memory and storage demands. Using these new protocols, fabric-attached memories (FAM) can be directly attached to a system interconnect and be easily… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

  36. arXiv:2005.03373  [pdf

    cs.MM

    Accessibility in 360-degree video players

    Authors: Chris Hughes, Mario Montagud

    Abstract: Any media experience must be fully inclusive and accessible to all users regardless of their ability. With the current trend towards immersive experiences, such as Virtual Reality (VR) and 360-degree video, it becomes key that these environments are adapted to be fully accessible. However, until recently the focus has been mostly on adapting the existing techniques to fit immersive displays, rathe… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: 25 pages, journal paper

  37. arXiv:2005.00499  [pdf, other

    cs.CV

    An Efficient Integration of Disentangled Attended Expression and Identity FeaturesFor Facial Expression Transfer andSynthesis

    Authors: Kamran Ali, Charles E. Hughes

    Abstract: In this paper, we present an Attention-based Identity Preserving Generative Adversarial Network (AIP-GAN) to overcome the identity leakage problem from a source image to a generated face image, an issue that is encountered in a cross-subject facial expression transfer and synthesis process. Our key insight is that the identity preserving network should be able to disentangle and compose shape, app… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: 10 Pages, excluding references

  38. arXiv:2003.03262  [pdf, other

    cs.CV

    Spherical formulation of moving object geometric constraints for monocular fisheye cameras

    Authors: Letizia Mariotti, Ciaran Hughes

    Abstract: In this paper, we introduce a moving object detection algorithm for fisheye cameras used in autonomous driving. We reformulate the three commonly used constraints in rectilinear images (epipolar, positive depth and positive height constraints) to spherical coordinates which is invariant to specific camera configuration once the calibration is known. One of the main challenging use case in autonomo… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Comments: 8 pages, 9 figures, 2 tables Conference ITSC 2019

    Journal ref: 2019 IEEE Intelligent Transportation Systems Conference (ITSC), Auckland, New Zealand, 2019, pp. 816-823

  39. arXiv:2002.07327  [pdf

    q-bio.CB cs.LG

    Enzyme promiscuity prediction using hierarchy-informed multi-label classification

    Authors: Gian Marco Visani, Michael C. Hughes, Soha Hassoun

    Abstract: As experimental efforts are costly and time consuming, computational characterization of enzyme capabilities is an attractive alternative. We present and evaluate several machine-learning models to predict which of 983 distinct enzymes, as defined via the Enzyme Commission, EC, numbers, are likely to interact with a given query molecule. Our data consists of enzyme-substrate interactions from the… ▽ More

    Submitted 25 January, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: Presented as a poster at the 2019 Machine Learning for Computational Biology Symposium, Vancouver, CA Accepted for publication, Bioinformatics, Jan 22, 2021

  40. arXiv:2001.04032  [pdf, other

    stat.ML cs.LG

    POPCORN: Partially Observed Prediction COnstrained ReiNforcement Learning

    Authors: Joseph Futoma, Michael C. Hughes, Finale Doshi-Velez

    Abstract: Many medical decision-making tasks can be framed as partially observed Markov decision processes (POMDPs). However, prevailing two-stage approaches that first learn a POMDP and then solve it often fail because the model that best fits the data may not be well suited for planning. We introduce a new optimization objective that (a) produces both high-performing policies and high-quality generative m… ▽ More

    Submitted 31 March, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

    Comments: Accepted to AISTATS 2020, Palermo, Italy

  41. arXiv:1912.11066  [pdf, other

    cs.CV cs.RO

    FisheyeMultiNet: Real-time Multi-task Learning Architecture for Surround-view Automated Parking System

    Authors: Pullarao Maddu, Wayne Doherty, Ganesh Sistu, Isabelle Leang, Michal Uricar, Sumanth Chennupati, Hazem Rashed, Jonathan Horgan, Ciaran Hughes, Senthil Yogamani

    Abstract: Automated Parking is a low speed manoeuvring scenario which is quite unstructured and complex, requiring full 360° near-field sensing around the vehicle. In this paper, we discuss the design and implementation of an automated parking system from the perspective of camera based deep learning algorithms. We provide a holistic overview of an industrial system covering the embedded system, use cases a… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: Accepted for publication at Irish Machine Vision and Image Processing (IMVIP) 2019

  42. arXiv:1912.01456  [pdf, other

    cs.CV

    Facial Expression Representation Learning by Synthesizing Expression Images

    Authors: Kamran Ali, Charles E. Hughes

    Abstract: Representations used for Facial Expression Recognition (FER) usually contain expression information along with identity features. In this paper, we propose a novel Disentangled Expression learning-Generative Adversarial Network (DE-GAN) which combines the concept of disentangled representation learning with residue learning to explicitly disentangle facial expression representation from identity i… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

    Comments: 7 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:1909.13135

  43. arXiv:1911.10175  [pdf, other

    cs.LG cs.DC stat.ML

    SparseTrain:Leveraging Dynamic Sparsity in Training DNNs on General-Purpose SIMD Processors

    Authors: Zhangxiaowen Gong, Houxiang Ji, Christopher Fletcher, Christopher Hughes, Josep Torrellas

    Abstract: Our community has greatly improved the efficiency of deep learning applications, including by exploiting sparsity in inputs. Most of that work, though, is for inference, where weight sparsity is known statically, and/or for specialized hardware. We propose a scheme to leverage dynamic sparsity during training. In particular, we exploit zeros introduced by the ReLU activation function to both featu… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

  44. arXiv:1911.07050  [pdf, other

    cs.CV

    All-In-One: Facial Expression Transfer, Editing and Recognition Using A Single Network

    Authors: Kamran Ali, Charles E. Hughes

    Abstract: In this paper, we present a unified architecture known as Transfer-Editing and Recognition Generative Adversarial Network (TER-GAN) which can be used: 1. to transfer facial expressions from one identity to another identity, known as Facial Expression Transfer (FET), 2. to transform the expression of a given image to a target expression, while preserving the identity of the image, known as Facial E… ▽ More

    Submitted 16 November, 2019; originally announced November 2019.

    Comments: 10 pages, 5 figures

  45. arXiv:1911.01325  [pdf, other

    eess.SP cs.LG

    Optimal Transport Based Change Point Detection and Time Series Segment Clustering

    Authors: Kevin C. Cheng, Shuchin Aeron, Michael C. Hughes, Erika Hussey, Eric L. Miller

    Abstract: Two common problems in time series analysis are the decomposition of the data stream into disjoint segments that are each in some sense "homogeneous" - a problem known as Change Point Detection (CPD) - and the grouping of similar nonadjacent segments, a problem that we call Time Series Segment Clustering (TSSC). Building upon recent theoretical advances characterizing the limiting distribution-fre… ▽ More

    Submitted 20 February, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

  46. arXiv:1910.05595  [pdf, other

    cs.CV

    Facial Expression Recognition Using Human to Animated-Character Expression Translation

    Authors: Kamran Ali, Ilkin Isler, Charles Hughes

    Abstract: Facial expression recognition is a challenging task due to two major problems: the presence of inter-subject variations in facial expression recognition dataset and impure expressions posed by human subjects. In this paper we present a novel Human-to-Animation conditional Generative Adversarial Network (HA-GAN) to overcome these two problems by using many (human faces) to one (animated face) mappi… ▽ More

    Submitted 12 October, 2019; originally announced October 2019.

    Comments: 8 Pages

  47. arXiv:1909.13135  [pdf, other

    cs.CV

    Facial Expression Recognition Using Disentangled Adversarial Learning

    Authors: Kamran Ali, Charles E. Hughes

    Abstract: The representation used for Facial Expression Recognition (FER) usually contain expression information along with other variations such as identity and illumination. In this paper, we propose a novel Disentangled Expression learning-Generative Adversarial Network (DE-GAN) to explicitly disentangle facial expression representation from identity information. In this learning by reconstruction method… ▽ More

    Submitted 28 September, 2019; originally announced September 2019.

  48. arXiv:1908.05254  [pdf, other

    cs.LG stat.ML

    Optimizing for Interpretability in Deep Neural Networks with Tree Regularization

    Authors: Mike Wu, Sonali Parbhoo, Michael C. Hughes, Volker Roth, Finale Doshi-Velez

    Abstract: Deep models have advanced prediction in many domains, but their lack of interpretability remains a key barrier to the adoption in many real world applications. There exists a large body of work aiming to help humans understand these black box functions to varying levels of granularity -- for example, through distillation, gradients, or adversarial examples. These methods however, all tackle interp… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1908.04494, arXiv:1711.06178

  49. arXiv:1908.00690  [pdf, other

    cs.LG stat.ML

    Feature Robustness in Non-stationary Health Records: Caveats to Deployable Model Performance in Common Clinical Machine Learning Tasks

    Authors: Bret Nestor, Matthew B. A. McDermott, Willie Boag, Gabriela Berner, Tristan Naumann, Michael C. Hughes, Anna Goldenberg, Marzyeh Ghassemi

    Abstract: When training clinical prediction models from electronic health records (EHRs), a key concern should be a model's ability to sustain performance over time when deployed, even as care practices, database systems, and population demographics evolve. Due to de-identification requirements, however, current experimental practices for public EHR benchmarks (such as the MIMIC-III critical care dataset) a… ▽ More

    Submitted 1 August, 2019; originally announced August 2019.

  50. MIMIC-Extract: A Data Extraction, Preprocessing, and Representation Pipeline for MIMIC-III

    Authors: Shirly Wang, Matthew B. A. McDermott, Geeticka Chauhan, Michael C. Hughes, Tristan Naumann, Marzyeh Ghassemi

    Abstract: Robust machine learning relies on access to data that can be used with standardized frameworks in important tasks and the ability to develop models whose performance can be reasonably reproduced. In machine learning for healthcare, the community faces reproducibility challenges due to a lack of publicly accessible data and a lack of standardized data processing frameworks. We present MIMIC-Extract… ▽ More

    Submitted 19 August, 2020; v1 submitted 18 July, 2019; originally announced July 2019.