Showing 1–50 of 292 results for author: Banerjee, A

  1. arXiv:2407.06727

    eess.IV cs.CV

    Towards Physics-informed Cyclic Adversarial Multi-PSF Lensless Imaging

    Authors: Abeer Banerjee, Sanjay Singh

    Abstract: Lensless imaging has emerged as a promising field within inverse imaging, offering compact, cost-effective solutions with the potential to revolutionize the computational camera market. By circumventing traditional optical components like lenses and mirrors, novel approaches like mask-based lensless imaging eliminate the need for conventional hardware. However, advancements in lensless image recon…

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.02968

    cs.CV cs.AI cs.CC cs.ET

    Unified Anomaly Detection methods on Edge Device using Knowledge Distillation and Quantization

    Authors: Sushovan Jena, Arya Pulkit, Kajal Singh, Anoushka Banerjee, Sharad Joshi, Ananth Ganesh, Dinesh Singh, Arnav Bhavsar

    Abstract: With the rapid advances in deep learning and smart manufacturing in Industry 4.0, there is an imperative for high-throughput, high-performance, and fully integrated visual inspection systems. Most anomaly detection approaches using defect detection datasets, such as MVTec AD, employ one-class models that require fitting separate models for each class. On the contrary, unified models eliminate the…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 20 pages

    MSC Class: 68T07 ACM Class: I.2.10

  3. arXiv:2407.00035

    cs.DC

    Achieving Observability on Fog Computing with the use of open-source tools

    Authors: Breno Costa, Abhik Banerjee, Prem Prakash Jayaraman, Leonardo R. Carvalho, João Bachiega Jr., Aleteia Araujo

    Abstract: Fog computing can provide computational resources and low-latency communication at the network edge. But with it comes uncertainties that must be managed in order to guarantee Service Level Agreements. Service observability can help the environment better deal with uncertainties, delivering relevant and up-to-date information in a timely manner to support decision making. Observability is consider…

    Submitted 25 May, 2024; originally announced July 2024.

    Comments: Paper presented at Mobiquitous 2023

  4. arXiv:2407.00013

    cs.DC cs.NI

    A Hybrid Approach to Monitor Context Parameters for Optimising Caching for Context-Aware IoT Applications

    Authors: Ashish Manchanda, Prem Prakash Jayaraman, Abhik Banerjee, Arkady Zaslavsky, Shakthi Weerasinghe, Guang-Li Huang

    Abstract: Internet of Things (IoT) has seen a prolific rise in recent times and provides the ability to solve several key challenges faced by our societies and environment. Data produced by IoT provides a significant opportunity to infer context that is key for IoT applications to make decisions/actuations. Context Management Platform (CMP) is a middleware to facilitate the exchange and management of such c…

    Submitted 30 April, 2024; originally announced July 2024.

  5. arXiv:2406.08226

    cs.CV cs.AI cs.LG

    DistilDoc: Knowledge Distillation for Visually-Rich Document Applications

    Authors: Jordy Van Landeghem, Subhajit Maity, Ayan Banerjee, Matthew Blaschko, Marie-Francine Moens, Josep Lladós, Sanket Biswas

    Abstract: This work explores knowledge distillation (KD) for visually-rich document (VRD) applications such as document layout analysis (DLA) and document image classification (DIC). While VRD research is dependent on increasingly sophisticated and cumbersome models, the field has neglected to study efficiency via model compression. Here, we design a KD experimentation methodology for more lean, performant…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to ICDAR 2024 (Athens, Greece)

  6. arXiv:2406.07712

    cs.LG

    Loss Gradient Gaussian Width based Generalization and Optimization Guarantees

    Authors: Arindam Banerjee, Qiaobo Li, Yingxue Zhou

    Abstract: Generalization and optimization guarantees on the population loss in machine learning often rely on uniform convergence based analysis, typically based on the Rademacher complexity of the predictors. The rich representation power of modern models has led to concerns about this approach. In this paper, we present generalization and optimization guarantees in terms of the complexity of the gradients…

    Submitted 11 June, 2024; originally announced June 2024.

  7. arXiv:2406.02977

    cs.CV cs.RO

    Sparse Color-Code Net: Real-Time RGB-Based 6D Object Pose Estimation on Edge Devices

    Authors: Xingjian Yang, Zhitao Yu, Ashis G. Banerjee

    Abstract: As robotics and augmented reality applications increasingly rely on precise and efficient 6D object pose estimation, real-time performance on edge devices is required for more interactive and responsive systems. Our proposed Sparse Color-Code Net (SCCN) embodies a clear and concise pipeline design to effectively address this requirement. SCCN performs pixel-level predictions on the target object i…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in the Proceedings of the 2024 IEEE 20th International Conference on Automation Science and Engineering

  8. arXiv:2405.19679

    cs.LG math.NA math.OC

    Efficient Trajectory Inference in Wasserstein Space Using Consecutive Averaging

    Authors: Amartya Banerjee, Harlin Lee, Nir Sharon, Caroline Moosmüller

    Abstract: Capturing data from dynamic processes through cross-sectional measurements is seen in many fields such as computational biology. Trajectory inference deals with the challenge of reconstructing continuous processes from such observations. In this work, we propose methods for B-spline approximation and interpolation of point clouds through consecutive averaging that is intrinsic to the Wasserstein…

    Submitted 30 May, 2024; originally announced May 2024.

  9. arXiv:2405.18511

    cs.CV

    Feasibility and benefits of joint learning from MRI databases with different brain diseases and modalities for segmentation

    Authors: Wentian Xu, Matthew Moffat, Thalia Seale, Ziyun Liang, Felix Wagner, Daniel Whitehouse, David Menon, Virginia Newcombe, Natalie Voets, Abhirup Banerjee, Konstantinos Kamnitsas

    Abstract: Models for segmentation of brain lesions in multi-modal MRI are commonly trained for a specific pathology using a single database with a predefined set of MRI modalities, determined by a protocol for the specific disease. This work explores the following open questions: Is it feasible to train a model using multiple databases that contain varying sets of MRI modalities and annotations for differen…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to MIDL 2024

    Journal ref: Proceedings of Machine Learning Research, MIDL 2024

  10. arXiv:2405.13863

    cs.AI cs.LG

    Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

    Authors: Arko Banerjee, Kia Rahmani, Joydeep Biswas, Isil Dillig

    Abstract: Among approaches for provably safe reinforcement learning, Model Predictive Shielding (MPS) has proven effective at complex tasks in continuous, high-dimensional state spaces, by leveraging a backup policy to ensure safety when the learned policy attempts to take risky actions. However, while MPS can ensure safety both during and after training, it often hinders task progress due to the conservati…

    Submitted 22 May, 2024; originally announced May 2024.

  11. arXiv:2405.11458

    cs.AI eess.SY

    CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System

    Authors: Ayan Banerjee, Aranyak Maity, Payal Kamboj, Sandeep K. S. Gupta

    Abstract: We explore the usage of large language models (LLM) in human-in-the-loop human-in-the-plant cyber-physical systems (CPS) to translate a high-level prompt into a personalized plan of actions, and subsequently convert that plan into a grounded inference of sequential decision-making automated by a real-world CPS controller to achieve a control goal. We show that it is relatively straightforward to c…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in AAAI 2024, Planning for Cyber Physical Systems

  12. arXiv:2405.11243

    cs.HC

    A User Interface Study on Sustainable City Trip Recommendations

    Authors: Ashmi Banerjee, Tunar Mahmudov, Wolfgang Wörndl

    Abstract: The importance of promoting sustainable and environmentally responsible practices is becoming increasingly recognized in all domains, including tourism. The impact of tourism extends beyond its immediate stakeholders and affects passive participants such as the environment, local businesses, and residents. City trips, in particular, offer significant opportunities to encourage sustainable tourism…

    Submitted 18 May, 2024; originally announced May 2024.

  13. arXiv:2405.06467

    cs.CV

    Attend, Distill, Detect: Attention-aware Entropy Distillation for Anomaly Detection

    Authors: Sushovan Jena, Vishwas Saini, Ujjwal Shaw, Pavitra Jain, Abhay Singh Raihal, Anoushka Banerjee, Sharad Joshi, Ananth Ganesh, Arnav Bhavsar

    Abstract: Unsupervised anomaly detection encompasses diverse applications in industrial settings where high throughput and precision are imperative. Early works were centered around the one-class-one-model paradigm, which poses significant challenges in large-scale production environments. Knowledge-distillation based multi-class anomaly detection promises low latency with reasonably good performance but w…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages

    MSC Class: 68T07 ACM Class: I.2.10

  14. arXiv:2405.04545

    cs.LG cs.IR

    Learning label-label correlations in Extreme Multi-label Classification via Label Features

    Authors: Siddhant Kharbanda, Devaansh Gupta, Erik Schultheis, Atmadeep Banerjee, Cho-Jui Hsieh, Rohit Babbar

    Abstract: Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent works in this domain have increasingly focused on a symmetric problem setting where both input instances and label features are short-text in nature. Short-text XMC with label features has found numerous applications in a…

    Submitted 3 May, 2024; originally announced May 2024.

  15. arXiv:2404.17045

    eess.SY cs.RO

    Toward Automated Formation of Composite Micro-Structures Using Holographic Optical Tweezers

    Authors: Tommy Zhang, Nicole Werner, Ashis G. Banerjee

    Abstract: Holographic Optical Tweezers (HOT) are powerful tools that can manipulate micro and nano-scale objects with high accuracy and precision. They are most commonly used for biological applications, such as cellular studies, and more recently, micro-structure assemblies. Automation has been of significant interest in the HOT field, since human-run experiments are time-consuming and require skilled oper…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in the Proceedings of the 2024 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)

  16. arXiv:2404.00412

    cs.CV cs.LG

    SVGCraft: Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout

    Authors: Ayan Banerjee, Nityanand Mathur, Josep Lladós, Umapada Pal, Anjan Dutta

    Abstract: Generating VectorArt from text prompts is a challenging vision task, requiring diverse yet realistic depictions of the seen as well as unseen entities. However, existing research has been mostly limited to the generation of single objects, rather than comprehensive scenes comprising multiple elements. In response, this work introduces SVGCraft, a novel end-to-end framework for the creation of vect…

    Submitted 30 March, 2024; originally announced April 2024.

  17. arXiv:2403.18604

    cs.IR

    Modeling Sustainable City Trips: Integrating CO2e Emissions, Popularity, and Seasonality into Tourism Recommender Systems

    Authors: Ashmi Banerjee, Tunar Mahmudov, Emil Adler, Fitri Nur Aisyah, Wolfgang Wörndl

    Abstract: Tourism affects not only the tourism industry but also society and stakeholders such as the environment, local businesses, and residents. Tourism Recommender Systems (TRS) can be pivotal in promoting sustainable tourism by guiding travelers toward destinations with minimal negative impact. Our paper introduces a composite sustainability indicator for a city trip TRS based on the users' starting po…

    Submitted 3 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  18. arXiv:2403.10581

    q-bio.QM cs.AI cs.CL cs.LG eess.SP

    Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction

    Authors: Chen Chen, Lei Li, Marcel Beetz, Abhirup Banerjee, Ramneek Gupta, Vicente Grau

    Abstract: Heart failure (HF) poses a significant public health challenge, with a rising global mortality rate. Early detection and prevention of HF could significantly reduce its impact. We introduce a novel methodology for predicting HF risk using 12-lead electrocardiograms (ECGs). We present a novel, lightweight dual-attention ECG network designed to capture complex ECG features essential for early HF ris…

    Submitted 22 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Under journal revision

  19. arXiv:2403.05591

    cs.HC cs.LG

    Data-Driven Ergonomic Risk Assessment of Complex Hand-intensive Manufacturing Processes

    Authors: Anand Krishnan, Xingjian Yang, Utsav Seth, Jonathan M. Jeyachandran, Jonathan Y. Ahn, Richard Gardner, Samuel F. Pedigo, Adriana Blom-Schieber, Ashis G. Banerjee, Krithika Manohar

    Abstract: Hand-intensive manufacturing processes, such as composite layup and textile draping, require significant human dexterity to accommodate task complexity. These strenuous hand motions often lead to musculoskeletal disorders and rehabilitation surgeries. We develop a data-driven ergonomic risk assessment system with a special focus on hand and finger activity to better identify and address ergonomic…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 26 pages, 7 figures

  20. arXiv:2403.02909

    cs.CV cs.HC eess.IV

    Gaze-Vector Estimation in the Dark with Temporally Encoded Event-driven Neural Networks

    Authors: Abeer Banerjee, Naval K. Mehta, Shyam S. Prasad, Himanshu, Sumeet Saurav, Sanjay Singh

    Abstract: In this paper, we address the intricate challenge of gaze vector prediction, a pivotal task with applications ranging from human-computer interaction to driver monitoring systems. Our innovative approach is designed for the demanding setting of extremely low-light conditions, leveraging a novel temporal event encoding scheme, and a dedicated neural network architecture. The temporal encoding metho…

    Submitted 5 March, 2024; originally announced March 2024.

  21. arXiv:2402.11728

    cs.CL cs.LG q-fin.CP

    Numerical Claim Detection in Finance: A New Financial Dataset, Weak-Supervision Model, and Market Analysis

    Authors: Agam Shah, Arnav Hiray, Pratvi Shah, Arkaprabha Banerjee, Anushka Singh, Dheeraj Eidnani, Bhaskar Chaudhury, Sudheer Chava

    Abstract: In this paper, we investigate the influence of claims in analyst reports and earnings calls on financial market returns, considering them as significant quarterly events for publicly traded companies. To facilitate a comprehensive analysis, we construct a new financial dataset for the claim detection task in the financial domain. We benchmark various language models on this dataset and propose a n…

    Submitted 18 February, 2024; originally announced February 2024.

  22. arXiv:2402.11401

    cs.CV cs.LG

    GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation

    Authors: Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

    Abstract: Object detection in documents is a key step to automate the structural elements identification process in a digital or scanned document through understanding the hierarchical structure and relationships between different elements. Large and complex models, while achieving high accuracy, can be computationally expensive and memory-intensive, making them impractical for deployment on resource constr…

    Submitted 20 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  23. arXiv:2402.05853

    cs.RO

    On Experimental Emulation of Printability and Fleet Aware Generic Mesh Decomposition for Enabling Aerial 3D Printing

    Authors: Marios-Nektarios Stamatopoulos, Avijit Banerjee, George Nikolakopoulos

    Abstract: This article introduces an experimental emulation of a novel chunk-based flexible multi-DoF aerial 3D printing framework. The experimental demonstration of the overall autonomy focuses on precise motion planning and task allocation for a UAV, traversing through a series of planned space-filling paths involved in the aerial 3D printing process without physically depositing the overlaying material.…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted for publication at IEEE International Conference on Robotics and Automation (ICRA) 2024

  24. arXiv:2402.01758

    cs.CY cs.AI cs.CL

    Aalap: AI Assistant for Legal & Paralegal Functions in India

    Authors: Aman Tiwari, Prathamesh Kalamkar, Atreyo Banerjee, Saurabh Karn, Varun Hemachandran, Smita Gupta

    Abstract: Using proprietary Large Language Models on legal tasks poses challenges due to data privacy issues, domain data heterogeneity, domain knowledge sophistication, and domain objectives uniqueness. We created Aalap, a fine-tuned Mistral 7B model on instructions data related to specific Indian legal tasks. The performance of Aalap is better than gpt-3.5-turbo in 31% of our test data and obtains an eq…

    Submitted 30 January, 2024; originally announced February 2024.

  25. arXiv:2401.15939

    cs.IT

    Correcting a Single Deletion in Reads from a Nanopore Sequencer

    Authors: Anisha Banerjee, Yonatan Yehezkeally, Antonia Wachter-Zeh, Eitan Yaakobi

    Abstract: Owing to their several merits over other DNA sequencing technologies, nanopore sequencers hold immense potential to revolutionize the efficiency of DNA storage systems. However, their higher error rates necessitate further research to devise practical and efficient coding schemes that would allow accurate retrieval of the data stored. Our work takes a step in this direction by adopting a simplifi…

    Submitted 7 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted at IEEE ISIT'24

  26. arXiv:2401.13961

    cs.CV

    TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images

    Authors: Jia Wan, Wanhua Li, Jason Ken Adhinarta, Atmadeep Banerjee, Evelina Sjostedt, Jingpeng Wu, Jeff Lichtman, Hanspeter Pfister, Donglai Wei

    Abstract: While imaging techniques at macro and mesoscales have garnered substantial attention and resources, microscale Volume Electron Microscopy (vEM) imaging, capable of revealing intricate vascular details, has lacked the necessary benchmarking infrastructure. In this paper, we address a significant gap in this field of neuroimaging by introducing the first-in-class public benchmark, BvEM, designed spe…

    Submitted 17 June, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: BvEM-Mouse can be visualized at: https://tinyurl.com/yc2s38x9

  27. arXiv:2401.03154

    cs.RO cs.AI cs.LG cs.MA

    Decentralized Multi-Agent Active Search and Tracking when Targets Outnumber Agents

    Authors: Arundhati Banerjee, Jeff Schneider

    Abstract: Multi-agent multi-target tracking has a wide range of applications, including wildlife patrolling, security surveillance or environment monitoring. Such algorithms often make restrictive assumptions: the number of targets and/or their initial locations may be assumed known, or agents may be pre-assigned to monitor disjoint partitions of the environment, reducing the burden of exploration. This als…

    Submitted 9 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Under review

    ACM Class: I.2.9; I.2.11

  28. arXiv:2312.14844

    eess.AS cs.SD physics.med-ph

    An Implantable Piezofilm Middle Ear Microphone: Performance in Human Cadaveric Temporal Bones

    Authors: John Z. Zhang, Lukas Graf, Annesya Banerjee, Aaron Yeiser, Christopher I. McHugh, Ioannis Kymissis, Jeffrey H. Lang, Elizabeth S. Olson, Hideko Heidi Nakajima

    Abstract: Purpose: One of the major reasons that totally implantable cochlear microphones are not readily available is the lack of good implantable microphones. An implantable microphone has the potential to provide a range of benefits over external microphones for cochlear implant users including the filtering ability of the outer ear, cosmetics, and usability in all situations. This paper presents results…

    Submitted 22 December, 2023; originally announced December 2023.

  29. arXiv:2312.13976

    physics.med-ph cs.AI cs.CG eess.IV q-bio.QM

    Anatomical basis of human sex differences in ECG identified by automated torso-cardiac three-dimensional reconstruction

    Authors: Hannah J. Smith, Blanca Rodriguez, Yuling Sang, Marcel Beetz, Robin Choudhury, Vicente Grau, Abhirup Banerjee

    Abstract: Background and Aims: The electrocardiogram (ECG) is routinely used for diagnosis and risk stratification following myocardial infarction (MI), though its interpretation is confounded by anatomical variability and sex differences. Women have a higher incidence of missed MI diagnosis and poorer outcomes following infarction. Sex differences in ECG biomarkers and torso-ventricular anatomy have not be…

    Submitted 17 July, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Paper under revision

  30. arXiv:2312.13752

    eess.IV cs.AI cs.CV

    Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

    Authors: Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Weiping Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, Pingyu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis, et al. (16 additional authors not shown)

    Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current publicly available datasets concentrate on lung diseases with moderate morphological variations. The intric…

    Submitted 16 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 19 pages

  31. The Expert Knowledge combined with AI outperforms AI Alone in Seizure Onset Zone Localization using resting state fMRI

    Authors: Payal Kamboj, Ayan Banerjee, Varina L. Boerwinkle, Sandeep K. S. Gupta

    Abstract: We evaluated whether integration of expert guidance on seizure onset zone (SOZ) identification from resting state functional MRI (rs-fMRI) connectomics combined with deep learning (DL) techniques enhances the SOZ delineation in patients with refractory epilepsy (RE), compared to utilizing DL alone. Rs-fMRI were collected from 52 children with RE who had subsequently undergone ic-EEG and then, if i…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted in Frontiers in Neurology journal, section Artificial Intelligence

  32. arXiv:2312.07145

    cs.LG stat.ML

    Contextual Bandits with Online Neural Regression

    Authors: Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee

    Abstract: Recent works have shown a reduction from contextual bandits to online regression under a realizability assumption [Foster and Rakhlin, 2020, Foster and Krishnamurthy, 2021]. In this work, we investigate the use of neural networks for such online regression and associated Neural Contextual Bandits (NeuCBs). Using existing results for wide networks, one can readily show a ${\mathcal{O}}(\sqrt{T})$ r…

    Submitted 12 December, 2023; originally announced December 2023.

  33. arXiv:2311.10246

    cs.LG cs.AI stat.ML

    Surprisal Driven $k$-NN for Robust and Interpretable Nonparametric Learning

    Authors: Amartya Banerjee, Christopher J. Hazard, Jacob Beel, Cade Mack, Jack Xia, Michael Resnick, Will Goddin

    Abstract: Nonparametric learning is a fundamental concept in machine learning that aims to capture complex patterns and relationships in data without making strong assumptions about the underlying data distribution. Owing to simplicity and familiarity, one of the most well-known algorithms under this paradigm is the $k$-nearest neighbors ($k$-NN) algorithm. Driven by the usage of machine learning in safety-…

    Submitted 2 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  34. arXiv:2311.06514

    cs.FL

    Set Augmented Finite Automata over Infinite Alphabets

    Authors: Ansuman Banerjee, Kingshuk Chatterjee, Shibashis Guha

    Abstract: A data language is a set of finite words defined on an infinite alphabet. Data languages are used to express properties associated with data values (domain defined over a countably infinite set). In this paper, we introduce set augmented finite automata (SAFA), a new class of automata for expressing data languages. We investigate the decision problems, closure properties, and expressiveness of SAF…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: This is a full version of a paper with the same name accepted in DLT 2023. Other than the full proofs, this paper contains several new results concerning more closure properties, universality problem, comparison of expressiveness with register automata and class counter automata, and more results on deterministic SAFA

    ACM Class: F.4.3

  35. arXiv:2310.00917

    cs.CV

    Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance

    Authors: Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya

    Abstract: The adaptation capability to a wide range of domains is crucial for scene text spotting models when deployed to real-world conditions. However, existing state-of-the-art (SOTA) approaches usually incorporate scene text detection and recognition simply by pretraining on natural scene text datasets, which do not directly exploit the intermediate feature representations between multiple domains. Here…

    Submitted 1 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted to the 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  36. arXiv:2310.00588

    cs.RO

    Active Anomaly Detection in Confined Spaces Using Ergodic Traversal of Directed Region Graphs

    Authors: Benjamin Wong, Tyler M. Paine, Santosh Devasia, Ashis G. Banerjee

    Abstract: We provide the first step toward developing a hierarchical control-estimation framework to actively plan robot trajectories for anomaly detection in confined spaces. The space is represented globally using a directed region graph, where a region is a landmark that needs to be visited (inspected). We devise a fast mixing Markov chain to find an ergodic route that traverses this graph so that the re…

    Submitted 1 October, 2023; originally announced October 2023.

  37. arXiv:2309.08239

    cs.CV cs.RO

    Human-Inspired Topological Representations for Visual Object Recognition in Unseen Environments

    Authors: Ekta U. Samani, Ashis G. Banerjee

    Abstract: Visual object recognition in unseen and cluttered indoor environments is a challenging problem for mobile robots. Toward this goal, we extend our previous work to propose the TOPS2 descriptor, and an accompanying recognition framework, THOR2, inspired by a human reasoning mechanism known as object unity. We interleave color embeddings obtained using the Mapper algorithm for topological soft cluste…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted for presentation at the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Workshop on Robotic Perception and Mapping: Frontier Vision & Learning Techniques

  38. arXiv:2309.06558

    eess.SY cs.AI math.DS math.NA

    High Fidelity Fast Simulation of Human in the Loop Human in the Plant (HIL-HIP) Systems

    Authors: Ayan Banerjee, Payal Kamboj, Aranyak Maity, Riya Sudhakar Salian, Sandeep K. S. Gupta

    Abstract: Non-linearities in simulation arise from the time variance in wireless mobile networks when integrated with human in the loop, human in the plant (HIL-HIP) physical systems under dynamic contexts, leading to simulation slowdown. Time variance is handled by deriving a series of piece wise linear time invariant simulations (PLIS) in intervals, which are then concatenated in time domain. In this pape…

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: To appear in ACM MSWIM 2023

  39. arXiv:2309.04856

    cs.LG cs.AI eess.IV

    AmbientFlow: Invertible generative models from incomplete, noisy measurements

    Authors: Varun A. Kelkar, Rucha Deshpande, Arindam Banerjee, Mark A. Anastasio

    Abstract: Generative models have gained popularity for their potential applications in imaging science, such as image reconstruction, posterior sampling and data sharing. Flow-based generative models are particularly attractive due to their ability to tractably provide exact density estimates along with fast, inexpensive and diverse samples. Training such models, however, requires a large, high quality data…

    Submitted 13 December, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted to Transactions on Machine Learning Research (TMLR). OpenReview: https://openreview.net/forum?id=txpYITR8oa

  40. arXiv:2309.02603

    cs.AI eess.SY

    Detection of Unknown-Unknowns in Human-in-Plant Human-in-Loop Systems Using Physics Guided Process Models

    Authors: Aranyak Maity, Ayan Banerjee, Sandeep Gupta

    Abstract: Unknown-unknowns are operational scenarios in systems that are not accounted for in the design and test phase. In such scenarios, the operational behavior of the Human-in-loop (HIL) Human-in-Plant (HIP) systems is not guaranteed to meet requirements such as safety and efficacy. We propose a novel framework for analyzing the operational output characteristics of safety-critical HIL-HIP systems that…

    Submitted 12 December, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

  41. Towards Individual and Multistakeholder Fairness in Tourism Recommender Systems

    Authors: Ashmi Banerjee, Paromita Banik, Wolfgang Wörndl

    Abstract: This position paper summarizes our published review on individual and multistakeholder fairness in Tourism Recommender Systems (TRS). Recently, there has been growing attention to fairness considerations in recommender systems (RS). It has been acknowledged in research that fairness in RS is often closely tied to the presence of multiple stakeholders, such as end users, item providers, and platfor…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Position Paper for FAcctRec 2023 at RecSys 2023

    Journal ref: Frontiers in Big Data 2023, Volume 6, 1168692

  42. arXiv:2309.01336  [pdf, other]

    cs.LG cs.AI

    Learning for Interval Prediction of Electricity Demand: A Cluster-based Bootstrapping Approach

    Authors: Rohit Dube, Natarajan Gautam, Amarnath Banerjee, Harsha Nagarajan

    Abstract: Accurate predictions of electricity demands are necessary for managing operations in a small aggregation load setting like a Microgrid. Due to low aggregation, the electricity demands can be highly stochastic and point estimates would lead to inflated errors. Interval estimation in this scenario would provide a range of values within which the future values might lie and help quantify the errors…

    Submitted 3 September, 2023; originally announced September 2023.

  43. arXiv:2308.11806  [pdf, other]

    cs.RO

    Flexible Multi-DoF Aerial 3D Printing Supported with Automated Optimal Chunking

    Authors: Marios-Nektarios Stamatopoulos, Avijit Banerjee, George Nikolakopoulos

    Abstract: The future of 3D printing utilizing unmanned aerial vehicles (UAVs) presents a promising capability to revolutionize manufacturing and to enable the creation of large-scale structures in remote and hard-to-reach areas, e.g., in other planetary systems. Nevertheless, the limited payload capacity of UAVs and the complexity in the 3D printing of large objects pose significant challenges. In this artic…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted for publication at 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

  44. arXiv:2308.06690  [pdf, ps, other]

    cs.IT

    Two-Dimensional Z-Complementary Array Quads with Low Column Sequence PMEPRs

    Authors: Shibsankar Das, Adrish Banerjee, Udaya Parampalli

    Abstract: In this paper, we first propose a new design strategy of 2D $Z$-complementary array quads (2D-ZCAQs) with feasible array sizes. A 2D-ZCAQ consists of four distinct unimodular arrays satisfying zero 2D auto-correlation sums for non-trivial 2D time-shifts within a certain zone. Then, we obtain the upper bounds on the column sequence peak-to-mean envelope power ratio (PMEPR) of the constructed 2D-ZCAQs…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: This work has been presented in 2023 IEEE International Symposium on Information Theory (ISIT), Taipei, Taiwan

  45. Root Cross Z-Complementary Pairs with Large ZCZ Width

    Authors: Shibsankar Das, Adrish Banerjee, Zilong Liu

    Abstract: In this paper, we present a new family of cross $Z$-complementary pairs (CZCPs) based on generalized Boolean functions and two roots of unity. Our key idea is to consider an arbitrary partition of the set $\{1,2,\cdots, n\}$ with two subsets corresponding to two given roots of unity for which two truncated sequences of new alphabet size determined by the two roots of unity are obtained. We show th…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: This work has been presented in 2022 IEEE International Symposium on Information Theory (ISIT), Espoo, Finland

    Journal ref: 2022 IEEE International Symposium on Information Theory (ISIT), Espoo, Finland, 2022, pp. 522-527

  46. arXiv:2308.06382  [pdf, other]

    cs.SD cs.LG eess.AS

    Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion

    Authors: Siyuan Shan, Yang Li, Amartya Banerjee, Junier B. Oliva

    Abstract: Voice conversion (VC) aims at altering a person's voice to make it sound similar to the voice of another person while preserving linguistic content. Existing methods suffer from a dilemma between content intelligibility and speaker similarity; i.e., methods with higher intelligibility usually have a lower speaker similarity, while methods with higher speaker similarity usually require plenty of ta…

    Submitted 30 December, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: AAAI 2024 Demo, Codes: https://phonemehallucinator.github.io/

  47. arXiv:2307.15830  [pdf, other]

    cs.LG

    A Distance Correlation-Based Approach to Characterize the Effectiveness of Recurrent Neural Networks for Time Series Forecasting

    Authors: Christopher Salazar, Ashis G. Banerjee

    Abstract: Time series forecasting has received a lot of attention, with recurrent neural networks (RNNs) being one of the widely used models due to their ability to handle sequential data. Previous studies on RNN time series forecasting, however, show inconsistent outcomes and offer few explanations for performance variations among the datasets. In this paper, we provide an approach to link time series char…

    Submitted 25 April, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

  48. arXiv:2307.11017  [pdf, other]

    cs.CV cs.LG eess.IV

    Multi-objective point cloud autoencoders for explainable myocardial infarction prediction

    Authors: Marcel Beetz, Abhirup Banerjee, Vicente Grau

    Abstract: Myocardial infarction (MI) is one of the most common causes of death in the world. Image-based biomarkers commonly used in the clinic, such as ejection fraction, fail to capture more complex patterns in the heart's 3D anatomy and thus limit diagnostic accuracy. In this work, we present the multi-objective point cloud autoencoder as a novel geometric deep learning approach for explainable infarctio…

    Submitted 20 July, 2023; originally announced July 2023.

  49. arXiv:2307.10927  [pdf, other]

    eess.IV cs.CV cs.LG

    Modeling 3D cardiac contraction and relaxation with point cloud deformation networks

    Authors: Marcel Beetz, Abhirup Banerjee, Vicente Grau

    Abstract: Global single-valued biomarkers of cardiac function typically used in clinical practice, such as ejection fraction, provide limited insight on the true 3D cardiac deformation process and hence, limit the understanding of both healthy and pathological cardiac mechanics. In this work, we propose the Point Cloud Deformation Network (PCD-Net) as a novel geometric deep learning approach to model 3D car…

    Submitted 20 July, 2023; originally announced July 2023.

  50. arXiv:2307.10045  [pdf, ps, other]

    cs.LO

    Alignment complete relational Hoare logics for some and all

    Authors: Ramana Nagasamudram, Anindya Banerjee, David A. Naumann

    Abstract: In relational verification, judicious alignment of computational steps facilitates proof of relations between programs using simple relational assertions. Relational Hoare logics (RHL) provide compositional rules that embody various alignments of executions. Seemingly more flexible alignments can be expressed in terms of product automata based on program transition relations. A single degenerate a…

    Submitted 9 July, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Vsn2 fixes a def, adds semantic completeness for filtered automata and Cook completeness for all-exists logic; V3 adds section on entailment completeness and additional proof rules; V4 makes minor changes in exposition; V5 makes minor changes in exposition, expands discussion of Cook completeness and control determinacy, and expands a key example; V6 abridges for journal submission. arXiv admin note: text overlap with arXiv:2212.10338

    ACM Class: F.3.1