Showing 1–50 of 93 results for author: Yu, S

  1. arXiv:2407.10446  [pdf, other]

    cs.SD cs.AI eess.AS

    DDFAD: Dataset Distillation Framework for Audio Data

    Authors: Wenbo Jiang, Rui Zhang, Hongwei Li, Xiaoyuan Liu, Haomiao Yang, Shui Yu

    Abstract: Deep neural networks (DNNs) have achieved significant success in numerous applications. The remarkable performance of DNNs is largely attributed to the availability of massive, high-quality training datasets. However, processing such massive training data requires huge computational and storage resources. Dataset distillation is a promising solution to this problem, offering the capability to comp…

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.05726  [pdf, other]

    cs.CV eess.IV

    Gait Patterns as Biomarkers: A Video-Based Approach for Classifying Scoliosis

    Authors: Zirui Zhou, Junhao Liang, Zizhao Peng, Chao Fan, Fengwei An, Shiqi Yu

    Abstract: Scoliosis poses significant diagnostic challenges, particularly in adolescents, where early detection is crucial for effective treatment. Traditional diagnostic and follow-up methods, which rely on physical examinations and radiography, face limitations due to the need for clinical expertise and the risk of radiation exposure, thus restricting their use for widespread early screening. In response,…

    Submitted 9 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted to MICCAI 2024

  3. arXiv:2405.18435  [pdf, other]

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag, et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  4. arXiv:2403.03526  [pdf, other]

    eess.SP cs.LG q-bio.NC

    FingerNet: EEG Decoding of A Fine Motor Imagery with Finger-tapping Task Based on A Deep Neural Network

    Authors: Young-Min Go, Seong-Hyun Yu, Hyeong-Yeong Park, Minji Lee, Ji-Hoon Jeong

    Abstract: Brain-computer interface (BCI) technology facilitates communication between the human brain and computers, primarily utilizing electroencephalography (EEG) signals to discern human intentions. Although EEG-based BCI systems have been developed for individuals with paralysis, ongoing studies explore systems for speech imagery and motor imagery (MI). This study introduces FingerNet, a specialized network…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 12 pages, 5 figures, and 2 tables

  5. arXiv:2403.01428  [pdf, other]

    cs.RO eess.SP

    Localization matters too: How localization error affects UAV flight

    Authors: Suquan Zhang, Yuanfan Xu, Shu'ang Yu, Qingmin Liao, Jincheng Yu, Yu Wang

    Abstract: The maximum safe flight speed of an Unmanned Aerial Vehicle (UAV) is an important indicator for measuring its efficiency in completing various tasks. This indicator is influenced by numerous parameters such as UAV localization error, perception range, and system latency. However, in terms of localization errors, although there have been many studies dedicated to improving the localization capabilit…

    Submitted 7 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

  6. arXiv:2403.00221  [pdf, ps, other]

    eess.SY

    Mode Consensus Algorithms With Finite Convergence Time

    Authors: Chao Huang, Hyungbo Shim, Siliang Yu, Brian D. O. Anderson

    Abstract: This paper studies the distributed mode consensus problem in a multi-agent system, in which the agents each possess a certain attribute and they aim to agree upon the mode (the most frequent attribute owned by the agents) via distributed computation. Three algorithms are proposed. The first one directly calculates the frequency of all attributes at every agent, with protocols based on blended dyna…

    Submitted 29 February, 2024; originally announced March 2024.

  7. arXiv:2402.19013  [pdf, other]

    eess.SY

    Ultraviolet Positioning via TDOA: Error Analysis and System Prototype

    Authors: Shihui Yu, Chubing Lv, Yueke Yang, Yuchen Pan, Lei Sun, Juliang Cao, Ruihang Yu, Chen Gong, Wenqi Wu, Zhengyuan Xu

    Abstract: This work performs the design, real-time hardware realization, and experimental evaluation of a positioning system by ultra-violet (UV) communication under photon-level signal detection. The positioning is based on the time-difference of arrival (TDOA) principle. Time division-based transmission of a synchronization sequence from three transmitters with known positions is applied. We investigate the pos…

    Submitted 14 April, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  8. arXiv:2402.08788  [pdf]

    cs.CL cs.SD eess.AS

    Syllable based DNN-HMM Cantonese Speech to Text System

    Authors: Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T. Y. Ng

    Abstract: This paper reports our work on building up a Cantonese Speech-to-Text (STT) system with a syllable-based acoustic model. This is a part of an effort in building an STT system to aid dyslexic students who have cognitive deficiency in writing skills but have no problem expressing their ideas through speech. For Cantonese speech recognition, the basic unit of acoustic models can either be the conventi…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages, 3 figures, LREC 2016

    MSC Class: 94-06 ACM Class: I.2.7

  9. arXiv:2401.04394  [pdf, other]

    cs.MM cs.SD eess.AS

    SonicVisionLM: Playing Sound with Vision Language Models

    Authors: Zhifeng Xie, Shengye Yu, Qile He, Mengtian Li

    Abstract: There has been a growing interest in the task of generating sound for silent videos, primarily because of its practicality in streamlining video post-production. However, existing methods for video-sound generation attempt to directly create sound from visual representations, which can be challenging due to the difficulty of aligning visual representations with audio representations. In this paper…

    Submitted 3 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: CVPR 2024

  10. arXiv:2311.08271  [pdf, other]

    cs.LG cs.IT cs.NI eess.SP

    Mobility-Induced Graph Learning for WiFi Positioning

    Authors: Kyuwon Han, Seung Min Yu, Seong-Lyun Kim, Seung-Woo Ko

    Abstract: Smartphone-based user mobility tracking can be effective in finding a user's location, but the unpredictable error caused by the low specification of built-in inertial measurement units (IMUs) rules out its standalone use and demands integration with another positioning technique such as WiFi positioning. This paper aims to propose a novel integration technique using a graph neural network ca…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: submitted to a possible IEEE journal

  11. arXiv:2311.06834  [pdf, other]

    eess.IV cs.CV

    Osteoporosis Prediction from Hand and Wrist X-rays using Image Segmentation and Self-Supervised Learning

    Authors: Hyungeun Lee, Ung Hwang, Seungwon Yu, Chang-Hun Lee, Kijung Yoon

    Abstract: Osteoporosis is a widespread and chronic metabolic bone disease that often remains undiagnosed and untreated due to limited access to bone mineral density (BMD) tests like Dual-energy X-ray absorptiometry (DXA). In response to this challenge, current advancements are pivoting towards detecting osteoporosis by examining alternative indicators from peripheral bone areas, with the goal of increasing…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 10 pages

  12. arXiv:2309.01950  [pdf, other]

    cs.CV cs.AI cs.LG cs.SD eess.AS

    RADIO: Reference-Agnostic Dubbing Video Synthesis

    Authors: Dongyeun Lee, Chaewon Kim, Sangjoon Yu, Jaejun Yoo, Gyeong-Moon Park

    Abstract: One of the most challenging problems in audio-driven talking head generation is achieving high-fidelity detail while ensuring precise synchronization. Given only a single reference image, extracting meaningful identity attributes becomes even more challenging, often causing the network to mirror the facial and lip structures too closely. To address these issues, we introduce RADIO, a framework eng…

    Submitted 6 November, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by WACV 2024

  13. arXiv:2308.01201  [pdf, ps, other]

    eess.SY

    A Real-Time Robust Ecological-Adaptive Cruise Control Strategy for Battery Electric Vehicles

    Authors: Sheng Yu, Xiao Pan, Anastasis Georgiou, Boli Chen, Imad M. Jaimoukha, Simos A. Evangelou

    Abstract: This work addresses the ecological-adaptive cruise control problem for connected electric vehicles by a computationally efficient robust control strategy. The problem is formulated in the space-domain with a realistic description of the nonlinear electric powertrain model and motion dynamics to yield a convex optimal control problem (OCP). The OCP is approached by a novel robust model predictive c…

    Submitted 15 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 15 pages, 12 figures and 2 tables. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  14. arXiv:2307.03812  [pdf]

    eess.IV eess.SY physics.optics

    Coordinate-based neural representations for computational adaptive optics in widefield microscopy

    Authors: Iksung Kang, Qinrong Zhang, Stella X. Yu, Na Ji

    Abstract: Widefield microscopy is widely used for non-invasive imaging of biological structures at subcellular resolution. When applied to complex specimens, its image quality is degraded by sample-induced optical aberration. Adaptive optics can correct wavefront distortion and restore diffraction-limited resolution but requires wavefront sensing and corrective devices, increasing system complexity and cost.…

    Submitted 24 June, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 60 pages, 20 figures, 2 tables. Nat Mach Intell (2024)

  15. arXiv:2306.05358  [pdf, other]

    cs.CR cs.AI cs.LG cs.SD eess.AS

    Trustworthy Sensor Fusion against Inaudible Command Attacks in Advanced Driver-Assistance System

    Authors: Jiwei Guan, Lei Pan, Chen Wang, Shui Yu, Longxiang Gao, Xi Zheng

    Abstract: There are increasing concerns about malicious attacks on autonomous vehicles. In particular, inaudible voice command attacks pose a significant threat as voice commands become available in autonomous driving systems. How to empirically defend against these inaudible attacks remains an open question. Previous research investigates utilizing deep learning-based multimodal fusion for defense, without…

    Submitted 29 May, 2023; originally announced June 2023.

  16. arXiv:2305.19467  [pdf]

    eess.IV cs.CV

    Synthetic CT Generation from MRI using 3D Transformer-based Denoising Diffusion Model

    Authors: Shaoyan Pan, Elham Abouei, Jacob Wynne, Tonghe Wang, Richard L. J. Qiu, Yuheng Li, Chih-Wei Chang, Junbo Peng, Justin Roper, Pretesh Patel, David S. Yu, Hui Mao, Xiaofeng Yang

    Abstract: Magnetic resonance imaging (MRI)-based synthetic computed tomography (sCT) simplifies radiation therapy treatment planning by eliminating the need for CT simulation and error-prone image registration, ultimately reducing patient radiation dose and setup uncertainty. We propose an MRI-to-CT transformer-based denoising diffusion probabilistic model (MC-DDPM) to transform MRI into high-quality sCT to…

    Submitted 30 May, 2023; originally announced May 2023.

  17. arXiv:2305.04208  [pdf, other]

    eess.IV cs.CV

    Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network

    Authors: Xiaoyu Yang, Lijian Xu, Simon Yu, Qing Xia, Hongsheng Li, Shaoting Zhang

    Abstract: Segmentation of the coronary artery is an important task for the quantitative analysis of coronary computed tomography angiography (CCTA) images and is being stimulated by the field of deep learning. However, the complex structure of the coronary artery, with its tiny and narrow branches, makes this a great challenge. Coupled with the medical image limitations of low resolution and poor contrast, fragmen…

    Submitted 7 May, 2023; originally announced May 2023.

  18. arXiv:2305.00750  [pdf]

    physics.soc-ph eess.SY

    Experimental features of emissions and fuel consumption in a car-following platoon

    Authors: Shirui Zhou, Ying-En Ge, Shaowei Yu, Junfang Tian, Rui Jiang

    Abstract: The paper investigates the features of emissions and fuel consumption (EFC) in a car-following (CF) platoon based on two experimental datasets. Four classical EFC models are employed and a universal concave growth pattern of the EFC along a platoon has been demonstrated. A general framework of coupling EFC and CF models is tested by calibrating and simulating three classical CF models. This work f…

    Submitted 1 May, 2023; originally announced May 2023.

  19. arXiv:2303.17751  [pdf, other]

    cs.LO eess.SY

    Pacti: Scaling Assume-Guarantee Reasoning for System Analysis and Design

    Authors: Inigo Incer, Apurva Badithela, Josefine Graebener, Piergiuseppe Mallozzi, Ayush Pandey, Sheng-Jung Yu, Albert Benveniste, Benoit Caillaud, Richard M. Murray, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia

    Abstract: Contract-based design is a method to facilitate modular system design. While there has been substantial progress on the theory of contracts, there has been less progress on scalable algorithms for the algebraic operations in this theory. In this paper, we present: 1) principles to implement a contract-based design tool at scale and 2) Pacti, a tool that can efficiently compute these operations. We…

    Submitted 30 March, 2023; originally announced March 2023.

  20. arXiv:2302.11997  [pdf, other]

    eess.SP

    Beamforming Design with Partial Channel Estimation and Feedback for FDD RIS-Assisted Systems

    Authors: Xiaochun Ge, Shanping Yu, Wenqian Shen, Chengwen Xing, Byonghyo Shim

    Abstract: Beamforming design with partial channel estimation and feedback for frequency-division duplexing (FDD) reconfigurable intelligent surface (RIS) assisted systems is considered in this paper. We leverage the observation that path angle information (PAI) varies more slowly than path gain information (PGI). Then, several dominant paths are selected among all the cascaded paths according to the known P…

    Submitted 23 February, 2023; originally announced February 2023.

  21. arXiv:2302.06611  [pdf, other]

    eess.IV

    Deep Learning and Medical Imaging for COVID-19 Diagnosis: A Comprehensive Survey

    Authors: Song Wu, Yazhou Ren, Aodi Yang, Xinyue Chen, Xiaorong Pu, Jing He, Liqiang Nie, Philip S. Yu

    Abstract: COVID-19 (Coronavirus disease 2019) has been quickly spreading since its outbreak, impacting financial markets and healthcare systems globally. Countries all around the world have adopted a number of extraordinary steps to restrict the spreading virus, where early COVID-19 diagnosis is essential. Medical images such as X-ray images and Computed Tomography scans are becoming one of the main diagnos…

    Submitted 12 February, 2023; originally announced February 2023.

  22. arXiv:2302.00953  [pdf]

    eess.IV cs.CV cs.LG

    Deep-Learning Tool for Early Identifying Non-Traumatic Intracranial Hemorrhage Etiology based on CT Scan

    Authors: Meng Zhao, Yifan Hu, Ruixuan Jiang, Yuanli Zhao, Dong Zhang, Yan Zhang, Rong Wang, Yong Cao, Qian Zhang, Yonggang Ma, Jiaxi Li, Shaochen Yu, Wenjie Li, Ran Zhang, Yefeng Zheng, Shuo Wang, Jizong Zhao

    Abstract: Background: To develop an artificial intelligence system that can accurately identify acute non-traumatic intracranial hemorrhage (ICH) etiology based on non-contrast CT (NCCT) scans and investigate whether clinicians can benefit from it in a diagnostic setting. Materials and Methods: The deep learning model was developed with 1868 eligible NCCT scans with non-traumatic ICH collected between Janua…

    Submitted 2 February, 2023; originally announced February 2023.

  23. arXiv:2301.06574  [pdf, other]

    cs.LG eess.SP

    Causal Recurrent Variational Autoencoder for Medical Time Series Generation

    Authors: Hongming Li, Shujian Yu, Jose Principe

    Abstract: We propose causal recurrent variational autoencoder (CR-VAE), a novel generative model that is able to learn a Granger causal graph from a multivariate time series x and incorporates the underlying causal mechanism into its data generation process. Distinct from the classical recurrent VAEs, our CR-VAE uses a multi-head decoder, in which the $p$-th head is responsible for generating the $p$-th dimen…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: Manuscript accepted by AAAI-23. Code is publicly available at https://github.com/hongmingli1995/CR-VAE

  24. arXiv:2301.06267  [pdf, other]

    cs.CV cs.AI cs.LG cs.SD eess.AS

    Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models

    Authors: Zhiqiu Lin, Samuel Yu, Zhiyi Kuang, Deepak Pathak, Deva Ramanan

    Abstract: The ability to quickly learn a new task with minimal instruction - known as few-shot learning - is a central aspect of intelligent agents. Classical few-shot benchmarks make use of few-shot samples from a single modality, but such samples may not be sufficient to characterize an entire concept class. In contrast, humans use cross-modal information to learn new concepts efficiently. In this work, w…

    Submitted 2 August, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: CVPR 2023. Project website: https://linzhiqiu.github.io/papers/cross_modal/

  25. arXiv:2212.11486  [pdf, other]

    cs.CR eess.SP

    Over-the-Air Federated Learning with Enhanced Privacy

    Authors: Xiaochan Xue, Moh Khalid Hasan, Shucheng Yu, Laxima Niure Kandel, Min Song

    Abstract: Federated learning (FL) has emerged as a promising learning paradigm in which only local model parameters (gradients) are shared. Private user data never leaves the local devices, thus preserving data privacy. However, recent research has shown that even when local data is never shared by a user, exchanging model parameters without protection can also leak private information. Moreover, in wireless…

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: 6 pages

  26. arXiv:2212.10817  [pdf, other]

    eess.IV cs.CV

    High-fidelity Direct Contrast Synthesis from Magnetic Resonance Fingerprinting

    Authors: Ke Wang, Mariya Doneva, Jakob Meineke, Thomas Amthor, Ekin Karasan, Fei Tan, Jonathan I. Tamir, Stella X. Yu, Michael Lustig

    Abstract: Magnetic Resonance Fingerprinting (MRF) is an efficient quantitative MRI technique that can extract important tissue and system parameters such as T1, T2, B0, and B1 from a single scan. This property also makes it attractive for retrospectively synthesizing contrast-weighted images. In general, contrast-weighted images like T1-weighted, T2-weighted, etc., can be synthesized directly from parameter…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 19 pages, 8 figures

  27. A Computationally Efficient Robust Model Predictive Control Framework for Ecological Adaptive Cruise Control Strategy of Electric Vehicles

    Authors: Sheng Yu, Xiao Pan, Anastasis Georgiou, Boli Chen, Imad M. Jaimoukha, Simos A. Evangelou

    Abstract: The recent advancement in vehicular networking technology provides novel solutions for designing intelligent and sustainable vehicle motion controllers. This work addresses a car-following task, where the feedback linearisation method is combined with a robust model predictive control (RMPC) scheme to safely, optimally and efficiently control a connected electric vehicle. In particular, the nonlin…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  28. arXiv:2211.06041  [pdf, other]

    eess.AS

    An Adapter based Multi-label Pre-training for Speech Separation and Enhancement

    Authors: Tianrui Wang, Xie Chen, Zhuo Chen, Shu Yu, Weibin Zhu

    Abstract: In recent years, self-supervised learning (SSL) has achieved tremendous success in various speech tasks due to its power to extract representations from massive unlabeled data. However, compared with tasks such as speech recognition (ASR), the improvements from SSL representation in speech separation (SS) and enhancement (SE) are considerably smaller. Based on HuBERT, this work investigates improv…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 5 pages

  29. arXiv:2210.16674  [pdf, other]

    eess.IV cs.CV

    Semantic-SuPer: A Semantic-aware Surgical Perception Framework for Endoscopic Tissue Identification, Reconstruction, and Tracking

    Authors: Shan Lin, Albert J. Miao, Jingpei Lu, Shunkai Yu, Zih-Yun Chiu, Florian Richter, Michael C. Yip

    Abstract: Accurate and robust tracking and reconstruction of the surgical scene is a critical enabling technology toward autonomous robotic surgery. Existing algorithms for 3D perception in surgery mainly rely on geometric information, while we propose to also leverage semantic information inferred from the endoscopic video using image segmentation algorithms. In this paper, we present a novel, comprehensiv…

    Submitted 20 February, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2023

  30. arXiv:2209.01710  [pdf, other]

    cs.RO cs.LG eess.SY

    Perception Simplex: Verifiable Collision Avoidance in Autonomous Vehicles Amidst Obstacle Detection Faults

    Authors: Ayoosh Bansal, Hunmin Kim, Simon Yu, Bo Li, Naira Hovakimyan, Marco Caccamo, Lui Sha

    Abstract: Advances in deep learning have revolutionized cyber-physical applications, including the development of Autonomous Vehicles. However, real-world collisions involving autonomous control of vehicles have raised significant safety concerns regarding the use of Deep Neural Networks (DNN) in safety-critical tasks, particularly Perception. The inherent unverifiability of DNNs poses a key challenge in en…

    Submitted 28 November, 2023; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.14403

    ACM Class: D.2.11; I.2.9; C.4; J.7

    Journal ref: Software Testing, Verification and Reliability. 2024. e1879

  31. Verifiable Obstacle Detection

    Authors: Ayoosh Bansal, Hunmin Kim, Simon Yu, Bo Li, Naira Hovakimyan, Marco Caccamo, Lui Sha

    Abstract: Perception of obstacles remains a critical safety concern for autonomous vehicles. Real-world collisions have shown that the autonomy faults leading to fatal collisions originate from obstacle existence detection. Open source autonomous driving implementations show a perception pipeline with complex interdependent Deep Neural Networks. These networks are not fully verifiable, making them unsuitabl…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at ISSRE 2022

    ACM Class: D.2.4; I.2.9; I.4.8

    Journal ref: 33rd International Symposium on Software Reliability Engineering (ISSRE), pp. 61-72. IEEE, 2022

  32. arXiv:2208.03028  [pdf, other]

    eess.IV cs.CV

    Multimodal Brain Disease Classification with Functional Interaction Learning from Single fMRI Volume

    Authors: Wei Dai, Ziyao Zhang, Lixia Tian, Shengyuan Yu, Shuhui Wang, Zhao Dong, Hairong Zheng

    Abstract: In neuroimaging analysis, fMRI can well assess the function changes for brain diseases with no obvious structural lesions. To date, most deep-learning-based fMRI studies have employed functional connectivity (FC) as the basic feature for disease classification. However, FC is calculated on time series of predefined regions of interest and neglects detailed information contained in each voxel. Anot…

    Submitted 1 March, 2023; v1 submitted 5 August, 2022; originally announced August 2022.

  33. arXiv:2208.01559  [pdf, other]

    eess.SP

    The design and optimization of synchronization sequence for Ultraviolet communication

    Authors: Shihui Yu, Chen Gong, Zhengyuan Xu

    Abstract: In ultraviolet (UV) scattering communication, the received signals exhibit the characteristics of discrete photoelectrons due to path loss. Synchronization is based on the maximum Pulse Number-Sequence correlation problem. First of all, the accuracy of synchronization is vital to channel estimation and decoding. This article focuses on improving synchronization accuracy by designing and optimi…

    Submitted 2 August, 2022; originally announced August 2022.

  34. arXiv:2207.03105  [pdf]

    q-bio.TO cs.CV eess.IV physics.med-ph

    Uncertainty-Aware Self-supervised Neural Network for Liver $T_{1ρ}$ Mapping with Relaxation Constraint

    Authors: Chaoxing Huang, Yurui Qian, Simon Chun Ho Yu, Jian Hou, Baiyan Jiang, Queenie Chan, Vincent Wai-Sun Wong, Winnie Chiu-Wing Chu, Weitian Chen

    Abstract: $T_{1ρ}$ mapping is a promising quantitative MRI technique for the non-invasive assessment of tissue properties. Learning-based approaches can map $T_{1ρ}$ from a reduced number of $T_{1ρ}$ weighted images, but require significant amounts of high quality training data. Moreover, existing methods do not provide the confidence level of the $T_{1ρ}…

    Submitted 25 October, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Provisionally accepted by Physics in Medicine and Biology

  35. arXiv:2207.01581  [pdf, other]

    cs.LG cs.AI eess.SP q-bio.NC

    Interpretable Fusion Analytics Framework for fMRI Connectivity: Self-Attention Mechanism and Latent Space Item-Response Model

    Authors: Jeong-Jae Kim, Yeseul Jeon, SuMin Yu, Junggu Choi, Sanghoon Han

    Abstract: There have been several attempts to use deep learning based on brain fMRI signals to classify cognitive impairment diseases. However, deep learning is a hidden black box model that makes it difficult to interpret the process of classification. To address this issue, we propose a novel analytical framework that interprets the classification result from deep learning processes. We first derive the r…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 38 pages, 12 figures, 3 tables

  36. arXiv:2205.03612  [pdf, other]

    eess.SP cs.LG

    BrainIB: Interpretable Brain Network-based Psychiatric Diagnosis with Graph Information Bottleneck

    Authors: Kaizhong Zheng, Shujian Yu, Baojuan Li, Robert Jenssen, Badong Chen

    Abstract: Developing new diagnostic models based on the underlying biological mechanisms rather than subjective symptoms for psychiatric disorders is an emerging consensus. Recently, machine learning-based classifiers using functional connectivity (FC) for psychiatric disorders and healthy controls are developed to identify brain markers. However, existing machine learning-based diagnostic models are prone…

    Submitted 31 May, 2023; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: 15 pages, 8 figures

  37. arXiv:2204.03737  [pdf, other]

    eess.SP cs.LG

    Mixing Signals: Data Augmentation Approach for Deep Learning Based Modulation Recognition

    Authors: Xinjie Xu, Zhuangzhi Chen, Dongwei Xu, Huaji Zhou, Shanqing Yu, Shilian Zheng, Qi Xuan, Xiaoniu Yang

    Abstract: With the rapid development of deep learning, automatic modulation recognition (AMR), as an important task in cognitive radio, has gradually transformed from traditional feature extraction and classification to automatic classification by deep learning technology. However, deep learning models are data-driven methods, which often require a large amount of data as the training support. Data augmenta…

    Submitted 5 April, 2022; originally announced April 2022.

  38. arXiv:2202.12940  [pdf]

    eess.SP physics.app-ph physics.optics

    Fully-integrated multipurpose microwave frequency identification system on a single chip

    Authors: Yuhan Yao, Yuhe Zhao, Yanxian Wei, Feng Zhou, Daigao Chen, Yuguang Zhang, Xi Xiao, Ming Li, Jianji Dong, Shaohua Yu, Xinliang Zhang

    Abstract: We demonstrate a fully-integrated multipurpose microwave frequency identification system on a silicon-on-insulator platform. Thanks to its multipurpose features, the chip is able to identify different types of microwave signals, including single-frequency, multiple-frequency, chirped and frequency-hopping microwave signals, as well as discriminate instantaneous frequency variation among the frequenc…

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 23 pages, 6 figures

  39. arXiv:2202.11189  [pdf, ps, other]

    eess.IV eess.SP physics.optics

    Mathematical Foundation of Sparsity-based Multi-snapshot Spectral Estimation

    Authors: Ping Liu, Sanghyeon Yu, Ola Sabet, Lucas Pelkmans, Habib Ammari

    Abstract: In this paper, we study the spectral estimation problem of estimating the locations of a fixed number of point sources given multiple snapshots of Fourier measurements in a bounded domain. We aim to provide a mathematical foundation for sparsity-based super-resolution in such spectral estimation problems in both one- and multi-dimensional spaces. In particular, we estimate the resolution and stabi…

    Submitted 22 February, 2024; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 48 pages, 0 figure

    MSC Class: 65R32; 42A10; 62H20; 78A46

  40. arXiv:2202.08994  [pdf, other]

    eess.IV cs.CV

    REFUGE2 Challenge: A Treasure Trove for Multi-Dimension Analysis and Evaluation in Glaucoma Screening

    Authors: Huihui Fang, Fei Li, Junde Wu, Huazhu Fu, Xu Sun, Jaemin Son, Shuang Yu, Menglu Zhang, Chenglang Yuan, Cheng Bian, Baiying Lei, Benjian Zhao, Xinxing Xu, Shaohua Li, Francisco Fumero, José Sigut, Haidar Almubarak, Yakoub Bazi, Yuanhao Guo, Yating Zhou, Ujjwal Baid, Shubham Innani, Tianjiao Guo, Jie Yang, José Ignacio Orlando, et al. (3 additional authors not shown)

    Abstract: With the rapid development of artificial intelligence (AI) in medical image processing, deep learning in color fundus photography (CFP) analysis is also evolving. Although there are some open-source, labeled datasets of CFPs in the ophthalmology community, large-scale datasets for screening only have labels of disease categories, and datasets with annotations of fundus structures are usually small…

    Submitted 29 December, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 29 pages, 21 figures

  41. arXiv:2202.02951  [pdf, other]

    eess.IV cs.CV

    Deep Deterministic Independent Component Analysis for Hyperspectral Unmixing

    Authors: Hongming Li, Shujian Yu, Jose C. Principe

    Abstract: We develop a new neural network based independent component analysis (ICA) method by directly minimizing the dependence amongst all extracted components. Using the matrix-based Rényi's $\alpha$-order entropy functional, our network can be directly optimized by stochastic gradient descent (SGD), without any variational approximation or adversarial training. As a solid application, we evaluate our ICA…

    Submitted 14 February, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Accepted by ICASSP 2022

  42. arXiv:2202.00951  [pdf, other]

    eess.AS cs.AI cs.LG cs.MM cs.SD

    TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

    Authors: Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov

    Abstract: Singing melody extraction is an important problem in the field of music information retrieval. Existing methods typically rely on frequency-domain representations to estimate the sung frequencies. However, this design does not lead to human-level performance in the perception of melody information for both tone (pitch-class) and octave. In this paper, we propose TONet, a plug-and-play model that i…

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: Preprint Version for ICASSP 2022, Singapore

  43. arXiv:2201.02192  [pdf]

    cs.RO cs.AI cs.HC eess.SY

    A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture

    Authors: Mohsen Jafarzadeh, Stephen Brooks, Shimeng Yu, Balakrishnan Prabhakaran, Yonas Tadesse

    Abstract: Currently, most social robots interact with their surroundings and humans through sensors that are integral parts of the robots, which limits the usability of the sensors, human-robot interaction, and interchangeability. A wearable sensor garment that fits many robots is needed in many applications. This article presents an affordable wearable sensor vest, and an open-source software architecture…

    Submitted 6 January, 2022; originally announced January 2022.

    Comments: This is the preprint version. The final version is published in Robotics and Autonomous Systems, Volume 139, 2021, Page 103536, ISSN 0921-8890, https://doi.org/10.1016/j.robot.2020.103536

    MSC Class: 68T40

    ACM Class: I.2.9

    Journal ref: Robotics and Autonomous Systems, vol 139, page 103536, year 2021

  44. A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled Samples

    Authors: Sen Jia, Shuguo Jiang, Zhijie Lin, Nanying Li, Meng Xu, Shiqi Yu

    Abstract: With the rapid development of deep learning technology and improvement in computing capability, deep learning has been widely used in the field of hyperspectral image (HSI) classification. In general, deep learning models often contain many trainable parameters and require a massive number of labeled samples to achieve optimal performance. However, in regard to HSI classification, a large number o…

    Submitted 3 December, 2021; originally announced December 2021.

    Journal ref: Neurocomputing, Volume 448, 2021, Pages 179-204

  45. arXiv:2109.08632  [pdf, other]

    cs.LG cs.AI eess.SY

    Graph Learning for Cognitive Digital Twins in Manufacturing Systems

    Authors: Trier Mortlock, Deepan Muthirayan, Shih-Yuan Yu, Pramod P. Khargonekar, Mohammad A. Al Faruque

    Abstract: Future manufacturing requires complex systems that connect simulation platforms and virtualization with physical data from industrial processes. Digital twins incorporate a physical twin, a digital twin, and the connection between the two. Benefits of using digital twins, especially in manufacturing, are abundant as they can increase efficiency across an entire manufacturing life-cycle. The digita…

    Submitted 17 September, 2021; originally announced September 2021.

  46. arXiv:2109.05664  [pdf]

    cs.CV eess.IV

    Unsupervised domain adaptation for cross-modality liver segmentation via joint adversarial learning and self-learning

    Authors: Jin Hong, Simon Chun-Ho Yu, Weitian Chen

    Abstract: Liver segmentation on images acquired using computed tomography (CT) and magnetic resonance imaging (MRI) plays an important role in clinical management of liver diseases. Compared to MRI, CT images of liver are more abundant and readily available. However, MRI can provide richer quantitative information of the liver compared to CT. Thus, it is desirable to achieve unsupervised domain adaptation f…

    Submitted 24 February, 2022; v1 submitted 12 September, 2021; originally announced September 2021.

  47. Robust Tube-based Model Predictive Control with Koopman Operators--Extended Version

    Authors: Xinglong Zhang, Wei Pan, Riccardo Scattolini, Shuyou Yu, Xin Xu

    Abstract: Koopman operators are of infinite dimension and capture the characteristics of nonlinear dynamics in a lifted global linear manner. The finite data-driven approximation of Koopman operators results in a class of linear predictors, useful for formulating linear model predictive control (MPC) of nonlinear dynamical systems with reduced computational complexity. However, the robustness of the closed-…

    Submitted 21 March, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 18 pages, 9 figures

    Journal ref: Automatica, March, 2022

  48. arXiv:2108.12460  [pdf, other]

    eess.IV cs.CV eess.SP

    High Fidelity Deep Learning-based MRI Reconstruction with Instance-wise Discriminative Feature Matching Loss

    Authors: Ke Wang, Jonathan I Tamir, Alfredo De Goyeneche, Uri Wollner, Rafi Brada, Stella Yu, Michael Lustig

    Abstract: Purpose: To improve reconstruction fidelity of fine structures and textures in deep learning (DL) based reconstructions. Methods: A novel patch-based Unsupervised Feature Loss (UFLoss) is proposed and incorporated into the training of DL-based reconstruction frameworks in order to preserve perceptual similarity and high-order statistics. The UFLoss provides instance-level discrimination by mappi…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: 35 pages, 13 figures

  49. arXiv:2106.11330  [pdf, other]

    eess.IV cs.CV

    Context-aware PolyUNet for Liver and Lesion Segmentation from Abdominal CT Images

    Authors: Liping Zhang, Simon Chun-Ho Yu

    Abstract: Accurate liver and lesion segmentation from computed tomography (CT) images are highly demanded in clinical practice for assisting the diagnosis and assessment of hepatic tumor disease. However, automatic liver and lesion segmentation from contrast-enhanced CT volumes is extremely challenging due to the diversity in contrast, resolution, and quality of images. Previous methods based on UNet for 2D…

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: 7 pages and 3 figures

  50. Unsupervised Discriminative Learning of Sounds for Audio Event Classification

    Authors: Sascha Hornauer, Ke Li, Stella X. Yu, Shabnam Ghaffarzadegan, Liu Ren

    Abstract: Recent progress in network-based audio event classification has shown the benefit of pre-training models on visual data such as ImageNet. While this process allows knowledge transfer across different domains, training a model on large-scale visual datasets is time consuming. On several audio event classification benchmarks, we show a fast and effective alternative that pre-trains the model unsuper…

    Submitted 20 May, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | 978-1-7281-7605-5/20/$31.00 (c) 2021 IEEE | DOI: 10.1109/ICASSP39728.2021.9413482