Skip to main content

Showing 1–50 of 92 results for author: Gan, L

  1. arXiv:2407.13214  [pdf, other

    cs.CV

    TXL-PBC: a freely accessible labeled peripheral blood cell dataset

    Authors: Lu Gan, Xi Li

    Abstract: In a recent study, we found that publicly BCCD and BCD datasets have significant issues such as labeling errors, insufficient sample size, and poor data quality. To address these problems, we performed sample deletion, re-labeling, and integration of these two datasets. Additionally, we introduced the PBC and Raabin-WBC datasets, and ultimately created a high-quality, sample-balanced new dataset,… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2407.07614  [pdf, other

    cs.CV

    MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

    Authors: Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, LeiLei Gan, Hao Jiang

    Abstract: Auto-regressive models have made significant progress in the realm of language generation, yet they do not perform on par with diffusion models in the domain of image synthesis. In this work, we introduce MARS, a novel framework for T2I generation that incorporates a specially designed Semantic Vision-Language Integration Expert (SemVIE). This innovative component integrates pre-trained LLMs by in… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures

  3. arXiv:2407.03566  [pdf, ps, other

    cs.IT eess.SP

    Stacked Intelligent Metasurfaces for Wireless Sensing and Communication: Applications and Challenges

    Authors: Hao Liu, Jiancheng An, Xing Jia, Shining Lin, Xianghao Yao, Lu Gan, Bruno Clerckx, Chau Yuen, Mehdi Bennis, Mérouane Debbah

    Abstract: The rapid advancement of wireless communication technologies has precipitated an unprecedented demand for high data rates, extremely low latency, and ubiquitous connectivity. In order to achieve these goals, stacked intelligent metasurfaces (SIM) has been developed as a novel solution to perform advanced signal processing tasks directly in the electromagnetic wave domain, thus achieving ultra-fast… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 5 figures, 1 table

  4. arXiv:2406.16989  [pdf, other

    cs.LG cs.AI

    Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning

    Authors: Ziyu Zhao, Leilei Gan, Guoyin Wang, Yuwei Hu, Tao Shen, Hongxia Yang, Kun Kuang, Fei Wu

    Abstract: Low-Rank Adaptation (LoRA) offers an efficient way to fine-tune large language models (LLMs). Its modular and plug-and-play nature allows the integration of various domain-specific LoRAs, enhancing LLM capabilities. Open-source platforms like Huggingface and Modelscope have introduced a new computational paradigm, Uploadable Machine Learning (UML). In UML, contributors use decentralized data to tr… ▽ More

    Submitted 16 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.09997

  5. arXiv:2406.11793  [pdf, other

    cs.RO

    FetchBench: A Simulation Benchmark for Robot Fetching

    Authors: Beining Han, Meenal Parakh, Derek Geng, Jack A Defay, Luyang Gan, Jia Deng

    Abstract: Fetching, which includes approaching, grasping, and retrieving, is a critical challenge for robot manipulation tasks. Existing methods primarily focus on table-top scenarios, which do not adequately capture the complexities of environments where both grasping and planning are essential. To address this gap, we propose a new benchmark FetchBench, featuring diverse procedural scenes that integrate b… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2406.09486  [pdf, other

    cs.CV cs.AI

    SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets

    Authors: Shenghua Wan, Ziyuan Chen, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based offline reinforcement Learning (RL) is a promising approach that leverages existing data effectively in many real-world applications, especially those involving high-dimensional inputs like images and videos. To alleviate the distribution shift issue in offline RL, existing model-based methods heavily rely on the uncertainty of learned dynamics. However, the model uncertainty estimatio… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 23 pages, 10 figures

  7. arXiv:2406.09058  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook Design for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Ertugrul Basar, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can proactively reshape the characteristics of wireless channel environments. In RIS-assisted communication systems, the acquisition of channel state information (CSI) and the optimization of reflecting coefficients constitute major design challenges. To address these issues, codebook-based sol… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 36 pages, 12 figures, 2 tables, accepted by IEEE TCOM. arXiv admin note: text overlap with arXiv:2404.00265

  8. arXiv:2406.06852  [pdf, other

    cs.CR cs.AI cs.CL

    A Survey of Backdoor Attacks and Defenses on Large Language Models: Implications for Security Measures

    Authors: Shuai Zhao, Meihuizi Jia, Zhongliang Guo, Leilei Gan, Jie Fu, Yichao Feng, Fengjun Pan, Luu Anh Tuan

    Abstract: The large language models (LLMs), which bridge the gap between human language understanding and complex problem-solving, achieve state-of-the-art performance on several NLP tasks, particularly in few-shot and zero-shot settings. Despite the demonstrable efficacy of LMMs, due to constraints on computational resources, users have to engage with open-source language models or outsource the entire tra… ▽ More

    Submitted 13 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  9. arXiv:2405.13078  [pdf, other

    cs.LG

    Exploring Dark Knowledge under Various Teacher Capacities and Addressing Capacity Mismatch

    Authors: Xin-Chun Li, Wen-Shu Fan, Bowen Tao, Le Gan, De-Chuan Zhan

    Abstract: Knowledge Distillation (KD) could transfer the ``dark knowledge" of a well-performed yet large neural network to a weaker but lightweight one. From the view of output logits and softened probabilities, this paper goes deeper into the dark knowledge provided by teachers with different capacities. Two fundamental observations are: (1) a larger teacher tends to produce probability vectors that are le… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  10. arXiv:2405.06004  [pdf, other

    physics.ao-ph cs.AI cs.LG

    EWMoE: An effective model for global weather forecasting with mixture-of-experts

    Authors: Lihao Gan, Xin Man, Chenghong Zhang, Jie Shao

    Abstract: Weather forecasting is a crucial task for meteorologic research, with direct social and economic impacts. Recently, data-driven weather forecasting models based on deep learning have shown great potential, achieving superior performance compared with traditional numerical weather prediction methods. However, these models often require massive training data and computational resources. In this pape… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  11. arXiv:2404.14233  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

    Authors: Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu

    Abstract: The rapidly developing Large Vision Language Models (LVLMs) have shown notable capabilities on a range of multi-modal tasks, but still face the hallucination phenomena where the generated texts do not align with the given contexts, significantly restricting the usages of LVLMs. Most previous work detects and mitigates hallucination at the coarse-grained level or requires expensive annotation (e.g.… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  12. arXiv:2404.03386  [pdf, other

    cs.RO cs.AI cs.LG

    SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring

    Authors: Kaichen Huang, Minghao Shao, Shenghua Wan, Hai-Hang Sun, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: In many real-world visual Imitation Learning (IL) scenarios, there is a misalignment between the agent's and the expert's perspectives, which might lead to the failure of imitation. Previous methods have generally solved this problem by domain alignment, which incurs extra computation and storage costs, and these methods fail to handle the \textit{hard cases} where the viewpoint gap is too large.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  13. arXiv:2404.03382  [pdf, other

    cs.LG cs.AI

    DIDA: Denoised Imitation Learning based on Domain Adaptation

    Authors: Kaichen Huang, Hai-Hang Sun, Shenghua Wan, Minghao Shao, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: Imitating skills from low-quality datasets, such as sub-optimal demonstrations and observations with distractors, is common in real-world applications. In this work, we focus on the problem of Learning from Noisy Demonstrations (LND), where the imitator is required to learn from data with noise that often occurs during the processes of data collection or transmission. Previous IL methods improve t… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  14. arXiv:2404.00265  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can reshape the characteristics of wireless channels. In this paper, we propose a novel environment-aware codebook protocol for RIS-assisted multi-user multiple-input single-output (MU-MISO) systems. Specifically, we first introduce a channel training protocol which consists of off-line and on-… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures, accepted by VTC2024-Spring

  15. arXiv:2403.09976  [pdf, other

    cs.LG cs.CV

    AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors

    Authors: Yucen Wang, Shenghua Wan, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based methods have significantly contributed to distinguishing task-irrelevant distractors for visual control. However, prior research has primarily focused on heterogeneous distractors like noisy background videos, leaving homogeneous distractors that closely resemble controllable agents largely unexplored, which poses significant challenges to existing methods. To tackle this problem, we p… ▽ More

    Submitted 5 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  16. Stacked Intelligent Metasurface Enabled LEO Satellite Communications Relying on Statistical CSI

    Authors: Shining Lin, Jiancheng An, Lu Gan, Mérouane Debbah, Chau Yuen

    Abstract: Low earth orbit (LEO) satellite communication systems have gained increasing attention as a crucial supplement to terrestrial wireless networks due to their extensive coverage area. This letter presents a novel system design for LEO satellite systems by leveraging stacked intelligent metasurface (SIM) technology. Specifically, the lightweight and energy-efficient SIM is mounted on a satellite to a… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures, accepted by IEEE WCL

  17. Channel Estimation for Stacked Intelligent Metasurface-Assisted Wireless Networks

    Authors: Xianghao Yao, Jiancheng An, Lu Gan, Marco Di Renzo, Chau Yuen

    Abstract: Emerging technologies, such as holographic multiple-input multiple-output (HMIMO) and stacked intelligent metasurface (SIM), are driving the development of wireless communication systems. Specifically, the SIM is physically constructed by stacking multiple layers of metasurfaces and has an architecture similar to an artificial neural network (ANN), which can flexibly manipulate the electromagnetic… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 13 pages, 3 figures, accepted by IEEE WCL

  18. arXiv:2403.03215  [pdf, other

    cs.RO

    A Safety-Critical Framework for UGVs in Complex Environments: A Data-Driven Discrepancy-Aware Approach

    Authors: Skylar X. Wei, Lu Gan, Joel W. Burdick

    Abstract: This work presents a novel data-driven multi-layered planning and control framework for the safe navigation of a class of unmanned ground vehicles (UGVs) in the presence of unknown stationary obstacles and additive modeling uncertainties. The foundation of this framework is a novel robust model predictive planner, designed to generate optimal collision-free trajectories given an occupancy grid map… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  19. arXiv:2402.13532  [pdf, other

    cs.CL

    Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation

    Authors: Quanyu Long, Yue Deng, LeiLei Gan, Wenya Wang, Sinno Jialin Pan

    Abstract: Dense retrievers and retrieval-augmented language models have been widely used in various NLP applications. Despite being designed to deliver reliable and secure outcomes, the vulnerability of retrievers to potential attacks remains unclear, raising concerns about their security. In this paper, we introduce a novel scenario where the attackers aim to covertly disseminate targeted misinformation, s… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  20. arXiv:2402.12168  [pdf, other

    cs.CR cs.AI cs.CL

    Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning

    Authors: Shuai Zhao, Leilei Gan, Luu Anh Tuan, Jie Fu, Lingjuan Lyu, Meihuizi Jia, Jinming Wen

    Abstract: Recently, various parameter-efficient fine-tuning (PEFT) strategies for application to language models have been proposed and successfully implemented. However, this raises the question of whether PEFT, which only updates a limited set of model parameters, constitutes security vulnerabilities when confronted with weight-poisoning backdoor attacks. In this study, we show that PEFT is more susceptib… ▽ More

    Submitted 29 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: NAACL Findings 2024

  21. arXiv:2402.09997  [pdf, other

    cs.AI cs.CL cs.LG

    LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild

    Authors: Ziyu Zhao, Leilei Gan, Guoyin Wang, Wangchunshu Zhou, Hongxia Yang, Kun Kuang, Fei Wu

    Abstract: Low-Rank Adaptation (LoRA) provides an effective yet efficient solution for fine-tuning large language models (LLM). The modular and plug-and-play nature of LoRA enables the integration of diverse domain-specific LoRAs to enhance the capabilities of LLMs. Previous research on exploiting multiple LoRAs either focuses on specific isolated downstream tasks or fixes the selection of LoRAs during train… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  22. arXiv:2401.08939  [pdf, other

    cs.RO

    Enhancing Campus Mobility: Achievements and Challenges of Autonomous Shuttle "Snow Lion''

    Authors: Yingbing Chen, Jie Cheng, Sheng Wang, Hongji Liu, Xiaodong Mei, Xiaoyang Yan, Mingkai Tang, Ge Sun, Ya Wen, Junwei Cai, Xupeng Xie, Lu Gan, Mandan Chao, Ren Xin, Ming Liu, Jianhao Jiao, Kangcheng Liu, Lujia Wang

    Abstract: The rapid evolution of autonomous vehicles (AVs) has significantly influenced global transportation systems. In this context, we present ``Snow Lion'', an autonomous shuttle meticulously designed to revolutionize on-campus transportation, offering a safer and more efficient mobility solution for students, faculty, and visitors. The primary objective of this research is to enhance campus mobility b… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 9 pages, 9 figures

  23. arXiv:2311.09860  [pdf, other

    cs.CL

    GSAP-NER: A Novel Task, Corpus, and Baseline for Scholarly Entity Extraction Focused on Machine Learning Models and Datasets

    Authors: Wolfgang Otto, Matthäus Zloch, Lu Gan, Saurav Karmakar, Stefan Dietze

    Abstract: Named Entity Recognition (NER) models play a crucial role in various NLP tasks, including information extraction (IE) and text understanding. In academic writing, references to machine learning models and datasets are fundamental components of various computer science publications and necessitate accurate models for identification. Despite the advancements in NER, existing ground truth datasets do… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 pages, 1 figure, Accepted at EMNLP2023-Findings

  24. arXiv:2311.08179  [pdf, other

    eess.SP cs.AI

    Semi-Supervised Learning via Swapped Prediction for Communication Signal Recognition

    Authors: Weidong Wang, Hongshu Liao, Lu Gan

    Abstract: Deep neural networks have been widely used in communication signal recognition and achieved remarkable performance, but this superiority typically depends on using massive examples for supervised learning, whereas training a deep neural network on small datasets with few labels generally falls into overfitting, resulting in degenerated performance. To this end, we develop a semi-supervised learnin… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  25. IR-STP: Enhancing Autonomous Driving with Interaction Reasoning in Spatio-Temporal Planning

    Authors: Yingbing Chen, Jie Cheng, Lu Gan, Sheng Wang, Hongji Liu, Xiaodong Mei, Ming Liu

    Abstract: Considerable research efforts have been devoted to the development of motion planning algorithms, which form a cornerstone of the autonomous driving system (ADS). Nonetheless, acquiring an interactive and secure trajectory for the ADS remains challenging due to the complex nature of interaction modeling in planning. Modern planning methods still employ a uniform treatment of prediction outcomes an… ▽ More

    Submitted 15 February, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: 12 pages, 10 figures, accepted by IEEE-TITS at this January

    MSC Class: 68T40 ACM Class: I.0; J.2

  26. arXiv:2310.19372  [pdf, other

    cs.CV cs.AI cs.RO

    RGB-X Object Detection via Scene-Specific Fusion Modules

    Authors: Sri Aditya Deevi, Connor Lee, Lu Gan, Sushruth Nagesh, Gaurav Pandey, Soon-Jo Chung

    Abstract: Multimodal deep sensor fusion has the potential to enable autonomous vehicles to visually understand their surrounding environments in all weather conditions. However, existing deep sensor fusion methods usually employ convoluted architectures with intermingled multimodal features, requiring large coregistered multimodal datasets for training. In this work, we present an efficient and modular RGB-… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  27. arXiv:2310.11142  [pdf, other

    cs.CV cs.LG

    BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference

    Authors: Siqi Kou, Lei Gan, Dequan Wang, Chongxuan Li, Zhijie Deng

    Abstract: Diffusion models have impressive image generation capability, but low-quality generations still exist, and their identification remains challenging due to the lack of a proper sample-wise metric. To address this, we propose BayesDiff, a pixel-wise uncertainty estimator for generations from diffusion models based on Bayesian inference. In particular, we derive a novel uncertainty iteration principl… ▽ More

    Submitted 4 March, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  28. arXiv:2310.05629  [pdf, ps, other

    eess.AS cs.SD

    Super Denoise Net: Speech Super Resolution with Noise Cancellation in Low Sampling Rate Noisy Environments

    Authors: Junkang Yang, Hongqing Liu, Lu Gan, Yi Zhou

    Abstract: Speech super-resolution (SSR) aims to predict a high resolution (HR) speech signal from its low resolution (LR) corresponding part. Most neural SSR models focus on producing the final result in a noise-free environment by recovering the spectrogram of high-frequency part of the signal and concatenating it with the original low-frequency part. Although these methods achieve high accuracy, they beco… ▽ More

    Submitted 9 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  29. arXiv:2309.04945  [pdf, other

    cs.PL cs.SE

    O2ATH: An OpenMP Offloading Toolkit for the Sunway Heterogeneous Manycore Platform

    Authors: Haoran Lin, Lifeng Yan, Qixin Chang, Haitian Lu, Chenlin Li, Quanjie He, Zeyu Song, Xiaohui Duan, Zekun Yin, Yuxuan Li, Zhao Liu, Wei Xue, Haohuan Fu, Lin Gan, Guangwen Yang, Weiguo Liu

    Abstract: The next generation Sunway supercomputer employs the SW26010pro processor, which features a specialized on-chip heterogeneous architecture. Applications with significant hotspots can benefit from the great computation capacity improvement of Sunway many-core architectures by carefully making intensive manual many-core parallelization efforts. However, some legacy projects with large codebases, suc… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 15 pages, 6 figures, 5 tables,

  30. arXiv:2308.01475  [pdf, other

    stat.ML cs.LG stat.ME

    Interpretable Machine Learning for Discovery: Statistical Challenges \& Opportunities

    Authors: Genevera I. Allen, Luqin Gan, Lili Zheng

    Abstract: New technologies have led to vast troves of large and complex datasets across many scientific domains and industries. People routinely use machine learning techniques to not only process, visualize, and make predictions from this big data, but also to make data-driven discoveries. These discoveries are often made using Interpretable Machine Learning, or machine learning models and techniques that… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  31. arXiv:2307.09027  [pdf, other

    cs.CV cs.RO

    Online Self-Supervised Thermal Water Segmentation for Aerial Vehicles

    Authors: Connor Lee, Jonathan Gustafsson Frennert, Lu Gan, Matthew Anderson, Soon-Jo Chung

    Abstract: We present a new method to adapt an RGB-trained water segmentation network to target-domain aerial thermal imagery using online self-supervision by leveraging texture and motion cues as supervisory signals. This new thermal capability enables current autonomous aerial robots operating in near-shore environments to perform tasks such as visual navigation, bathymetry, and flow tracking at night. Our… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 8 pages, 4 figures, 3 tables

  32. arXiv:2307.08056  [pdf, ps, other

    math.CO cs.CC

    An algorithmic version of the Hajnal--Szemerédi theorem

    Authors: Luyining Gan, Jie Han, Jie Hu

    Abstract: A $K_r$-factor of a graph $G$ is a collection of vertex disjoint $r$-cliques covering $V(G)$. We prove the following algorithmic version of the classical Hajnal--Szemerédi Theorem in graph theory, when $r$ is considered as a constant. Given $r, c, n\in \mathbb{N}$ such that $n\in r\mathbb N$, let $G$ be an $n$-vertex graph with minimum degree at least $(1-1/r)n - c$. Then there is an algorithm wit… ▽ More

    Submitted 8 July, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: 31 pages

  33. arXiv:2306.13895  [pdf, other

    eess.SP cs.CV

    Open-Set RF Fingerprinting via Improved Prototype Learning

    Authors: Weidong Wang, Hongshu Liao, Lu Gan

    Abstract: Deep learning has been widely used in radio frequency (RF) fingerprinting. Despite its excellent performance, most existing methods only consider a closed-set assumption, which cannot effectively tackle signals emitted from those unknown devices that have never been seen during training. In this letter, we exploit prototype learning for open-set RF fingerprinting and propose two improvements, incl… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  34. arXiv:2306.13893  [pdf, other

    eess.SP cs.AI cs.CV

    Radio Generation Using Generative Adversarial Networks with An Unrolled Design

    Authors: Weidong Wang, Jiancheng An, Hongshu Liao, Lu Gan, Chau Yuen

    Abstract: As a revolutionary generative paradigm of deep learning, generative adversarial networks (GANs) have been widely applied in various fields to synthesize realistic data. However, it is challenging for conventional GANs to synthesize raw signal data, especially in some complex cases. In this paper, we develop a novel GAN framework for radio generation called "Radio GAN". Compared to conventional met… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Submitted to IEEE Transactions on Cognitive Communications and Networking on 20-Dec-2022

  35. arXiv:2306.04985  [pdf, other

    cs.LG

    Beyond Probability Partitions: Calibrating Neural Networks with Semantic Aware Grouping

    Authors: Jia-Qi Yang, De-Chuan Zhan, Le Gan

    Abstract: Research has shown that deep networks tend to be overly optimistic about their predictions, leading to an underestimation of prediction errors. Due to the limited nature of data, existing studies have proposed various methods based on model prediction probabilities to bin the data and evaluate calibration error. We propose a more generalized definition of calibration error called Partitioned Calib… ▽ More

    Submitted 21 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: NeurIPS'23. https://github.com/ThyrixYang/group_calibration

  36. arXiv:2305.11104  [pdf, other

    eess.AS cs.AI

    mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra

    Authors: Chenhao Shuai, Chaohua Shi, Lu Gan, Hongqing Liu

    Abstract: Speech super-resolution (SSR) aims to recover a high resolution (HR) speech from its corresponding low resolution (LR) counterpart. Recent SSR methods focus more on the reconstruction of the magnitude spectrogram, ignoring the importance of phase reconstruction, thereby limiting the recovery quality. To address this issue, we propose mdctGAN, a novel SSR framework based on modified discrete cosine… ▽ More

    Submitted 19 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 5 pages, 4 figures, INTERSPEECH 2023

  37. arXiv:2304.14795  [pdf, ps, other

    eess.SP cs.CV

    Semi-Supervised RF Fingerprinting with Consistency-Based Regularization

    Authors: Weidong Wang, Cheng Luo, Jiancheng An, Lu Gan, Hongshu Liao, Chau Yuen

    Abstract: As a promising non-password authentication technology, radio frequency (RF) fingerprinting can greatly improve wireless security. Recent work has shown that RF fingerprinting based on deep learning can significantly outperform conventional approaches. The superiority, however, is mainly attributed to supervised learning using a large amount of labeled data, and it significantly degrades if only li… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: 12 pages, 15 figures, submitted to IEEE Internet of Things Journal

  38. Validating quantum-supremacy experiments with exact and fast tensor network contraction

    Authors: Yong Liu, Yaojian Chen, Chu Guo, Jiawei Song, Xinmin Shi, Lin Gan, Wenzhao Wu, Wei Wu, Haohuan Fu, Xin Liu, Dexun Chen, Zhifeng Zhao, Guangwen Yang, Jiangang Gao

    Abstract: The quantum supremacy experiment, such as Google Sycamore [Nature \textbf{574}, 505 (2019)], poses great challenge for classical verification due to the exponentially-increasing compute cost. Using a new-generation Sunway supercomputer within $8.5$ days, we provide a direct verification by computing three million exact amplitudes for the experimentally generated bitstrings, obtaining an XEB fideli… ▽ More

    Submitted 16 January, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 5 pages, 3 figures, comments are welcome!

    Journal ref: Physical Review Letters, 132, 030601 (2024)

  39. arXiv:2211.08238  [pdf, other

    cs.CL cs.AI

    Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction

    Authors: Leilei Gan, Baokui Li, Kun Kuang, Yating Zhang, Lei Wang, Luu Anh Tuan, Yi Yang, Fei Wu

    Abstract: Given the fact description text of a legal case, legal judgment prediction (LJP) aims to predict the case's charge, law article and penalty term. A core problem of LJP is how to distinguish confusing legal cases, where only subtle text differences exist. Previous studies fail to distinguish different classification errors with a standard cross-entropy classification loss, and ignore the numbers in… ▽ More

    Submitted 21 October, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted to Findings of EMNLP 2023

  40. arXiv:2210.08548  [pdf, other

    cs.CL cs.AI

    Investigating the Robustness of Natural Language Generation from Logical Forms via Counterfactual Samples

    Authors: Chengyuan Liu, Leilei Gan, Kun Kuang, Fei Wu

    Abstract: The aim of Logic2Text is to generate controllable and faithful texts conditioned on tables and logical forms, which not only requires a deep understanding of the tables and logical forms, but also warrants symbolic reasoning over the tables. State-of-the-art methods based on pre-trained models have achieved remarkable performance on the standard test dataset. However, we question whether these met… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: Accepted to appear at the main conference of EMNLP 2022

  41. arXiv:2210.04367  [pdf, other

    cs.CV cs.RO

    Unsupervised RGB-to-Thermal Domain Adaptation via Multi-Domain Attention Network

    Authors: Lu Gan, Connor Lee, Soon-Jo Chung

    Abstract: This work presents a new method for unsupervised thermal image classification and semantic segmentation by transferring knowledge from the RGB domain using a multi-domain attention network. Our method does not require any thermal annotations or co-registered RGB-thermal pairs, enabling robots to perform visual tasks at night and in adverse weather conditions without incurring additional costs of d… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  42. arXiv:2207.09766  [pdf, ps, other

    cs.IT eess.SP

    K-Means Based Constellation Optimization for Index Modulated Reconfigurable Intelligent Surfaces

    Authors: Hao Liu, Jiancheng An, Wangyang Xu, Xing Jia, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) has recently emerged as a promising technology enabling next-generation wireless networks. In this letter, we develop an improved index modulation (IM) scheme by utilizing RIS to convey information. Specifically, we study an RIS-aided multiple-input single-output (MISO) system, in which the information bits are conveyed by reflection patterns of RIS rather… ▽ More

    Submitted 31 May, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: 5 pages, 3 figures, accepted by IEEE CL

  43. Energy-based Legged Robots Terrain Traversability Modeling via Deep Inverse Reinforcement Learning

    Authors: Lu Gan, Jessy W. Grizzle, Ryan M. Eustice, Maani Ghaffari

    Abstract: This work reports on developing a deep inverse reinforcement learning method for legged robots terrain traversability modeling that incorporates both exteroceptive and proprioceptive sensory data. Existing works use robot-agnostic exteroceptive environmental features or handcrafted kinematic features; instead, we propose to also learn robot-specific inertial features from proprioceptive sensory da… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  44. arXiv:2206.08864  [pdf, other

    cs.LG cs.MM cs.SD eess.AS

    Avoid Overfitting User Specific Information in Federated Keyword Spotting

    Authors: Xin-Chun Li, Jin-Lin Tang, Shaoming Song, Bingshuai Li, Yinchuan Li, Yunfeng Shao, Le Gan, De-Chuan Zhan

    Abstract: Keyword spotting (KWS) aims to discriminate a specific wake-up word from other signals precisely and efficiently for different users. Recent works utilize various deep networks to train KWS models with all users' speech data centralized without considering data privacy. Federated KWS (FedKWS) could serve as a solution without directly sharing users' data. However, the small amount of data, differe… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted by Interspeech 2022

  45. arXiv:2206.02088  [pdf, other

    stat.ML cs.LG stat.ME

    Model-Agnostic Confidence Intervals for Feature Importance: A Fast and Powerful Approach Using Minipatch Ensembles

    Authors: Luqin Gan, Lili Zheng, Genevera I. Allen

    Abstract: To promote new scientific discoveries from complex data sets, feature importance inference has been a long-standing statistical problem. Instead of testing for parameters that are only interpretable for specific models, there has been increasing interest in model-agnostic methods, often in the form of feature occlusion or leave-one-covariate-out (LOCO) inference. Existing approaches often make dis… ▽ More

    Submitted 24 January, 2023; v1 submitted 4 June, 2022; originally announced June 2022.

  46. arXiv:2205.08788  [pdf, ps, other

    eess.SP cs.AI

    Deep Reinforcement Learning Based on Location-Aware Imitation Environment for RIS-Aided mmWave MIMO Systems

    Authors: Wangyang Xu, Jiancheng An, Chongwen Huang, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) has recently gained popularity as a promising solution for improving the signal transmission quality of wireless communications with less hardware cost and energy consumption. This letter offers a novel deep reinforcement learning (DRL) algorithm based on a location-aware imitation environment for the joint beamforming design in an RIS-aided mmWave multiple… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: 14 pages, 4 figures

  47. Lifetime-based Optimization for Simulating Quantum Circuits on a New Sunway Supercomputer

    Authors: Yaojian Chen, Yong Liu, Xinmin Shi, Jiawei Song, Xin Liu, Lin Gan, Chu Guo, Haohuan Fu, Jie Gao, Dexun Chen, Guangwen Yang

    Abstract: High-performance classical simulator for quantum circuits, in particular the tensor network contraction algorithm, has become an important tool for the validation of noisy quantum computing. In order to address the memory limitations, the slicing technique is used to reduce the tensor dimensions, but it could also lead to additional computation overhead that greatly slows down the overall performa… ▽ More

    Submitted 27 March, 2023; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: 12 pages, 13 figures

    Journal ref: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (2023) 148-159

  48. RNGDet: Road Network Graph Detection by Transformer in Aerial Images

    Authors: Zhenhua Xu, Yuxuan Liu, Lu Gan, Yuxiang Sun, Xinyu Wu, Ming Liu, Lujia Wang

    Abstract: Road network graphs provide critical information for autonomous-vehicle applications, such as drivable areas that can be used for motion planning algorithms. To find road network graphs, manually annotation is usually inefficient and labor-intensive. Automatically detecting road network graphs could alleviate this issue, but existing works still have some limitations. For example, segmentation-bas… ▽ More

    Submitted 26 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Accepted by IEEE Transactions on Geoscience and Remote Sensing

  49. arXiv:2202.04246  [pdf, ps, other

    math.CO cs.CC

    On the Keevash-Knox-Mycroft Conjecture

    Authors: Luyining Gan, Jie Han

    Abstract: Given $1\le \ell <k$ and $δ\ge0$, let $\textbf{PM}(k,\ell,δ)$ be the decision problem for the existence of perfect matchings in $n$-vertex $k$-uniform hypergraphs with minimum $\ell$-degree at least $δ\binom{n-\ell}{k-\ell}$. For $k\ge 3$, the decision problem in general $k$-uniform hypergraphs, equivalently $\textbf{PM}(k,\ell,0)$, is one of Karp's 21 NP-complete problems. Moreover, for $k\ge 3$,… ▽ More

    Submitted 28 October, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: v1 is the conference version; v2 is the journal version

  50. arXiv:2111.07970  [pdf, other

    cs.CL cs.AI cs.CR

    Triggerless Backdoor Attack for NLP Tasks with Clean Labels

    Authors: Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan

    Abstract: Backdoor attacks pose a new threat to NLP models. A standard strategy to construct poisoned data in backdoor attacks is to insert triggers (e.g., rare words) into selected sentences and alter the original label to a target label. This strategy comes with a severe flaw of being easily detected from both the trigger and the label perspectives: the trigger injected, which is usually a rare word, lead… ▽ More

    Submitted 27 April, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted to appear at the main conference of NAACL 2022