Skip to main content

Showing 1–50 of 59 results for author: Ma, T

  1. arXiv:2407.07702  [pdf, other

    cs.IT eess.SP

    Leveraging Self-Supervised Learning for MIMO-OFDM Channel Representation and Generation

    Authors: Zongxi Liu, Jiacheng Chen, Yunting Xu, Ting Ma, Jingbo Liu, Haibo Zhou, Dusit Niyato

    Abstract: In communications theory, the capacity of multiple input multiple output-orthogonal frequency division multiplexing (MIMO-OFDM) systems is fundamentally determined by wireless channels, which exhibit both diversity and correlation in spatial, frequency and temporal domains. It is further envisioned to exploit the inherent nature of channels, namely representation, to achieve geolocation-based MIMO… ▽ More

    Submitted 23 May, 2024; originally announced July 2024.

  2. arXiv:2407.02052  [pdf, other

    eess.AS cs.SD

    The USTC-NERCSLIP Systems for The ICMC-ASR Challenge

    Authors: Minghui Wu, Luzhen Xu, Jie Zhang, Haitao Tang, Yanyan Yue, Ruizhi Liao, Jintao Zhao, Zhengzhe Zhang, Yichi Wang, Haoyin Yan, Hongliang Yu, Tongle Ma, Jiachen Liu, Chongliang Wu, Yongchao Li, Yanyong Zhang, Xin Fang, Yue Zhang

    Abstract: This report describes the submitted system to the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) challenge, which considers the ASR task with multi-speaker overlapping and Mandarin accent dynamics in the ICMC case. We implement the front-end speaker diarization using the self-supervised learning representation based multi-speaker embedding and beamforming using the speaker position,… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ICASSP 2024

  3. arXiv:2407.01517  [pdf, other

    eess.IV cs.CV cs.LG

    Centerline Boundary Dice Loss for Vascular Segmentation

    Authors: Pengcheng Shi, Jiesi Hu, Yanwu Yang, Zilve Gao, Wei Liu, Ting Ma

    Abstract: Vascular segmentation in medical imaging plays a crucial role in analysing morphological and functional assessments. Traditional methods, like the centerline Dice (clDice) loss, ensure topology preservation but falter in capturing geometric details, especially under translation and deformation. The combination of clDice with traditional Dice loss can lead to diameter imbalance, favoring larger ves… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: accepted by MICCAI 2024

  4. arXiv:2406.14064  [pdf, other

    cs.IT eess.SP

    PAPR Reduction with Pre-chirp Selection for Affine Frequency Division Multiple

    Authors: Haozhi Yuan, Yin Xu, Xinghao Guo, Tianyao Ma, Haoyang Li, Dazhi He, Wenjun Zhang

    Abstract: Affine frequency division multiplexing (AFDM) is a promising new multicarrier technique based on discrete affine Fourier transform (DAFT). By properly tuning pre-chirp parameter and post-chirp parameter in the DAFT, the effective channel in the DAFT domain can completely avoid overlap of different paths, thus constitutes a full representation of delay-Doppler profile, which significantly improves… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2406.02166  [pdf, other

    cs.SD cs.CL eess.AS

    Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision

    Authors: Saierdaer Yusuyin, Te Ma, Hao Huang, Wenbo Zhao, Zhijian Ou

    Abstract: There exist three approaches for multilingual and crosslingual automatic speech recognition (MCL-ASR) - supervised pre-training with phonetic or graphemic transcription, and self-supervised pre-training. We find that pre-training with phonetic supervision has been underappreciated so far for MCL-ASR, while conceptually it is more advantageous for information sharing between different languages. Th… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2405.20595  [pdf, other

    eess.SP

    Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities

    Authors: Yinxiao Zhuo, Tianqi Mao, Haojin Li, Chen Sun, Zhaocheng Wang, Zhu Han, Sheng Chen

    Abstract: Integrated sensing and communication (ISAC) has been envisioned as a critical enabling technology for the next-generation wireless communication, which can realize location/motion detection of surroundings with communication devices. This additional sensing capability leads to a substantial network quality gain and expansion of the service scenarios. As the system evolves to millimeter wave (mmWav… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  7. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  8. arXiv:2405.03119  [pdf, ps, other

    cs.IT eess.SP

    DAFT-Spread Affine Frequency Division Multiple Access for Downlink Transmission

    Authors: Yiwei Tao, Miaowen Wen, Yao Ge, Tianqi Mao, Lixia Xiao, Jun Li

    Abstract: Affine frequency division multiplexing (AFDM) and orthogonal AFDM access (O-AFDMA) are promising techniques based on chirp signals, which are able to suppress the performance deterioration caused by Doppler shifts in high-mobility scenarios. However, the high peak-to-average power ratio (PAPR) in AFDM or O-AFDMA is still a crucial problem, which severely limits their practical applications. In thi… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  9. arXiv:2404.10232  [pdf, other

    eess.SP

    Channel Estimation for AFDM With Superimposed Pilots

    Authors: Kai Zheng, Miaowen Wen, Tianqi Mao, Lixia Xiao, Zhaocheng Wang

    Abstract: The recent proposed affine frequency division multiplexing (AFDM) employing a multi-chirp waveform has shown its reliability and robustness in doubly selective fading channels. In the existing embedded pilot-aided channel estimation methods, the presence of guard symbols in the discrete affine Fourier transform (DAFT) domain causes inevitable degradation of the spectral efficiency (SE). To improve… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  10. arXiv:2404.08366  [pdf, other

    eess.SP

    Intelligent Reflecting Surface-Enabled Anti-Detection for Secure Sensing and Communications

    Authors: Beixiong Zheng, Xue Xiong, Tiantian Ma, Jie Tang, Derrick Wing Kwan Ng, A. Lee Swindlehurst, Rui Zhang

    Abstract: The ever-increasing reliance on wireless communication and sensing has led to growing concerns over the vulnerability of sensitive information to unauthorized detection and interception. Traditional anti-detection methods are often inadequate, suffering from limited adaptability and diminished effectiveness against advanced detection technologies. To overcome these challenges, this article present… ▽ More

    Submitted 21 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 7 pages, 5 figures

  11. arXiv:2404.05257  [pdf, other

    eess.SP

    Sensing-Resistance-Oriented Beamforming for Privacy Protection from ISAC Devices

    Authors: Teng Ma, Yue Xiao, Xia Lei, Ming Xiao

    Abstract: With the evolution of integrated sensing and communication (ISAC) technology, a growing number of devices go beyond conventional communication functions with sensing abilities. Therefore, future networks are divinable to encounter new privacy concerns on sensing, such as the exposure of position information to unintended receivers. In contrast to traditional privacy preserving schemes aiming to pr… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at WS29 ICC 2024 Workshop - ISAC6G

  12. arXiv:2402.15185  [pdf, other

    cs.IT eess.SP

    Pre-Chirp-Domain Index Modulation for Affine Frequency Division Multiplexing

    Authors: Guangyao Liu, Tianqi Mao, Ruiqi Liu, Zhenyu Xiao

    Abstract: Affine frequency division multiplexing (AFDM), tailored as a novel multicarrier technique utilizing chirp signals for high-mobility communications, exhibits marked advantages compared to traditional orthogonal frequency division multiplexing (OFDM). AFDM is based on the discrete affine Fourier transform (DAFT) with two modifiable parameters of the chirp signals, termed as the pre-chirp parameter a… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  13. Real-Time Asphalt Pavement Layer Thickness Prediction Using Ground-Penetrating Radar Based on a Modified Extended Common Mid-Point (XCMP) Approach

    Authors: Siqi Wang, Zhen Leng, Xin Sui, Weiguang Zhang, Tao Ma, Zehui Zhu

    Abstract: The conventional surface reflection method has been widely used to measure the asphalt pavement layer dielectric constant using ground-penetrating radar (GPR). This method may be inaccurate for in-service pavement thickness estimation with dielectric constant variation through the depth, which could be addressed using the extended common mid-point method (XCMP) with air-coupled GPR antennas. Howev… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: IEEE Transactions on Intelligent Transportation Systems (2024)

  14. arXiv:2401.00283  [pdf, other

    cs.IT eess.SP

    Near-Space Communications: the Last Piece of 6G Space-Air-Ground-Sea Integrated Network Puzzle

    Authors: Hongshan Liu, Tong Qin, Zhen Gao, Tianqi Mao, Keke Ying, Ziwei Wan, Li Qiao, Rui Na, Zhongxiang Li, Chun Hu, Yikun Mei, Tuan Li, Guanghui Wen, Lei Chen, Zhonghuai Wu, Ruiqi Liu, Gaojie Chen, Shuo Wang, Dezhi Zheng

    Abstract: This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis… ▽ More

    Submitted 4 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 28 pages, 8 figures, 2 tables

  15. arXiv:2311.07249  [pdf, other

    eess.SP

    Near-Field Sparse Channel Estimation for Extremely Large-Scale RIS-Aided Wireless Communications

    Authors: Zixing Tang, Yuanbin Chen, Ying Wang, Tianqi Mao, Qingqing Wu, Marco Di Renzo, Lajos Hanzo

    Abstract: A significant increase in the number of reconfigurable intelligent surface (RIS) elements results in a spherical wavefront in the near field of extremely large-scale RIS (XL-RIS). Although the channel matrix of the cascaded two-hop link may become sparse in the polar-domain representation, their accurate estimation of these polar-domain parameters cannot be readily guaranteed. To tackle this chall… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: This paper has been accepted for publication in the IEEE GLOBECOM 2023 Workshops Proceedings

  16. arXiv:2311.04483  [pdf, other

    eess.SP

    Cross-Domain Dual-Functional OFDM Waveform Design for Accurate Sensing/Positioning

    Authors: Fan Zhang, Tianqi Mao, Ruiqi Liu, Zhu Han, Sheng Chen, Zhaocheng Wang

    Abstract: Orthogonal frequency division multiplexing (OFDM) has been widely recognized as the representative waveform for 5G wireless networks, which can directly support sensing/positioning with existing infrastructure. To guarantee superior sensing/positioning accuracy while supporting high-speed communications simultaneously, the dual functions tend to be assigned with different resource elements (REs) d… ▽ More

    Submitted 19 March, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  17. arXiv:2309.06421  [pdf, other

    eess.IV cs.CV

    AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer

    Authors: Tao Ma, Chao Zhang, Min Lu, Lin Luo

    Abstract: Renal pathology, as the gold standard of kidney disease diagnosis, requires doctors to analyze a series of tissue slices stained by H&E staining and special staining like Masson, PASM, and PAS, respectively. These special staining methods are costly, time-consuming, and hard to standardize for wide use especially in primary hospitals. Advances of supervised learning methods have enabled the virtua… ▽ More

    Submitted 17 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: BMVC 2023

  18. arXiv:2306.15875  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion

    Authors: Zhe Ye, Terui Mao, Li Dong, Diqun Yan

    Abstract: Deep speech classification has achieved tremendous success and greatly promoted the emergence of many real-world applications. However, backdoor attacks present a new security threat to it, particularly with untrustworthy third-party platforms, as pre-defined triggers set by the attacker can activate the backdoor. Most of the triggers in existing speech backdoor attacks are sample-agnostic, and ev… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted by INTERSPEECH 2023

    Journal ref: Proc. INTERSPEECH 2023, pp. 4923-4927

  19. arXiv:2305.15911  [pdf, other

    eess.IV cs.CV

    NexToU: Efficient Topology-Aware U-Net for Medical Image Segmentation

    Authors: Pengcheng Shi, Xutao Guo, Yanwu Yang, Chenfei Ye, Ting Ma

    Abstract: Convolutional neural networks (CNN) and Transformer variants have emerged as the leading medical image segmentation backbones. Nonetheless, due to their limitations in either preserving global image context or efficiently processing irregular shapes in visual objects, these backbones struggle to effectively integrate information from diverse anatomical regions and reduce inter-individual variabili… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 13 pages, 6 figures

  20. arXiv:2305.15184  [pdf, other

    cs.IT cs.NI eess.SP eess.SY

    6G Enabled Advanced Transportation Systems

    Authors: Ruiqi Liu, Meng Hua, Ke Guan, Xiping Wang, Leyi Zhang, Tianqi Mao, Di Zhang, Qingqing Wu, Abbas Jamalipour

    Abstract: With the emergence of communication services with stringent requirements such as autonomous driving or on-flight Internet, the sixth-generation (6G) wireless network is envisaged to become an enabling technology for future transportation systems. In this paper, two ways of interactions between 6G networks and transportation are extensively investigated. On one hand, the new usage scenarios and cap… ▽ More

    Submitted 11 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems (T-ITS)

    Journal ref: IEEE Transactions on Intelligent Transportation Systems (2024) 1-17

  21. arXiv:2305.07234  [pdf, other

    eess.SP

    Doppler-Resilient Design of CAZAC Sequences for mmWave/THz Sensing Applications

    Authors: Fan Zhang, Tianqi Mao, Zhaocheng Wang

    Abstract: Ultra-high-resolution target sensing has emerged as a key enabler for various cutting-edge applications, which can be realized by utilizing the millimeter wave/terahertz frequencies. However, the extremely high operating frequency inevitably leads to significant Doppler shift effects, especially for high-mobility applications, causing the degradation of sensing performance with high false alarm ra… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  22. arXiv:2302.14277  [pdf, other

    eess.IV cs.CV

    DECOR-NET: A COVID-19 Lung Infection Segmentation Network Improved by Emphasizing Low-level Features and Decorrelating Features

    Authors: Jiesi Hu, Yanwu Yang, Xutao Guo, Ting Ma

    Abstract: Since 2019, coronavirus Disease 2019 (COVID-19) has been widely spread and posed a serious threat to public health. Chest Computed Tomography (CT) holds great potential for screening and diagnosis of this disease. The segmentation of COVID-19 CT imaging can achieves quantitative evaluation of infections and tracks disease progression. COVID-19 infections are characterized by high heterogeneity and… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  23. arXiv:2301.13402  [pdf, other

    cs.CV eess.IV

    ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing

    Authors: Bingchuan Li, Tianxiang Ma, Peng Zhang, Miao Hua, Wei Liu, Qian He, Zili Yi

    Abstract: The StyleGAN family succeed in high-fidelity image generation and allow for flexible and plausible editing of generated images by manipulating the semantic-rich latent style space.However, projecting a real image into its latent space encounters an inherent trade-off between inversion quality and editability. Existing encoder-based or optimization-based StyleGAN inversion methods attempt to mitiga… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  24. arXiv:2210.17408  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Accelerating Diffusion Models via Pre-segmentation Diffusion Sampling for Medical Image Segmentation

    Authors: Xutao Guo, Yanwu Yang, Chenfei Ye, Shang Lu, Yang Xiang, Ting Ma

    Abstract: Based on the Denoising Diffusion Probabilistic Model (DDPM), medical image segmentation can be described as a conditional image generation task, which allows to compute pixel-wise uncertainty maps of the segmentation and allows an implicit ensemble of segmentations to boost the segmentation performance. However, DDPM requires many iterative denoising steps to generate segmentations from Gaussian n… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

  25. arXiv:2210.13721  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-modal Dynamic Graph Network: Coupling Structural and Functional Connectome for Disease Diagnosis and Classification

    Authors: Yanwu Yang, Xutao Guo, Zhikai Chang, Chenfei Ye, Yang Xiang, Ting Ma

    Abstract: Multi-modal neuroimaging technology has greatlly facilitated the efficiency and diagnosis accuracy, which provides complementary information in discovering objective disease biomarkers. Conventional deep learning methods, e.g. convolutional neural networks, overlook relationships between nodes and fail to capture topological properties in graphs. Graph neural networks have been proven to be of gre… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  26. arXiv:2210.08997  [pdf, other

    cs.CV cs.LG eess.IV

    AIM 2022 Challenge on Instagram Filter Removal: Methods and Results

    Authors: Furkan Kınlı, Sami Menteş, Barış Özcan, Furkan Kıraç, Radu Timofte, Yi Zuo, Zitao Wang, Xiaowen Zhang, Yu Zhu, Chenghua Li, Cong Leng, Jian Cheng, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Tianzhi Ma, Zihan Gao, Wenxin He, Woon-Ha Yeo, Wang-Taek Oh, Young-Il Kim, Han-Cheol Ryu, Gang He , et al. (8 additional authors not shown)

    Abstract: This paper introduces the methods and the results of AIM 2022 challenge on Instagram Filter Removal. Social media filters transform the images by consecutive non-linear operations, and the feature maps of the original content may be interpolated into a different domain. This reduces the overall performance of the recent deep learning strategies. The main goal of this challenge is to produce realis… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 14 pages, 9 figures, Challenge report of AIM 2022 Instagram Filter Removal Challenge in conjunction with ECCV 2022

  27. arXiv:2209.08933  [pdf, ps, other

    eess.IV cs.CV

    Estimating Brain Age with Global and Local Dependencies

    Authors: Yanwu Yang, Xutao Guo, Zhikai Chang, Chenfei Ye, Yang Xiang, Haiyan Lv, Ting Ma

    Abstract: The brain age has been proven to be a phenotype of relevance to cognitive performance and brain disease. Achieving accurate brain age prediction is an essential prerequisite for optimizing the predicted brain-age difference as a biomarker. As a comprehensive biological characteristic, the brain age is hard to be exploited accurately with models using feature engineering and local processing such a… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  28. arXiv:2207.11945  [pdf, other

    eess.SP

    Terahertz-Band Near-Space Communications: From a Physical-Layer Perspective

    Authors: Tianqi Mao, Leyi Zhang, Zhenyu Xiao, Zhu Han, Xiang-Gen Xia

    Abstract: Facilitated by rapid technological development of the near-space platform stations (NSPS), near-space communication (NS-COM) is envisioned to play a pivotal role in the space-air-ground integrated network for sixth-generation (6G) communications and beyond. In NS-COM, ultra-broadband wireless connectivity between NSPSs and various airborne/spaceborne platforms is required for a plethora of bandwid… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  29. arXiv:2207.11896  [pdf, ps, other

    eess.SP

    LEO Satellite Access Network (LEO-SAN) Towards 6G: Challenges and Approaches

    Authors: Zhenyu Xiao, Junyi Yang, Tianqi Mao, Chong Xu, Rui Zhang, Zhu Han, Xiang-Gen Xia

    Abstract: With the rapid development of satellite communication technologies, the space-based access network has been envisioned as a promising complementary part of the future 6G network. Aside from terrestrial base stations, satellite nodes, especially the low-earth-orbit (LEO) satellites, can also serve as base stations for Internet access, and constitute the LEO-satellite-based access network (LEO-SAN).… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

  30. arXiv:2207.11883  [pdf, other

    eess.SP

    Near Space Communications (NS-COM): A New Regime in Space-Air-Ground Integrated Network (SAGIN)

    Authors: Zhenyu Xiao, Tianqi Mao, Zhu Han, Xiang-Gen Xia

    Abstract: Precipitated by the technological innovations of the near-space platform stations (NSPS), the near space communication (NS-COM) network has emerged as an indispensable part of the next-generation space-air-ground integrated network (SAGIN) that facilitates ubiquitous coverage and broadband data transfer. This paper aims to provide a comprehensive overview of NS-COM. Firstly, we investigate the dif… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

  31. arXiv:2205.03122  [pdf

    physics.med-ph eess.IV physics.optics

    Ultrathin, high-speed, all-optical photoacoustic endomicroscopy probe for guiding minimally invasive surgery

    Authors: Tianrui Zhao, Truc Thuy Pham, Christian Baker, Michelle T. Ma, Sebastien Ourselin, Tom Vercauteren, Edward Zhang, Paul C. Beard, Wenfeng Xia

    Abstract: Photoacoustic (PA) endoscopy has shown significant potential for clinical diagnosis and surgical guidance. Multimode fibres (MMFs) are becoming increasing attractive for the development of miniature endoscopy probes owing to ultrathin size, low cost and diffraction-limited spatial resolution enabled by wavefront shaping. However, current MMF-based PA endomicroscopy probes are either limited by a b… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  32. arXiv:2203.10091  [pdf, other

    eess.IV cs.CV

    Label conditioned segmentation

    Authors: Tianyu Ma, Benjamin C. Lee, Mert R. Sabuncu

    Abstract: Semantic segmentation is an important task in computer vision that is often tackled with convolutional neural networks (CNNs). A CNN learns to produce pixel-level predictions through training on pairs of images and their corresponding ground-truth segmentation labels. For segmentation tasks with multiple classes, the standard approach is to use a network that computes a multi-channel probabilistic… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: MIDL 2022

  33. arXiv:2202.02701  [pdf, other

    eess.IV cs.CV

    Hyper-Convolutions via Implicit Kernels for Medical Imaging

    Authors: Tianyu Ma, Alan Q. Wang, Adrian V. Dalca, Mert R. Sabuncu

    Abstract: The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares weights across all pixels. A standard CNN's capacity, and thus its performance, is directly related to the number of learnable kernel weights, which is determin… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.10559

  34. Survey of charging scheduling, fleet management, and location planning of charging stations for electrified demand-responsive transport systems: methodologies and recent developments

    Authors: Tai-Yu Ma, Yumeng Fang

    Abstract: The accelerated electrification of transport systems with EVs has brought new challenges for charging scheduling, fleet management, and charging infrastructure location and configuration planning. In this review, we have provided a systematic review of the recent development in strategic, tactical, and operational decisions for demand responsive transport system planning using electric vehicles (E… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  35. arXiv:2110.14064  [pdf, other

    eess.SY

    How will electric vehicles affect traffic congestion and energy consumption: an integrated modelling approach

    Authors: Artur Grigorev, Tuo Mao, Adam Berry, Joachim Tan, Loki Purushothaman, Adriana-Simona Mihaita

    Abstract: This paper explores the impact of electric vehicles (EVs) on traffic congestion and energy consumption by proposing an integrated bi-level framework comprising of: a) a dynamic micro-scale traffic simulation suitable for modelling current and hypothetical traffic and charging demand scenarios and b) a queue model for capturing the impact of fast charging station use, informed by traffic flows, tra… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  36. arXiv:2109.07045  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Uncertainty Quantification in Medical Image Segmentation with Multi-decoder U-Net

    Authors: Yanwu Yang, Xutao Guo, Yiwei Pan, Pengcheng Shi, Haiyan Lv, Ting Ma

    Abstract: Accurate medical image segmentation is crucial for diagnosis and analysis. However, the models without calibrated uncertainty estimates might lead to errors in downstream analysis and exhibit low levels of robustness. Estimating the uncertainty in the measurement is vital to making definite, informed conclusions. Especially, it is difficult to make accurate predictions on ambiguous areas and focus… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: MICCAI_QUBIQ challenge, conference, Uncertainty qualification

  37. arXiv:2108.01846  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations

    Authors: Yuping Luo, Tengyu Ma

    Abstract: Training-time safety violations have been a major concern when we deploy reinforcement learning algorithms in the real world. This paper explores the possibility of safe RL algorithms with zero training-time safety violations in the challenging setting where we are only given a safe but trivial-reward initial policy without any prior knowledge of the dynamics model and additional offline data. We… ▽ More

    Submitted 10 March, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: NeurIPS 2021. Source code at https://github.com/roosephu/crabs

  38. arXiv:2106.12146  [pdf, ps, other

    eess.SP

    Terahertz Wireless Communications with Flexible Index Modulation Aided Pilot Design

    Authors: Tianqi Mao, Zhaocheng Wang

    Abstract: Terahertz (THz) wireless communication is envisioned as a promising technology, which is capable of providing ultra-high-rate transmission up to Terabit per second. However, some hardware imperfections, which are generally neglected in the existing literature concerning lower data rates and traditional operating frequencies, cannot be overlooked in the THz systems. Hardware imperfections usually c… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Journal ref: IEEE Journal on Selected Areas in Communications,vol. 39, no. 6, Jun. 2021

  39. Federated Learning for Internet of Things: A Federated Learning Framework for On-device Anomaly Data Detection

    Authors: Tuo Zhang, Chaoyang He, Tianhao Ma, Lei Gao, Mark Ma, Salman Avestimehr

    Abstract: Federated learning can be a promising solution for enabling IoT cybersecurity (i.e., anomaly detection in the IoT environment) while preserving data privacy and mitigating the high communication/storage overhead (e.g., high-frequency data from time-series sensors) of centralized over-the-cloud approaches. In this paper, to further push forward this direction with a comprehensive study in both algo… ▽ More

    Submitted 18 October, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Journal ref: Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, November 2021, Pages 413-419

  40. arXiv:2106.01549  [pdf, ps, other

    eess.SP

    Waveform Design for Joint Sensing and Communications in Millimeter-Wave and Low Terahertz Bands

    Authors: Tianqi Mao, Jiaxuan Chen, Qi Wang, Chong Han, Zhaocheng Wang, George K. Karagiannidis

    Abstract: The convergence of sensing and communication in the millimeter-wave (mmWave) and low terahertz (THz) bands has been envisioned as a promising technology, since it incorporates high-rate data transmission of hundreds of Gbps and mm-level radar sensing in a spectrum- and cost-efficient manner, by sharing both the frequency and hardware resources. However, the joint radar sensing and communication (J… ▽ More

    Submitted 26 December, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  41. Hyper-Convolution Networks for Biomedical Image Segmentation

    Authors: Tianyu Ma, Adrian V. Dalca, Mert R. Sabuncu

    Abstract: The convolution operation is a central building block of neural network architectures widely used in computer vision. The size of the convolution kernels determines both the expressiveness of convolutional neural networks (CNN), as well as the number of learnable parameters. Increasing the network capacity to capture rich pixel relationships requires increasing the number of learnable parameters,… ▽ More

    Submitted 6 October, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: WACV 2022

  42. arXiv:2103.00780  [pdf, other

    eess.IV cs.CV

    Towards Unbiased COVID-19 Lesion Localisation and Segmentation via Weakly Supervised Learning

    Authors: Yang Yang, Jiancong Chen, Ruixuan Wang, Ting Ma, Lingwei Wang, Jie Chen, Wei-Shi Zheng, Tong Zhang

    Abstract: Despite tremendous efforts, it is very challenging to generate a robust model to assist in the accurate quantification assessment of COVID-19 on chest CT images. Due to the nature of blurred boundaries, the supervised segmentation methods usually suffer from annotation biases. To support unbiased lesion localisation and to minimise the labeling costs, we propose a data-driven framework supervised… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: accepted by ISBI 2021

  43. arXiv:2010.12143  [pdf, other

    cs.SD eess.AS

    Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance

    Authors: Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng

    Abstract: Automatic speech recognition (ASR) for under-represented named-entity (UR-NE) is challenging due to such named-entities (NE) have insufficient instances and poor contextual coverage in the training data to learn reliable estimates and representations. In this paper, we propose approaches to enriching UR-NEs to improve speech recognition performance. Specifically, our first priority is to ensure th… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  44. arXiv:2010.11489  [pdf, other

    eess.AS cs.SD

    The NTU-AISG Text-to-speech System for Blizzard Challenge 2020

    Authors: Haobo Zhang, Tingzhi Mao, Haihua Xu, Hao Huang

    Abstract: We report our NTU-AISG Text-to-speech (TTS) entry systems for the Blizzard Challenge 2020 in this paper. There are two TTS tasks in this year's challenge, one is a Mandarin TTS task, the other is a Shanghai dialect TTS task. We have participated both. One of the main challenges is to build TTS systems with low-resource constraints, particularly for the case of Shanghai dialect, of which about thre… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: 5 pages, Technical Report

  45. Ensembling Low Precision Models for Binary Biomedical Image Segmentation

    Authors: Tianyu Ma, Hang Zhang, Hanley Ong, Amar Vora, Thanh D. Nguyen, Ajay Gupta, Yi Wang, Mert Sabuncu

    Abstract: Segmentation of anatomical regions of interest such as vessels or small lesions in medical images is still a difficult problem that is often tackled with manual input by an expert. One of the major challenges for this task is that the appearance of foreground (positive) regions can be similar to background (negative) regions. As a result, many automatic segmentation algorithms tend to exhibit asym… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 10 pages, 4 figures

  46. arXiv:2010.04591  [pdf, other

    stat.ML cs.LG eess.SY

    Physics-Informed Gaussian Process Regression for Probabilistic States Estimation and Forecasting in Power Grids

    Authors: Tong Ma, David Alonso Barajas-Solano, Ramakrishna Tipireddy, Alexandre M. Tartakovsky

    Abstract: Real-time state estimation and forecasting is critical for efficient operation of power grids. In this paper, a physics-informed Gaussian process regression (PhI-GPR) method is presented and used for probabilistic forecasting and estimating the phase angle, angular speed, and wind mechanical power of a three-generator power grid system using sparse measurements. In standard data-driven Gaussian pr… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    MSC Class: 62G99

  47. Two-stage battery recharge scheduling and vehicle-charger assignment policy for dynamic electric dial-a-ride services

    Authors: Tai-Yu Ma

    Abstract: Coordinating the charging scheduling of electric vehicles for dynamic dial-a-ride services is challenging considering charging queuing delays and stochastic customer demand. We propose a new two-stage solution approach to handle dynamic vehicle charging scheduling to minimize the costs of daily charging operations of the fleet. The approach comprises two components: daily vehicle charging scheduli… ▽ More

    Submitted 15 April, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

  48. arXiv:2009.05748  [pdf, other

    eess.AS cs.AI

    Visual-speech Synthesis of Exaggerated Corrective Feedback

    Authors: Yaohua Bu, Weijun Li, Tianyi Ma, Shengqi Chen, Jia Jia, Kun Li, Xiaobo Lu

    Abstract: To provide more discriminative feedback for the second language (L2) learners to better identify their mispronunciation, we propose a method for exaggerated visual-speech feedback in computer-assisted pronunciation training (CAPT). The speech exaggeration is realized by an emphatic speech generation neural network based on Tacotron, while the visual exaggeration is accomplished by ADC Viseme Blend… ▽ More

    Submitted 15 December, 2020; v1 submitted 12 September, 2020; originally announced September 2020.

  49. Optimal fast charging station locations for electric ridesharing service with online vehicle-charging station assignment

    Authors: Tai-Yu Ma, Simin Xie

    Abstract: Electrified shared mobility services need to handle charging infrastructure planning and manage their daily charging operations to minimize total charging operation time and cost. However, existing studies tend to address these problems separately. A new online vehicle-charging assignment model is proposed and integrated into the fast charging location problem for dynamic ridesharing services usin… ▽ More

    Submitted 12 October, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

  50. arXiv:2005.10470  [pdf, other

    eess.AS cs.CL cs.SD

    Multistream CNN for Robust Acoustic Modeling

    Authors: Kyu J. Han, Jing Pan, Venkata Krishna Naveen Tadala, Tao Ma, Dan Povey

    Abstract: This paper proposes multistream CNN, a novel neural network architecture for robust acoustic modeling in speech recognition tasks. The proposed architecture processes input speech with diverse temporal resolutions by applying different dilation rates to convolutional neural networks across multiple streams to achieve the robustness. The dilation rates are selected from the multiples of a sub-sampl… ▽ More

    Submitted 25 April, 2021; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted to ICASSP 2021