Skip to main content

Showing 1–50 of 98 results for author: Cao, W

  1. arXiv:2407.03374  [pdf

    cs.AI cs.SE eess.SP eess.SY

    An Outline of Prognostics and Health Management Large Model: Concepts, Paradigms, and Challenges

    Authors: Laifa Tao, Shangyu Li, Haifei Liu, Qixuan Huang, Liang Ma, Guoao Ning, Yiling Chen, Yunlong Wu, Bin Li, Weiwei Zhang, Zhengduo Zhao, Wenchao Zhan, Wenyan Cao, Chao Wang, Hongmei Liu, Jian Ma, Mingliang Suo, Yujie Cheng, Yu Ding, Dengwei Song, Chen Lu

    Abstract: Prognosis and Health Management (PHM), critical for ensuring task completion by complex systems and preventing unexpected failures, is widely adopted in aerospace, manufacturing, maritime, rail, energy, etc. However, PHM's development is constrained by bottlenecks like generalization, interpretation and verification abilities. Presently, generative artificial intelligence (AI), represented by Larg… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.18204  [pdf, other

    cs.NI

    Analysis of Channel Uncertainty in Trusted Wireless Services via Repeated Interactions

    Authors: Bingwen Chen, Xintong Ling, Weihang Cao, Jiaheng Wang, Zhi Ding

    Abstract: The coexistence of heterogeneous sub-networks in 6G poses new security and trust concerns and thus calls for a perimeterless-security model. Blockchain radio access network (B-RAN) provides a trust-building approach via repeated interactions rather than relying on pre-established trust or central authentication. Such a trust-building process naturally supports dynamic trusted services across vario… ▽ More

    Submitted 2 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.16710  [pdf, other

    cs.CV

    Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image

    Authors: Jinkun Hao, Junshu Tang, Jiangning Zhang, Ran Yi, Yijia Hong, Moran Li, Weijian Cao, Yating Wang, Lizhuang Ma

    Abstract: While recent works have achieved great success on one-shot 3D common object generation, high quality and fidelity 3D head generation from a single image remains a great challenge. Previous text-based methods for generating 3D heads were limited by text descriptions and image-based methods struggled to produce high-quality head geometry. To handle this challenging problem, we propose a novel framew… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: https://jinkun-hao.github.io/Portrait3D/

  4. arXiv:2406.01380  [pdf, other

    cs.CV stat.AP

    Convolutional Unscented Kalman Filter for Multi-Object Tracking with Outliers

    Authors: Shiqi Liu, Wenhan Cao, Chang Liu, Tianyi Zhang, Shengbo Eben Li

    Abstract: Multi-object tracking (MOT) is an essential technique for navigation in autonomous driving. In tracking-by-detection systems, biases, false positives, and misses, which are referred to as outliers, are inevitable due to complex traffic scenarios. Recent tracking methods are based on filtering algorithms that overlook these outliers, leading to reduced tracking accuracy or even loss of the objects… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures

  5. arXiv:2405.19027  [pdf, other

    cs.DC

    A Dual-functional Blockchain Framework for Solving Distributed Optimization

    Authors: Weihang Cao, Xintong Ling, Jiaheng Wang, Xiqi Gao, Zhi Ding

    Abstract: Proof of Work (PoW) has been extensively utilized as the foundation of blockchain's security, consistency, and tamper-resistance. However, long has it been criticized for its tremendous and inefficient utilization of computational power and energy. In this work, we design a dual-functional blockchain framework that uses solving optimization problems to reach consensus as an alternative to PoW, cha… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.18156  [pdf, other

    cs.CV

    VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation

    Authors: Qilin Wang, Zhengkai Jiang, Chengming Xu, Jiangning Zhang, Yabiao Wang, Xinyi Zhang, Yun Cao, Weijian Cao, Chengjie Wang, Yanwei Fu

    Abstract: Human image animation involves generating a video from a static image by following a specified pose sequence. Current approaches typically adopt a multi-stage pipeline that separately learns appearance and motion, which often leads to appearance degradation and temporal inconsistencies. To address these issues, we propose VividPose, an innovative end-to-end pipeline based on Stable Video Diffusion… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.15763  [pdf, other

    cs.CV

    FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis

    Authors: Ke Fan, Junshu Tang, Weijian Cao, Ran Yi, Moran Li, Jingyu Gong, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Lizhuang Ma

    Abstract: Text-to-motion synthesis is a crucial task in computer vision. Existing methods are limited in their universality, as they are tailored for single-person or two-person scenarios and can not be applied to generate motions for more individuals. To achieve the number-free motion synthesis, this paper reconsiders motion generation and proposes to unify the single and multi-person motion by the conditi… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  8. arXiv:2405.03008  [pdf, other

    eess.IV cs.CV cs.LG

    DVMSR: Distillated Vision Mamba for Efficient Super-Resolution

    Authors: Xiaoyan Lei, Wenlong Zhang, Weifeng Cao

    Abstract: Efficient Image Super-Resolution (SR) aims to accelerate SR network inference by minimizing computational complexity and network parameters while preserving performance. Existing state-of-the-art Efficient Image Super-Resolution methods are based on convolutional neural networks. Few attempts have been made with Mamba to harness its long-range modeling capability and efficient computational comple… ▽ More

    Submitted 11 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: 8 pages, 8 figures

  9. arXiv:2405.00027  [pdf, other

    cs.CV cs.GR cs.LG eess.IV

    Multidimensional Compressed Sensing for Spectral Light Field Imaging

    Authors: Wen Cao, Ehsan Miandji, Jonas Unger

    Abstract: This paper considers a compressive multi-spectral light field camera model that utilizes a one-hot spectralcoded mask and a microlens array to capture spatial, angular, and spectral information using a single monochrome sensor. We propose a model that employs compressed sensing techniques to reconstruct the complete multi-spectral light field from undersampled measurements. Unlike previous work wh… ▽ More

    Submitted 27 February, 2024; originally announced May 2024.

    Comments: 8 pages, published of VISAPP 2024

    Journal ref: In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP 2024, ISBN 978-989-758-679-8, ISSN 2184-4321, pages 349-356

  10. arXiv:2404.16687  [pdf, other

    cs.CV

    NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng , et al. (89 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  11. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  12. arXiv:2404.04936  [pdf, other

    cs.CV

    Bootstrapping Chest CT Image Understanding by Distilling Knowledge from X-ray Expert Models

    Authors: Weiwei Cao, Jianpeng Zhang, Yingda Xia, Tony C. W. Mok, Zi Li, Xianghua Ye, Le Lu, Jian Zheng, Yuxing Tang, Ling Zhang

    Abstract: Radiologists highly desire fully automated versatile AI for medical imaging interpretation. However, the lack of extensively annotated large-scale multi-disease datasets has hindered the achievement of this goal. In this paper, we explore the feasibility of leveraging language as a naturally high-quality supervision for chest CT imaging. In light of the limited availability of image-report pairs,… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  13. arXiv:2404.00481  [pdf, other

    stat.ML cs.LG eess.SY

    Convolutional Bayesian Filtering

    Authors: Wenhan Cao, Shiqi Liu, Chang Liu, Zeyu He, Stephen S. -T. Yau, Shengbo Eben Li

    Abstract: Bayesian filtering serves as the mainstream framework of state estimation in dynamic systems. Its standard version utilizes total probability rule and Bayes' law alternatively, where how to define and compute conditional probability is critical to state distribution inference. Previously, the conditional probability is assumed to be exactly known, which represents a measure of the occurrence proba… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  14. arXiv:2403.17664  [pdf, other

    cs.CV

    DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation

    Authors: Qilin Wang, Jiangning Zhang, Chengming Xu, Weijian Cao, Ying Tai, Yue Han, Yanhao Ge, Hong Gu, Chengjie Wang, Yanwei Fu

    Abstract: Facial Appearance Editing (FAE) aims to modify physical attributes, such as pose, expression and lighting, of human facial images while preserving attributes like identity and background, showing great importance in photograph. In spite of the great progress in this area, current researches generally meet three challenges: low generation fidelity, poor attribute preservation, and inefficient infer… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  15. arXiv:2403.12906  [pdf, other

    cs.CV

    TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation

    Authors: Yufei Liu, Junwei Zhu, Junshu Tang, Shijie Zhang, Jiangning Zhang, Weijian Cao, Chengjie Wang, Yunsheng Wu, Dongjin Huang

    Abstract: Texturing 3D humans with semantic UV maps remains a challenge due to the difficulty of acquiring reasonably unfolded UV. Despite recent text-to-3D advancements in supervising multi-view renderings using large text-to-image (T2I) models, issues persist with generation speed, text consistency, and texture quality, resulting in data scarcity among existing datasets. We present TexDreamer, the first z… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project Page: https://ggxxii.github.io/texdreamer/

  16. arXiv:2403.07954  [pdf, other

    cs.LG eess.SP

    Optimizing Polynomial Graph Filters: A Novel Adaptive Krylov Subspace Approach

    Authors: Keke Huang, Wencai Cao, Hoang Ta, Xiaokui Xiao, Pietro Liò

    Abstract: Graph Neural Networks (GNNs), known as spectral graph filters, find a wide range of applications in web networks. To bypass eigendecomposition, polynomial graph filters are proposed to approximate graph filters by leveraging various polynomial bases for filter training. However, no existing studies have explored the diverse polynomial graph filters from a unified perspective for optimization. In… ▽ More

    Submitted 20 May, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  17. arXiv:2403.04268  [pdf

    quant-ph cs.LG

    Qubit-Wise Architecture Search Method for Variational Quantum Circuits

    Authors: Jialin Chen, Zhiqiang Cai, Ke Xu, Di Wu, Wei Cao

    Abstract: Considering the noise level limit, one crucial aspect for quantum machine learning is to design a high-performing variational quantum circuit architecture with small number of quantum gates. As the classical neural architecture search (NAS), quantum architecture search methods (QAS) employ methods like reinforcement learning, evolutionary algorithms and supernet optimiza-tion to improve the search… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  18. arXiv:2403.02905  [pdf, other

    cs.MM

    MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model

    Authors: Sen Wang, Jiangning Zhang, Weijian Cao, Xiaobin Hu, Moran Li, Xiaozhong Ji, Xin Tan, Mengtian Li, Zhifeng Xie, Chengjie Wang, Lizhuang Ma

    Abstract: The body movements accompanying speech aid speakers in expressing their ideas. Co-speech motion generation is one of the important approaches for synthesizing realistic avatars. Due to the intricate correspondence between speech and motion, generating realistic and diverse motion is a challenging task. In this paper, we propose MMoFusion, a Multi-modal co-speech Motion generation framework based o… ▽ More

    Submitted 17 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  19. arXiv:2402.17375  [pdf, other

    eess.SY cs.LG

    Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control

    Authors: Wenhan Cao, Wei Pan

    Abstract: Integral reinforcement learning (IntRL) demands the precise computation of the utility function's integral at its policy evaluation (PEV) stage. This is achieved through quadrature rules, which are weighted sums of utility functions evaluated from state samples obtained in discrete time. Our research reveals a critical yet underexplored phenomenon: the choice of the computational method -- in this… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  20. arXiv:2402.16200  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    IR2: Information Regularization for Information Retrieval

    Authors: Jianyou Wang, Kaicheng Wang, Xiaoyue Wang, Weili Cao, Ramamohan Paturi, Leon Bergen

    Abstract: Effective information retrieval (IR) in settings with limited training data, particularly for complex queries, remains a challenging task. This paper introduces IR2, Information Regularization for Information Retrieval, a technique for reducing overfitting during synthetic data generation. This approach, representing a novel application of regularization techniques in synthetic data creation for I… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024 - The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation

  21. arXiv:2402.16072  [pdf

    cs.ET quant-ph

    Demonstration of 3 V Programmable Josephson Junction Arrays Using Non-Integer-Multiple Logic

    Authors: Wenhui Cao, Erkun Yang, Jinjin Li, Huan Qiao, Yuan Zhong, Qing Zhong, Da Xu, Xueshen Wang, Xiaolong Xu, Shijian Wang, Jian Chen

    Abstract: This article demonstrates a new kind of programmable logic for the representation of an integer that can be used for the programmable Josephson voltage standard. It can enable the numbers of junctions in most bits to be variable integer values, which is different from normal binary logic or ternary logic. Consequently, missing junctions due to superconducting short circuits can be tolerated under… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  22. arXiv:2402.14151  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    BIRCO: A Benchmark of Information Retrieval Tasks with Complex Objectives

    Authors: Xiaoyue Wang, Jianyou Wang, Weili Cao, Kaicheng Wang, Ramamohan Paturi, Leon Bergen

    Abstract: We present the Benchmark of Information Retrieval (IR) tasks with Complex Objectives (BIRCO). BIRCO evaluates the ability of IR systems to retrieve documents given multi-faceted user objectives. The benchmark's complexity and compact size make it suitable for evaluating large language model (LLM)-based information retrieval systems. We present a modular framework for investigating factors that may… ▽ More

    Submitted 3 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  23. arXiv:2402.04059  [pdf, other

    cs.LG cs.AI

    Deep Learning for Multivariate Time Series Imputation: A Survey

    Authors: Jun Wang, Wenjie Du, Wei Cao, Keli Zhang, Wenjia Wang, Yuxuan Liang, Qingsong Wen

    Abstract: The ubiquitous missing values cause the multivariate time series data to be partially observed, destroying the integrity of time series and hindering the effective time series data analysis. Recently deep learning imputation methods have demonstrated remarkable success in elevating the quality of corrupted time series data, subsequently enhancing performance in downstream tasks. In this paper, we… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 1 figure, 5 tables, 58 referred papers

  24. arXiv:2401.16827  [pdf

    cs.RO cond-mat.soft

    3D-Printed Hydraulic Fluidic Logic Circuitry for Soft Robots

    Authors: Yuxin Lin, Xinyi Zhou, Wenhan Cao

    Abstract: Fluidic logic circuitry analogous to its electric counterpart could potentially provide soft robots with machine intelligence due to its supreme adaptability, dexterity, and seamless compatibility using state-of-the-art additive manufacturing processes. However, conventional microfluidic channel based circuitry suffers from limited driving force, while macroscopic pneumatic logic lacks timely resp… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 26 pages, 14 figures

  25. arXiv:2401.06614  [pdf, other

    cs.CV

    Motion2VecSets: 4D Latent Vector Set Diffusion for Non-rigid Shape Reconstruction and Tracking

    Authors: Wei Cao, Chang Luo, Biao Zhang, Matthias Nießner, Jiapeng Tang

    Abstract: We introduce Motion2VecSets, a 4D diffusion model for dynamic surface reconstruction from point cloud sequences. While existing state-of-the-art methods have demonstrated success in reconstructing non-rigid objects using neural field representations, conventional feed-forward networks encounter challenges with ambiguous observations from noisy, partial, or sparse point clouds. To address these cha… ▽ More

    Submitted 13 April, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  26. arXiv:2401.06196  [pdf

    cs.CE

    VW-PINNs: A volume weighting method for PDE residuals in physics-informed neural networks

    Authors: Jiahao Song, Wenbo Cao, Fei Liao, Weiwei Zhang

    Abstract: Physics-informed neural networks (PINNs) have shown remarkable prospects in the solving the forward and inverse problems involving partial differential equations (PDEs). The method embeds PDEs into the neural network by calculating PDE loss at a series of collocation points, providing advantages such as meshfree and more convenient adaptive sampling. However, when solving PDEs using nonuniform col… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  27. arXiv:2312.13537  [pdf, other

    cs.CV

    HyperEditor: Achieving Both Authenticity and Cross-Domain Capability in Image Editing via Hypernetworks

    Authors: Hai Zhang, Chunwei Wu, Guitao Cao, Hailing Wang, Wenming Cao

    Abstract: Editing real images authentically while also achieving cross-domain editing remains a challenge. Recent studies have focused on converting real images into latent codes and accomplishing image editing by manipulating these codes. However, merely manipulating the latent codes would constrain the edited images to the generator's image domain, hindering the attainment of diverse editing goals. In res… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  28. arXiv:2310.16491  [pdf

    cs.LG physics.comp-ph

    TSONN: Time-stepping-oriented neural network for solving partial differential equations

    Authors: Wenbo Cao, Weiwei Zhang

    Abstract: Deep neural networks (DNNs), especially physics-informed neural networks (PINNs), have recently become a new popular method for solving forward and inverse problems governed by partial differential equations (PDEs). However, these methods still face challenges in achieving stable training and obtaining correct results in many problems, since minimizing PDE residuals with PDE-based soft constraint… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  29. arXiv:2310.07402  [pdf, other

    cs.LG cs.AI

    NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series Pretraining

    Authors: Chenguo Lin, Xumeng Wen, Wei Cao, Congrui Huang, Jiang Bian, Stephen Lin, Zhirong Wu

    Abstract: Recent research on time-series self-supervised models shows great promise in learning semantic representations. However, it has been limited to small-scale datasets, e.g., thousands of temporal sequences. In this work, we make key technical contributions that are tailored to the numerical properties of time-series data and allow the model to scale to large datasets, e.g., millions of temporal sequ… ▽ More

    Submitted 10 July, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted by TMLR 2024

  30. arXiv:2309.07808  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    What Matters to Enhance Traffic Rule Compliance of Imitation Learning for Automated Driving

    Authors: Hongkuan Zhou, Aifen Sui, Wei Cao, Zhenshan Bing

    Abstract: More research attention has recently been given to end-to-end autonomous driving technologies where the entire driving pipeline is replaced with a single neural network because of its simpler structure and faster inference time. Despite this appealing approach largely reducing the components in the driving pipeline, its simplicity also leads to interpretability problems and safety issues. The trai… ▽ More

    Submitted 20 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 10 pages, 2 figures

  31. arXiv:2308.16406  [pdf, other

    cs.LG

    CktGNN: Circuit Graph Neural Network for Electronic Design Automation

    Authors: Zehao Dong, Weidong Cao, Muhan Zhang, Dacheng Tao, Yixin Chen, Xuan Zhang

    Abstract: The electronic design automation of analog circuits has been a longstanding challenge in the integrated circuit field due to the huge design space and complex design trade-offs among circuit specifications. In the past decades, intensive research efforts have mostly been paid to automate the transistor sizing with a given circuit topology. By recognizing the graph nature of circuits, this paper pr… ▽ More

    Submitted 9 February, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted by ICLR (International Conference on Learning Representations) 2023

  32. arXiv:2307.13837  [pdf, other

    cs.AI cs.PL

    Scaling Integer Arithmetic in Probabilistic Programs

    Authors: William X. Cao, Poorva Garg, Ryan Tjoa, Steven Holtzen, Todd Millstein, Guy Van den Broeck

    Abstract: Distributions on integers are ubiquitous in probabilistic modeling but remain challenging for many of today's probabilistic programming languages (PPLs). The core challenge comes from discrete structure: many of today's PPL inference strategies rely on enumeration, sampling, or differentiation in order to scale, which fail for high-dimensional complex discrete distributions involving integers. Our… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to UAI 2023

  33. arXiv:2307.12309  [pdf, other

    cs.CV

    Building Extraction from Remote Sensing Images via an Uncertainty-Aware Network

    Authors: Wei He, Jiepan Li, Weinan Cao, Liangpei Zhang, Hongyan Zhang

    Abstract: Building extraction aims to segment building pixels from remote sensing images and plays an essential role in many applications, such as city planning and urban dynamic monitoring. Over the past few years, deep learning methods with encoder-decoder architectures have achieved remarkable performance due to their powerful feature representation capability. Nevertheless, due to the varying scales and… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  34. arXiv:2307.03983  [pdf, ps, other

    cs.IT

    Hybrid Successive Interference Cancellation and Power Adaptation: a Win-Win Strategy for Robust Uplink NOMA Transmission

    Authors: Yanshi Sun, Wei Cao, Momiao Zhou, Zhiguo Ding

    Abstract: The aim of this paper is to reveal the importance of hybrid successive interference cancellation (SIC) and power adaptation (PA) for improving transmission robustness of uplink non-orthogonal multiple access (NOMA). Particularly, a cognitive radio inspired uplink NOMA communication scenario is considered, where one primary user is allocated one dedicated resource block, while M secondary users com… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2307.01517

  35. arXiv:2307.01517  [pdf, ps, other

    cs.IT

    New Designs of Robust Uplink NOMA in Cognitive Radio Inspired Communications

    Authors: Yanshi Sun, Wei Cao, Momiao Zhou, Zhiguo Ding

    Abstract: This paper considers a cognitive radio inspired uplink communication scenario, where one primary user is allocated with one dedicated resource block, while $M$ secondary users compete with each other to opportunistically access the primary user's channel. Two new designs of NOMA schemes, namely hybrid successive interference cancellation with power adaptation (HSIC-PA) and fixed successive interfe… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  36. Warpformer: A Multi-scale Modeling Approach for Irregular Clinical Time Series

    Authors: Jiawen Zhang, Shun Zheng, Wei Cao, Jiang Bian, Jia Li

    Abstract: Irregularly sampled multivariate time series are ubiquitous in various fields, particularly in healthcare, and exhibit two key characteristics: intra-series irregularity and inter-series discrepancy. Intra-series irregularity refers to the fact that time-series signals are often recorded at irregular intervals, while inter-series discrepancy refers to the significant variability in sampling rates… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: KDD23 Research Track

  37. arXiv:2306.01997  [pdf, other

    cs.LG

    UADB: Unsupervised Anomaly Detection Booster

    Authors: Hangting Ye, Zhining Liu, Xinyi Shen, Wei Cao, Shun Zheng, Xiaofan Gui, Huishuai Zhang, Yi Chang, Jiang Bian

    Abstract: Unsupervised Anomaly Detection (UAD) is a key data mining problem owing to its wide real-world applications. Due to the complete absence of supervision signals, UAD methods rely on implicit assumptions about anomalous patterns (e.g., scattered/sparsely/densely clustered) to detect anomalies. However, real-world data are complex and vary significantly across different domains. No single assumption… ▽ More

    Submitted 26 December, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: IEEE 39th International Conference on Data Engineering (ICDE 2023)

  38. arXiv:2306.00426  [pdf

    eess.AS cs.SD

    Speaker verification using attentive multi-scale convolutional recurrent network

    Authors: Yanxiong Li, Zhongjie Jiang, Wenchang Cao, Qisheng Huang

    Abstract: In this paper, we propose a speaker verification method by an Attentive Multi-scale Convolutional Recurrent Network (AMCRN). The proposed AMCRN can acquire both local spatial information and global sequential information from the input speech recordings. In the proposed method, logarithm Mel spectrum is extracted from each speech recording and then fed to the proposed AMCRN for learning speaker em… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 21 pages, 6 figures, 8 tables. Accepted for publication in Applied Soft Computing

  39. arXiv:2305.18045  [pdf, ps, other

    cs.SD cs.MM eess.AS

    Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes

    Authors: Wei Xie, Yanxiong Li, Qianhua He, Wenchang Cao, Tuomas Virtanen

    Abstract: New classes of sounds constantly emerge with a few samples, making it challenging for models to adapt to dynamic acoustic environments. This challenge motivates us to address the new problem of few-shot class-incremental audio classification. This study aims to enable a model to continuously recognize new classes of sounds with a few training samples of new classes while remembering the learned on… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 5 pages,2 figures, Accepted by Interspeech 2023

  40. arXiv:2305.05320  [pdf, ps, other

    cs.IT

    Minimal Linear Codes Constructed from partial spreads

    Authors: W. Lu, X. Wu, X. W. Cao, G. J. Luo, X. P. Qin

    Abstract: Partial spread is important in finite geometry and can be used to construct linear codes. From the results in (Designs, Codes and Cryptography 90:1-15, 2022) by Xia Li, Qin Yue and Deng Tang, we know that if the number of the elements in a partial spread is ``big enough", then the corresponding linear code is minimal. They used the sufficient condition in (IEEE Trans. Inf. Theory 44(5): 2010-201… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  41. arXiv:2305.05317  [pdf, ps, other

    cs.IT

    Minimal Linear Codes Constructed from hierarchical posets with two levels

    Authors: X. Wu, W. Lu, X. P. Qin, X. W. Cao

    Abstract: J. Y. Hyun, et al. (Des. Codes Cryptogr., vol. 88, pp. 2475-2492, 2020) constructed some optimal and minimal binary linear codes generated by one or two order ideals in hierarchical posets of two levels. At the end of their paper, they left an open problem: it also should be interesting to investigate the cases of more than two orders in hierarchical posets with two levels or many levels. In this… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:1911.11632, arXiv:1911.07648

  42. arXiv:2303.02545  [pdf, other

    cs.CR

    MINER: A Hybrid Data-Driven Approach for REST API Fuzzing

    Authors: Chenyang Lyu, Jiacheng Xu, Shouling Ji, Xuhong Zhang, Qinying Wang, Binbin Zhao, Gaoning Pan, Wei Cao, Raheem Beyah

    Abstract: In recent years, REST API fuzzing has emerged to explore errors on a cloud service. Its performance highly depends on the sequence construction and request generation. However, existing REST API fuzzers have trouble generating long sequences with well-constructed requests to trigger hard-to-reach states in a cloud service, which limits their performance of finding deep errors and security bugs. Fu… ▽ More

    Submitted 4 March, 2023; originally announced March 2023.

    Comments: Accepted as a full paper at USENIX Security '23

  43. arXiv:2302.10959  [pdf, other

    stat.ML cs.LG eess.SY

    Dealing with Collinearity in Large-Scale Linear System Identification Using Gaussian Regression

    Authors: Wenqi Cao, Gianluigi Pillonetto

    Abstract: Many problems arising in control require the determination of a mathematical model of the application. This has often to be performed starting from input-output data, leading to a task known as system identification in the engineering literature. One emerging topic in this field is estimation of networks consisting of several interconnected dynamic systems. We consider the linear setting assuming… ▽ More

    Submitted 28 February, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2203.13633

  44. arXiv:2301.12101  [pdf, other

    cs.ET cs.AR

    Non-Hermitian Physics-Inspired Voltage-Controlled Oscillators with Resistive Tuning

    Authors: Weidong Cao, Hua Wang, Xuan Zhang

    Abstract: This paper presents a non-Hermitian physics-inspired voltage-controlled oscillator (VCO) topology, which is termed parity-time-symmetric topology. The VCO consists of two coupled inductor-capacitor (LC) cores with a balanced gain and loss profile. Due to the interplay between the gain/loss and their coupling, an extra degree of freedom is enabled via resistive tuning, which can enhance the frequen… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: 5 Pages, 6 figures, accepted by ISCAS 2023

  45. arXiv:2301.08413  [pdf, other

    cs.CV

    Chaos to Order: A Label Propagation Perspective on Source-Free Domain Adaptation

    Authors: Chunwei Wu, Guitao Cao, Yan Li, Xidong Xi, Wenming Cao, Hong Wang

    Abstract: Source-free domain adaptation (SFDA), where only a pre-trained source model is used to adapt to the target distribution, is a more general approach to achieving domain adaptation in the real world. However, it can be challenging to capture the inherent structure of the target features accurately due to the lack of supervised information on the target domain. By analyzing the clustering performance… ▽ More

    Submitted 14 August, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: Accepted by ACM MM2023

  46. arXiv:2301.00089  [pdf, other

    cs.RO cs.AI

    Autonomous Driving Simulator based on Neurorobotics Platform

    Authors: Wei Cao, Liguo Zhou, Yuhong Huang, Alois Knoll

    Abstract: There are many artificial intelligence algorithms for autonomous driving, but directly installing these algorithms on vehicles is unrealistic and expensive. At the same time, many of these algorithms need an environment to train and optimize. Simulation is a valuable and meaningful solution with training and testing functions, and it can say that simulation is a critical link in the autonomous dri… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

    Comments: 25 pages, 8 figures

    MSC Class: 00-01 ACM Class: D.0

  47. A novel convergence enhancement method based on Online Dimension Reduction Optimization

    Authors: Wenbo Cao, Yilang Liu, Xianglin Shan, Chuanqiang Gao, Weiwei Zhang

    Abstract: Iterative steady-state solvers are widely used in computational fluid dynamics. Unfortunately, it is difficult to obtain steady-state solution for unstable problem caused by physical instability and numerical instability. Optimization is a better choice for solving unstable problem because steady-state solution is always the extreme point of optimization regardless of whether the problem is unstab… ▽ More

    Submitted 29 November, 2022; originally announced December 2022.

    Journal ref: Physics of Fluids 35, 036124 (2023)

  48. arXiv:2211.12507  [pdf, other

    cs.LG

    OpenFE: Automated Feature Generation with Expert-level Performance

    Authors: Tianping Zhang, Zheyu Zhang, Zhiyuan Fan, Haoyan Luo, Fengyuan Liu, Qian Liu, Wei Cao, Jian Li

    Abstract: The goal of automated feature generation is to liberate machine learning experts from the laborious task of manual feature generation, which is crucial for improving the learning performance of tabular data. The major challenge in automated feature generation is to efficiently and accurately identify effective features from a vast pool of candidate features. In this paper, we present OpenFE, an au… ▽ More

    Submitted 5 June, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: 22 pages, 3 figures, accepted by ICML2023

  49. arXiv:2210.04122  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG

    Inferring Line-of-Sight Velocities and Doppler Widths from Stokes Profiles of GST/NIRIS Using Stacked Deep Neural Networks

    Authors: Haodi Jiang, Qin Li, Yan Xu, Wynne Hsu, Kwangsu Ahn, Wenda Cao, Jason T. L. Wang, Haimin Wang

    Abstract: Obtaining high-quality magnetic and velocity fields through Stokes inversion is crucial in solar physics. In this paper, we present a new deep learning method, named Stacked Deep Neural Networks (SDNN), for inferring line-of-sight (LOS) velocities and Doppler widths from Stokes profiles collected by the Near InfraRed Imaging Spectropolarimeter (NIRIS) on the 1.6 m Goode Solar Telescope (GST) at th… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 16 pages, 8 figures

    Journal ref: The Astrophysical Journal, 2022

  50. On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator

    Authors: Jingliang Duan, Wenhan Cao, Yang Zheng, Lin Zhao

    Abstract: The convergence of policy gradient algorithms in reinforcement learning hinges on the optimization landscape of the underlying optimal control problem. Theoretical insights into these algorithms can often be acquired from analyzing those of linear quadratic control. However, most of the existing literature only considers the optimization landscape for static full-state or output feedback policies… ▽ More

    Submitted 29 October, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.09598

    Journal ref: 2022 IEEE 61st Conference on Decision and Control (CDC)