Showing 1–50 of 60 results for author: Hou, C

  1. arXiv:2407.14491  [pdf, other]

    cs.CV

    PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding

    Authors: Chenshu Hou, Liang Peng, Xiaopei Wu, Wenxiao Wang, Xiaofei He

    Abstract: 3D visual grounding aims to locate the target object mentioned by free-formed natural language descriptions in 3D point cloud scenes. Most previous work requires the encoder-decoder to simultaneously align the attribute information of the target object and its relational information with the surrounding environment across modalities. This causes the queries' attention to be dispersed, potentially…

    Submitted 19 July, 2024; originally announced July 2024.

  2. arXiv:2407.02542  [pdf, other]

    cs.IR cs.AI cs.LG

    ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation

    Authors: Chaoqun Hou, Yuanhang Zhou, Yi Cao, Tong Liu

    Abstract: In industrial recommendation systems, there are several mini-apps designed to meet the diverse interests and needs of users. The sample space of them is merely a small subset of the entire space, making it challenging to train an efficient model. In recent years, there have been many excellent studies related to cross-domain recommendation aimed at mitigating the problem of data sparsity. However,…

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2406.10126  [pdf, other]

    cs.CV

    Training-free Camera Control for Video Generation

    Authors: Chen Hou, Guoqiang Wei, Yan Zeng, Zhibo Chen

    Abstract: We propose a training-free and robust solution to offer camera movement control for off-the-shelf video diffusion models. Unlike previous work, our method does not require any supervised finetuning on camera-annotated datasets or self-supervised training via data augmentation. Instead, it can be plugged and played with most pretrained video diffusion models and generate camera controllable videos…

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.03695  [pdf, other]

    cs.CR

    FACOS: Enabling Privacy Protection Through Fine-Grained Access Control with On-chain and Off-chain System

    Authors: Chao Liu, Cankun Hou, Tianyu Jiang, Jianting Ning, Hui Qiao, Yusen Wu

    Abstract: Data-driven landscape across finance, government, and healthcare, the continuous generation of information demands robust solutions for secure storage, efficient dissemination, and fine-grained access control. Blockchain technology emerges as a significant tool, offering decentralized storage while upholding the tenets of data security and accessibility. However, on-chain and off-chain strategies…

    Submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2406.02958  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR cs.DC

    PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs

    Authors: Charlie Hou, Akshat Shrivastava, Hongyuan Zhan, Rylan Conway, Trang Le, Adithya Sagar, Giulia Fanti, Daniel Lazar

    Abstract: On-device training is currently the most common approach for training machine learning (ML) models on private, distributed user data. Despite this, on-device training has several drawbacks: (1) most user devices are too small to train large models on-device, (2) on-device training is communication- and computation-intensive, and (3) on-device training can be difficult to debug and deploy. To addre…

    Submitted 17 July, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024 (Oral)

  6. arXiv:2405.10626  [pdf, other]

    cs.CL

    Dynamic data sampler for cross-language transfer learning in large language models

    Authors: Yudong Li, Yuhao Feng, Wen Zhou, Zhe Zhao, Linlin Shen, Cheng Hou, Xianxu Hou

    Abstract: Large Language Models (LLMs) have gained significant attention in the field of natural language processing (NLP) due to their wide range of applications. However, training LLMs for languages other than English poses significant challenges, due to the difficulty in acquiring large-scale corpus and the requisite computing resources. In this paper, we propose ChatFlow, a cross-language transfer-based…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by ICASSP 2024

  7. arXiv:2403.10127  [pdf, other]

    cs.CV

    TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model

    Authors: Changhong Hou, Junchuan Yu, Daqing Ge, Liu Yang, Laidian Xi, Yunxuan Pang, Yi Wen

    Abstract: Landslides are one of the most destructive natural disasters in the world, posing a serious threat to human life and safety. The development of foundation models has provided a new research paradigm for large-scale landslide detection. The Segment Anything Model (SAM) has garnered widespread attention in the field of image segmentation. However, our experiment found that SAM performed poorly in th…

    Submitted 15 March, 2024; originally announced March 2024.

  8. arXiv:2402.18905  [pdf, other]

    cs.LG cs.AI cs.CR math.OC

    On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?

    Authors: Shuqi Ke, Charlie Hou, Giulia Fanti, Sewoong Oh

    Abstract: Differentially private (DP) machine learning pipelines typically involve a two-phase process: non-private pre-training on a public dataset, followed by fine-tuning on private data using DP optimization techniques. In the DP setting, it has been observed that full fine-tuning may not always yield the best test accuracy, even for in-distribution data. This paper (1) analyzes the training dynamics of…

    Submitted 29 February, 2024; originally announced February 2024.

  9. Label Informed Contrastive Pretraining for Node Importance Estimation on Knowledge Graphs

    Authors: Tianyu Zhang, Chengbin Hou, Rui Jiang, Xuegong Zhang, Chenghu Zhou, Ke Tang, Hairong Lv

    Abstract: Node Importance Estimation (NIE) is a task of inferring importance scores of the nodes in a graph. Due to the availability of richer data and knowledge, recent research interests of NIE have been dedicating to knowledge graphs for predicting future or missing node importance scores. Existing state-of-the-art NIE methods train the model by available labels, and they consider every interested node e…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE TNNLS

  10. arXiv:2401.01065  [pdf, other]

    cs.CV cs.AI

    BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving

    Authors: Tao Tang, Dafeng Wei, Zhengyu Jia, Tian Gao, Changwei Cai, Chengkai Hou, Peng Jia, Kun Zhan, Haiyang Sun, Jingchen Fan, Yixing Zhao, Fu Liu, Xiaodan Liang, Xianpeng Lang, Yang Wang

    Abstract: The rapid development of the autonomous driving industry has led to a significant accumulation of autonomous driving data. Consequently, there comes a growing demand for retrieving data to provide specialized optimization. However, directly applying previous image retrieval methods faces several challenges, such as the lack of global feature representation and inadequate text retrieval ability for…

    Submitted 18 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  11. arXiv:2312.15707  [pdf, other]

    cs.CV

    High-Fidelity Diffusion-based Image Editing

    Authors: Chen Hou, Guoqiang Wei, Zhibo Chen

    Abstract: Diffusion models have attained remarkable success in the domains of image generation and editing. It is widely recognized that employing larger inversion and denoising steps in diffusion model leads to improved image reconstruction quality. However, the editing performance of diffusion models tends to be no more satisfactory even with increasing denoising steps. The deficiency in editing could be…

    Submitted 4 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  12. arXiv:2309.15376  [pdf, other]

    cs.LG

    ADGym: Design Choices for Deep Anomaly Detection

    Authors: Minqi Jiang, Chaochuan Hou, Ao Zheng, Songqiao Han, Hailiang Huang, Qingsong Wen, Xiyang Hu, Yue Zhao

    Abstract: Deep learning (DL) techniques have recently found success in anomaly detection (AD) across various fields such as finance, medical services, and cloud computing. However, most of the current research tends to view deep AD algorithms as a whole, without dissecting the contributions of individual design choices like loss functions and network architectures. This view tends to diminish the value of p…

    Submitted 29 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023. The first three authors contribute equally. Code available at https://github.com/Minqi824/ADGym

  13. arXiv:2309.04747  [pdf, other]

    cs.CV

    When to Learn What: Model-Adaptive Data Augmentation Curriculum

    Authors: Chengkai Hou, Jieyu Zhang, Tianyi Zhou

    Abstract: Data augmentation (DA) is widely used to improve the generalization of neural networks by enforcing the invariances and symmetries to pre-defined transformations applied to input data. However, a fixed augmentation policy may have different effects on each sample in different training stages but existing approaches cannot adjust the policy to be adaptive to each sample and the training model. In t…

    Submitted 30 September, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Our paper is accepted by ICCV 2023

  14. arXiv:2308.00177  [pdf, other]

    cs.LG cs.AI

    Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity

    Authors: Charlie Hou, Kiran Koshy Thekumparampil, Michael Shavlovsky, Giulia Fanti, Yesh Dattatreya, Sujay Sanghavi

    Abstract: On tabular data, a significant body of literature has shown that current deep learning (DL) models perform at best similarly to Gradient Boosted Decision Trees (GBDTs), while significantly underperforming them on outlier data. However, these works often study idealized problem settings which may fail to capture complexities of real-world scenarios. We identify a natural tabular data setting where…

    Submitted 25 June, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: ICML-MFPL 2023 Workshop Oral, SPIGM@ICML2024

  15. arXiv:2307.15058  [pdf, other]

    cs.CV

    MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving

    Authors: Zirui Wu, Tianyu Liu, Liyi Luo, Zhide Zhong, Jianteng Chen, Hongmin Xiao, Chao Hou, Haozhe Lou, Yuantao Chen, Runyi Yang, Yuxin Huang, Xiaoyu Ye, Zike Yan, Yongliang Shi, Yiyi Liao, Hao Zhao

    Abstract: Nowadays, autonomous cars can drive smoothly in ordinary cases, and it is widely recognized that realistic sensor simulation will play a critical role in solving remaining corner cases by simulating them. To this end, we propose an autonomous driving simulator based upon neural radiance fields (NeRFs). Compared with existing works, ours has three notable features: (1) Instance-aware. Our simulator…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: CICAI 2023, project page with code: https://open-air-sun.github.io/mars/

  16. arXiv:2306.15925  [pdf, other]

    cs.CV

    Subclass-balancing Contrastive Learning for Long-tailed Recognition

    Authors: Chengkai Hou, Jieyu Zhang, Haonan Wang, Tianyi Zhou

    Abstract: Long-tailed recognition with imbalanced class distribution naturally emerges in practical machine learning applications. Existing methods such as data reweighing, resampling, and supervised contrastive learning enforce the class balance with a price of introducing imbalance between instances of head class and tail class, which may ignore the underlying rich semantic substructures of the former and…

    Submitted 9 September, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  17. arXiv:2305.18616  [pdf]

    cs.CY cs.AI

    Embrace Opportunities and Face Challenges: Using ChatGPT in Undergraduate Students' Collaborative Interdisciplinary Learning

    Authors: Gaoxia Zhu, Xiuyi Fan, Chenyu Hou, Tianlong Zhong, Peter Seow, Annabel Chen Shen-Hsing, Preman Rajalingam, Low Kin Yew, Tan Lay Poh

    Abstract: ChatGPT, launched in November 2022, has gained widespread attention from students and educators globally, with an online report by Hu (2023) stating it as the fastest-growing consumer application in history. While discussions on the use of ChatGPT in higher education are abundant, empirical studies on its impact on collaborative interdisciplinary learning are rare. To investigate its potential, we…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 33 pages, 2 figures, 5 tables

  18. Adaptive Learning based Upper-Limb Rehabilitation Training System with Collaborative Robot

    Authors: Jun Hong Lim, Kaibo He, Zeji Yi, Chen Hou, Chen Zhang, Yanan Sui, Luming Li

    Abstract: Rehabilitation training for patients with motor disabilities usually requires specialized devices in rehabilitation centers. Home-based multi-purpose training would significantly increase treatment accessibility and reduce medical costs. While it is unlikely to equip a set of rehabilitation robots at home, we investigate the feasibility to use the general-purpose collaborative robot for rehabilita…

    Submitted 12 July, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Journal ref: EMBC2023

  19. arXiv:2305.09098  [pdf, other]

    cs.CL cs.LG

    Weight-Inherited Distillation for Task-Agnostic BERT Compression

    Authors: Taiqiang Wu, Cheng Hou, Shanshan Lao, Jiayi Li, Ngai Wong, Zhe Zhao, Yujiu Yang

    Abstract: Knowledge Distillation (KD) is a predominant approach for BERT compression. Previous KD-based methods focus on designing extra alignment losses for the student model to mimic the behavior of the teacher model. These methods transfer the knowledge in an indirect way. In this paper, we propose a novel Weight-Inherited Distillation (WID), which directly transfers knowledge from the teacher. WID does…

    Submitted 20 March, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, NAACL2024 findings

  20. arXiv:2304.01234  [pdf, other]

    astro-ph.SR astro-ph.EP cs.LG physics.plasm-ph physics.space-ph

    Prediction of solar wind speed by applying convolutional neural network to potential field source surface (PFSS) magnetograms

    Authors: Rong Lin, Zhekai Luo, Jiansen He, Lun Xie, Chuanpeng Hou, Shuwei Chen

    Abstract: An accurate solar wind speed model is important for space weather predictions, catastrophic event warnings, and other issues concerning solar wind - magnetosphere interaction. In this work, we construct a model based on convolutional neural network (CNN) and Potential Field Source Surface (PFSS) magnetograms, considering a solar wind source surface of $R_{\rm SS}=2.5R_\odot$, aiming to predict the…

    Submitted 3 April, 2023; originally announced April 2023.

  21. arXiv:2302.09042  [pdf, other]

    cs.LG cs.AI cs.DC

    Privately Customizing Prefinetuning to Better Match User Data in Federated Learning

    Authors: Charlie Hou, Hongyuan Zhan, Akshat Shrivastava, Sid Wang, Aleksandr Livshits, Giulia Fanti, Daniel Lazar

    Abstract: In Federated Learning (FL), accessing private client data incurs communication and privacy costs. As a result, FL deployments commonly prefinetune pretrained foundation models on a (large, possibly public) dataset that is held by the central server; they then FL-finetune the model on a private, federated dataset held by clients. Evaluating prefinetuning dataset quality reliably and privately is th…

    Submitted 22 February, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  22. arXiv:2302.08062  [pdf]

    cs.CV cs.AI q-bio.PE

    Fossil Image Identification using Deep Learning Ensembles of Data Augmented Multiviews

    Authors: Chengbin Hou, Xinyu Lin, Hanhui Huang, Sheng Xu, Junxuan Fan, Yukun Shi, Hairong Lv

    Abstract: Identification of fossil species is crucial to evolutionary studies. Recent advances from deep learning have shown promising prospects in fossil image identification. However, the quantity and quality of labeled fossil images are often limited due to fossil preservation, conditioned sampling, and expensive and inconsistent label annotation by domain experts, which pose great challenges to training…

    Submitted 1 February, 2024; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: published in Methods in Ecology and Evolution

    Journal ref: Methods in Ecology and Evolution, 14, 3020-3034 (2023)

  23. arXiv:2302.04549  [pdf, other]

    cs.LG cs.AI

    Weakly Supervised Anomaly Detection: A Survey

    Authors: Minqi Jiang, Chaochuan Hou, Ao Zheng, Xiyang Hu, Songqiao Han, Hailiang Huang, Xiangnan He, Philip S. Yu, Yue Zhao

    Abstract: Anomaly detection (AD) is a crucial task in machine learning with various applications, such as detecting emerging diseases, identifying financial frauds, and detecting fake news. However, obtaining complete, accurate, and precise labels for AD tasks can be expensive and challenging due to the cost and difficulties in data annotation. To address this issue, researchers have developed AD methods th…

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Code available at https://github.com/yzhao062/wsad

  24. arXiv:2212.06385  [pdf, other]

    cs.CL

    TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities

    Authors: Zhe Zhao, Yudong Li, Cheng Hou, Jing Zhao, Rong Tian, Weijie Liu, Yiren Chen, Ningyuan Sun, Haoyan Liu, Weiquan Mao, Han Guo, Weigang Guo, Taiqiang Wu, Tao Zhu, Wenhang Shi, Chen Chen, Shan Huang, Sihong Chen, Liqun Liu, Feifei Li, Xiaoshuai Chen, Xingwu Sun, Zhanhui Kang, Xiaoyong Du, Linlin Shen, et al. (1 additional author not shown)

    Abstract: Recently, the success of pre-training in text domain has been fully extended to vision, audio, and cross-modal scenarios. The proposed pre-training models of different modalities are showing a rising trend of homogeneity in their model structures, which brings the opportunity to implement different pre-training models within a uniform framework. In this paper, we present TencentPretrain, a toolkit…

    Submitted 11 July, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  25. arXiv:2211.07459  [pdf, other]

    cs.CV cs.RO

    Self-Aligning Depth-regularized Radiance Fields for Asynchronous RGB-D Sequences

    Authors: Yuxin Huang, Andong Yang, Zirui Wu, Yuantao Chen, Runyi Yang, Zhenxin Zhu, Chao Hou, Hao Zhao, Guyue Zhou

    Abstract: It has been shown that learning radiance fields with depth rendering and depth supervision can effectively promote the quality and convergence of view synthesis. However, this paradigm requires input RGB-D sequences to be synchronized, hindering its usage in the UAV city modeling scenario. As there exists asynchrony between RGB images and depth images due to high-speed flight, we propose a novel t…

    Submitted 4 April, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

  26. arXiv:2210.12914  [pdf, other]

    cs.LG math.NA

    A Novel Adaptive Causal Sampling Method for Physics-Informed Neural Networks

    Authors: Jia Guo, Haifeng Wang, Chenping Hou

    Abstract: Physics-Informed Neural Networks (PINNs) have become a kind of attractive machine learning method for obtaining solutions of partial differential equations (PDEs). Training PINNs can be seen as a semi-supervised learning task, in which only exact values of initial and boundary points can be obtained in solving forward problems, and in the whole spatio-temporal domain collocation points are sampled…

    Submitted 23 October, 2022; originally announced October 2022.

  27. arXiv:2210.12020  [pdf, other]

    cs.LG cs.AI

    HCL: Improving Graph Representation with Hierarchical Contrastive Learning

    Authors: Jun Wang, Weixun Li, Changyu Hou, Xin Tang, Yixuan Qiao, Rui Fang, Pengyong Li, Peng Gao, Guotong Xie

    Abstract: Contrastive learning has emerged as a powerful tool for graph representation learning. However, most contrastive learning methods learn features of graphs with fixed coarse-grained scale, which might underestimate either local or global information. To capture more hierarchical and richer representation, we propose a novel Hierarchical Contrastive Learning (HCL) framework that explicitly learns gr…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: published at The 21st International Semantic Web Conference (ISWC 2022)

  28. arXiv:2209.08498  [pdf, other]

    cs.CV cs.RO

    LATITUDE: Robotic Global Localization with Truncated Dynamic Low-pass Filter in City-scale NeRF

    Authors: Zhenxin Zhu, Yuantao Chen, Zirui Wu, Chao Hou, Yongliang Shi, Chuxuan Li, Pengfei Li, Hao Zhao, Guyue Zhou

    Abstract: Neural Radiance Fields (NeRFs) have made great success in representing complex 3D scenes with high-resolution details and efficient memory. Nevertheless, current NeRF-based pose estimators have no initial pose prediction and are prone to local optima during optimization. In this paper, we present LATITUDE: Global Localization with Truncated Dynamic Low-pass Filter, which introduces a two-stage loc…

    Submitted 27 February, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: 7 pages, 6 figures, ICRA 2023

  29. arXiv:2205.14660  [pdf, other]

    cs.CL

    SFE-AI at SemEval-2022 Task 11: Low-Resource Named Entity Recognition using Large Pre-trained Language Models

    Authors: Changyu Hou, Jun Wang, Yixuan Qiao, Peng Jiang, Peng Gao, Guotong Xie, Qizhi Lin, Xiaopeng Wang, Xiandi Jiang, Benqi Wang, Qifeng Xiao

    Abstract: Large scale pre-training models have been widely used in named entity recognition (NER) tasks. However, model ensemble through parameter averaging or voting can not give full play to the differentiation advantages of different models, especially in the open domain. This paper describes our NER system in the SemEval 2022 task11: MultiCoNER. We proposed an effective system to adaptively ensemble pre…

    Submitted 29 May, 2022; originally announced May 2022.

  30. arXiv:2205.10839  [pdf, other]

    cs.CV

    Deep Learning for Visual Speech Analysis: A Survey

    Authors: Changchong Sheng, Gangyao Kuang, Liang Bai, Chenping Hou, Yulan Guo, Xin Xu, Matti Pietikäinen, Li Liu

    Abstract: Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the development of visual speech learning. Over the past five years, numerous deep learning based methods have bee…

    Submitted 14 March, 2024; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: 20 pages, 8 figures. Accepted by IEEE TPAMI

  31. arXiv:2205.10014  [pdf, other]

    cs.LG cs.AI

    A Survey of Trustworthy Graph Learning: Reliability, Explainability, and Privacy Protection

    Authors: Bingzhe Wu, Jintang Li, Junchi Yu, Yatao Bian, Hengtong Zhang, CHaochao Chen, Chengbin Hou, Guoji Fu, Liang Chen, Tingyang Xu, Yu Rong, Xiaolin Zheng, Junzhou Huang, Ran He, Baoyuan Wu, GUangyu Sun, Peng Cui, Zibin Zheng, Zhe Liu, Peilin Zhao

    Abstract: Deep graph learning has achieved remarkable progresses in both business and scientific areas ranging from finance and e-commerce, to drug and advanced material discovery. Despite these progresses, how to ensure various deep graph learning algorithms behave in a socially responsible manner and meet regulatory compliance requirements becomes an emerging problem, especially in risk-sensitive domains.…

    Submitted 23 May, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Preprint; Work in progress. arXiv admin note: substantial text overlap with arXiv:2202.07114

  32. TTAGN: Temporal Transaction Aggregation Graph Network for Ethereum Phishing Scams Detection

    Authors: Sijia Li, Gaopeng Gou, Chang Liu, Chengshang Hou, Zhenzhen Li, Gang Xiong

    Abstract: In recent years, phishing scams have become the most serious type of crime involved in Ethereum, the second-largest blockchain platform. The existing phishing scams detection technology on Ethereum mostly uses traditional machine learning or network representation learning to mine the key information from the transaction network to identify phishing addresses. However, these methods adopt the last…

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: WWW 2022

  33. arXiv:2204.01915  [pdf]

    cs.LG cs.CV

    An Exploration of Active Learning for Affective Digital Phenotyping

    Authors: Peter Washington, Cezmi Mutlu, Aaron Kline, Cathy Hou, Kaitlyn Dunlap, Jack Kent, Arman Husic, Nate Stockham, Brianna Chrisman, Kelley Paskov, Jae-Yoon Jung, Dennis P. Wall

    Abstract: Some of the most severe bottlenecks preventing widespread development of machine learning models for human behavior include a dearth of labeled training data and difficulty of acquiring high quality labels. Active learning is a paradigm for using algorithms to computationally select a useful subset of data points to label using metrics for model uncertainty and data similarity. We explore active l…

    Submitted 6 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  34. arXiv:2202.07114  [pdf, other]

    cs.LG cs.CR cs.IR

    Recent Advances in Reliable Deep Graph Learning: Inherent Noise, Distribution Shift, and Adversarial Attack

    Authors: Jintang Li, Bingzhe Wu, Chengbin Hou, Guoji Fu, Yatao Bian, Liang Chen, Junzhou Huang, Zibin Zheng

    Abstract: Deep graph learning (DGL) has achieved remarkable progress in both business and scientific areas ranging from finance and e-commerce to drug and advanced material discovery. Despite the progress, applying DGL to real-world applications faces a series of reliability threats including inherent noise, distribution shift, and adversarial attacks. This survey aims to provide a comprehensive review of r…

    Submitted 8 May, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Preprint. 9 pages, 2 figures

  35. arXiv:2201.00927  [pdf]

    cs.SD cs.LG eess.AS

    Classifying Autism from Crowdsourced Semi-Structured Speech Recordings: A Machine Learning Approach

    Authors: Nathan A. Chi, Peter Washington, Aaron Kline, Arman Husic, Cathy Hou, Chloe He, Kaitlyn Dunlap, Dennis Wall

    Abstract: Autism spectrum disorder (ASD) is a neurodevelopmental disorder which results in altered behavior, social development, and communication patterns. In past years, autism prevalence has tripled, with 1 in 54 children now affected. Given that traditional diagnosis is a lengthy, labor-intensive process, significant attention has been given to developing systems that automatically screen for autism. Pr…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: 17 pages, 4 figures, submitted to JMIR Pediatrics and Parenting

  36. arXiv:2108.06869  [pdf, other]

    cs.LG cs.DC math.OC

    FedChain: Chained Algorithms for Near-Optimal Communication Cost in Federated Learning

    Authors: Charlie Hou, Kiran K. Thekumparampil, Giulia Fanti, Sewoong Oh

    Abstract: Federated learning (FL) aims to minimize the communication complexity of training a model over heterogeneous data distributed across many clients. A common approach is local methods, where clients take multiple optimization steps over local data before communicating with the server (e.g., FedAvg). Local methods can exploit similarity between clients' data. However, in existing analyses, this comes…

    Submitted 16 April, 2023; v1 submitted 15 August, 2021; originally announced August 2021.

    Comments: abstract typo correction

  37. arXiv:2105.14557  [pdf, other]

    cs.SI cs.AI cs.LG

    Robust Dynamic Network Embedding via Ensembles

    Authors: Chengbin Hou, Guoji Fu, Peng Yang, Zheng Hu, Shan He, Ke Tang

    Abstract: Dynamic Network Embedding (DNE) has recently attracted considerable attention due to the advantage of network embedding in various fields and the dynamic nature of many real-world networks. An input dynamic network to DNE is often assumed to have smooth changes over snapshots, which however would not hold for all real-world scenarios. It is natural to ask if existing DNE methods can perform well f…

    Submitted 30 November, 2021; v1 submitted 30 May, 2021; originally announced May 2021.

  38. arXiv:2105.09143  [pdf, other]

    eess.IV cs.CV

    Adaptive Hypergraph Convolutional Network for No-Reference 360-degree Image Quality Assessment

    Authors: Jun Fu, Chen Hou, Wei Zhou, Jiahua Xu, Zhibo Chen

    Abstract: In no-reference 360-degree image quality assessment (NR 360IQA), graph convolutional networks (GCNs), which model interactions between viewports through graphs, have achieved impressive performance. However, prevailing GCN-based NR 360IQA methods suffer from three main limitations. First, they only use high-level features of the distorted image to regress the quality score, while the human visual…

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 10 pages

  39. arXiv:2102.06333  [pdf, other]

    cs.LG cs.DC math.OC

    Efficient Algorithms for Federated Saddle Point Optimization

    Authors: Charlie Hou, Kiran K. Thekumparampil, Giulia Fanti, Sewoong Oh

    Abstract: We consider strongly convex-concave minimax problems in the federated setting, where the communication constraint is the main bottleneck. When clients are arbitrarily heterogeneous, a simple Minibatch Mirror-prox achieves the best performance. As the clients become more homogeneous, using multiple local gradient updates at the clients significantly improves upon Minibatch Mirror-prox by communicat…

    Submitted 11 February, 2021; originally announced February 2021.

  40. arXiv:2101.10983  [pdf, other]

    cs.LG

    Unsupervised clustering of series using dynamic programming and neural processes

    Authors: Karthigan Sinnathamby, Chang-Yu Hou, Lalitha Venkataramanan, Vasileios-Marios Gkortsas, François Fleuret

    Abstract: Following the work of arXiv:2101.09512, we are interested in clustering a given multi-variate series in an unsupervised manner. We would like to segment and cluster the series such that the resulting blocks present in each cluster are coherent with respect to a predefined model structure (e.g. a physics model with a functional form defined by a number of parameters). However, such approach might h…

    Submitted 26 January, 2021; originally announced January 2021.

  41. arXiv:2101.09512  [pdf, other]

    cs.LG stat.ML

    Unsupervised clustering of series using dynamic programming

    Authors: Karthigan Sinnathamby, Chang-Yu Hou, Lalitha Venkataramanan, Vasileios-Marios Gkortsas, François Fleuret

    Abstract: We are interested in clustering parts of a given single multi-variate series in an unsupervised manner. We would like to segment and cluster the series such that the resulting blocks present in each cluster are coherent with respect to a known model (e.g. physics model). Data points are said to be coherent if they can be described using this model with the same parameters. We have designed an algo…

    Submitted 23 January, 2021; originally announced January 2021.

  42. arXiv:2101.03478  [pdf]

    cs.CV cs.HC

    Activity Recognition with Moving Cameras and Few Training Examples: Applications for Detection of Autism-Related Headbanging

    Authors: Peter Washington, Aaron Kline, Onur Cezmi Mutlu, Emilie Leblanc, Cathy Hou, Nate Stockham, Kelley Paskov, Brianna Chrisman, Dennis P. Wall

    Abstract: Activity recognition computer vision algorithms can be used to detect the presence of autism-related behaviors, including what are termed "restricted and repetitive behaviors", or stimming, by diagnostic instruments. The limited data that exist in this domain are usually recorded with a handheld camera which can be shaky or even moving, posing a challenge for traditional feature representation app…

    Submitted 10 January, 2021; originally announced January 2021.

  43. arXiv:2101.03477  [pdf]

    cs.CV cs.HC

    Training Affective Computer Vision Models by Crowdsourcing Soft-Target Labels

    Authors: Peter Washington, Onur Cezmi Mutlu, Emilie Leblanc, Aaron Kline, Cathy Hou, Brianna Chrisman, Nate Stockham, Kelley Paskov, Catalin Voss, Nick Haber, Dennis Wall

    Abstract: Emotion classifiers traditionally predict discrete emotions. However, emotion expressions are often subjective, thus requiring a method to handle subjective labels. We explore the use of crowdsourcing to acquire reliable soft-target labels and evaluate an emotion detection classifier trained with these labels. We center our study on the Child Affective Facial Expression (CAFE) dataset, a gold stan…

    Submitted 22 September, 2021; v1 submitted 10 January, 2021; originally announced January 2021.

  44. arXiv:2012.08678  [pdf]

    cs.CV cs.CY cs.HC

    Improved Digital Therapy for Developmental Pediatrics Using Domain-Specific Artificial Intelligence: Machine Learning Study

    Authors: Peter Washington, Haik Kalantarian, John Kent, Arman Husic, Aaron Kline, Emilie Leblanc, Cathy Hou, Onur Cezmi Mutlu, Kaitlyn Dunlap, Yordan Penev, Maya Varma, Nate Tyler Stockham, Brianna Chrisman, Kelley Paskov, Min Woo Sun, Jae-Yoon Jung, Catalin Voss, Nick Haber, Dennis Paul Wall

    Abstract: Background: Automated emotion classification could aid those who struggle to recognize emotions, including children with developmental behavioral conditions such as autism. However, most computer vision emotion recognition models are trained on adult emotion and therefore underperform when applied to child faces. Objective: We designed a strategy to gamify the collection and labeling of child emot…

    Submitted 3 June, 2024; v1 submitted 15 December, 2020; originally announced December 2020.

    Journal ref: JMIR pediatrics and parenting 5.2 (2022): e26760

  45. GloDyNE: Global Topology Preserving Dynamic Network Embedding

    Authors: Chengbin Hou, Han Zhang, Shan He, Ke Tang

    Abstract: Learning low-dimensional topological representation of a network in dynamic environments is attracting much attention due to the time-evolving nature of many real-world networks. The main and common objective of Dynamic Network Embedding (DNE) is to efficiently update node embeddings while preserving network topology at each time step. The idea of most existing DNE methods is to capture the topolo…

    Submitted 5 December, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: Accepted by IEEE-TKDE 2020

  46. Latent Complete Row Space Recovery for Multi-view Subspace Clustering

    Authors: Hong Tao, Chenping Hou, Yuhua Qian, Jubo Zhu, Dongyun Yi

    Abstract: Multi-view subspace clustering has been applied to applications such as image processing and video surveillance, and has attracted increasing attention. Most existing methods learn view-specific self-representation matrices, and construct a combined affinity matrix from multiple views. The affinity construction process is time-consuming, and the combined affinity matrix is not guaranteed to reflec…

    Submitted 16 December, 2019; originally announced December 2019.

  47. arXiv:1912.01798  [pdf, other]

    cs.CR

    SquirRL: Automating Attack Analysis on Blockchain Incentive Mechanisms with Deep Reinforcement Learning

    Authors: Charlie Hou, Mingxun Zhou, Yan Ji, Phil Daian, Florian Tramer, Giulia Fanti, Ari Juels

    Abstract: Incentive mechanisms are central to the functionality of permissionless blockchains: they incentivize participants to run and secure the underlying consensus protocol. Designing incentive-compatible incentive mechanisms is notoriously challenging, however. As a result, most public blockchains today use incentive mechanisms whose security properties are poorly understood and largely untested. In th…

    Submitted 4 August, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

  48. arXiv:1907.11968  [pdf, other]

    cs.SI cs.LG physics.soc-ph

    DynWalks: Global Topology and Recent Changes Awareness Dynamic Network Embedding

    Authors: Chengbin Hou, Han Zhang, Ke Tang, Shan He

    Abstract: Learning topological representation of a network in dynamic environments has recently attracted considerable attention due to the time-evolving nature of many real-world networks i.e. nodes/links might be added/removed as time goes on. Dynamic network embedding aims to learn low dimensional embeddings for unseen and seen nodes by using any currently available snapshots of a dynamic network. For se…

    Submitted 27 July, 2019; originally announced July 2019.

    Comments: 14 pages, 7 figures, 5 tables

  49. arXiv:1902.06684  [pdf, other]

    cs.SI cs.LG stat.ML

    Learning Topological Representation for Networks via Hierarchical Sampling

    Authors: Guoji Fu, Chengbin Hou, Xin Yao

    Abstract: The topological information is essential for studying the relationship between nodes in a network. Recently, Network Representation Learning (NRL), which projects a network into a low-dimensional vector space, has been shown their advantages in analyzing large-scale networks. However, most existing NRL methods are designed to preserve the local topology of a network, they fail to capture the globa…

    Submitted 15 February, 2019; originally announced February 2019.

  50. Joint Embedding Learning and Low-Rank Approximation: A Framework for Incomplete Multi-view Learning

    Authors: Hong Tao, Chenping Hou, Dongyun Yi, Jubo Zhu, Dewen Hu

    Abstract: In real-world applications, not all instances in multi-view data are fully represented. To deal with incomplete data, Incomplete Multi-view Learning (IML) rises. In this paper, we propose the Joint Embedding Learning and Low-Rank Approximation (JELLA) framework for IML. The JELLA framework approximates the incomplete data by a set of low-rank matrices and learns a full and common embedding by line…

    Submitted 16 December, 2019; v1 submitted 24 December, 2018; originally announced December 2018.