Skip to main content

Showing 1–50 of 169 results for author: An, J

  1. arXiv:2407.12987  [pdf, other

    cs.CV

    ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos

    Authors: Hyolim Kang, Jeongseok Hyun, Joungbin An, Youngjae Yu, Seon Joo Kim

    Abstract: Online Temporal Action Localization (On-TAL) is a critical task that aims to instantaneously identify action instances in untrimmed streaming videos as soon as an action concludes -- a major leap from frame-based Online Action Detection (OAD). Yet, the challenge of detecting overlapping actions is often overlooked even though it is a common scenario in streaming videos. Current methods that can ad… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: ECCV2024

  2. arXiv:2407.12687  [pdf, other

    cs.CY cs.AI cs.LG

    Towards Responsible Development of Generative AI for Education: An Evaluation-Driven Approach

    Authors: Irina Jurenka, Markus Kunesch, Kevin R. McKee, Daniel Gillick, Shaojian Zhu, Sara Wiltberger, Shubham Milind Phal, Katherine Hermann, Daniel Kasenberg, Avishkar Bhoopchand, Ankit Anand, Miruna Pîslar, Stephanie Chan, Lisa Wang, Jennifer She, Parsa Mahmoudieh, Aliya Rysbek, Wei-Jen Ko, Andrea Huber, Brett Wiltshire, Gal Elidan, Roni Rabin, Jasmin Rubinovitz, Amit Pitaru, Mac McAllister , et al. (49 additional authors not shown)

    Abstract: A major challenge facing the world is the provision of equitable and universal access to quality education. Recent advances in generative AI (gen AI) have created excitement about the potential of new technologies to offer a personal tutor for every learner and a teaching assistant for every teacher. The full extent of this dream, however, has not yet materialised. We argue that this is primarily… ▽ More

    Submitted 21 May, 2024; originally announced July 2024.

  3. arXiv:2407.04944  [pdf, other

    eess.SP cs.IT

    Flexible Antenna Arrays for Wireless Communications: Modeling and Performance Evaluation

    Authors: Songjie Yang, Jiancheng An, Yue Xiu, Wanting Lyu, Boyu Ning, Zhongpei Zhang, Merouane Debbah, Chau Yuen

    Abstract: Flexible antenna arrays (FAAs), distinguished by their rotatable, bendable, and foldable properties, are extensively employed in flexible radio systems to achieve customized radiation patterns. This paper aims to illustrate that FAAs, capable of dynamically adjusting surface shapes, can enhance communication performances with both omni-directional and directional antenna patterns, in terms of mult… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2407.03566  [pdf, ps, other

    cs.IT eess.SP

    Stacked Intelligent Metasurfaces for Wireless Sensing and Communication: Applications and Challenges

    Authors: Hao Liu, Jiancheng An, Xing Jia, Shining Lin, Xianghao Yao, Lu Gan, Bruno Clerckx, Chau Yuen, Mehdi Bennis, Mérouane Debbah

    Abstract: The rapid advancement of wireless communication technologies has precipitated an unprecedented demand for high data rates, extremely low latency, and ubiquitous connectivity. In order to achieve these goals, stacked intelligent metasurfaces (SIM) has been developed as a novel solution to perform advanced signal processing tasks directly in the electromagnetic wave domain, thus achieving ultra-fast… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 5 figures, 1 table

  5. arXiv:2406.09058  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook Design for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Ertugrul Basar, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can proactively reshape the characteristics of wireless channel environments. In RIS-assisted communication systems, the acquisition of channel state information (CSI) and the optimization of reflecting coefficients constitute major design challenges. To address these issues, codebook-based sol… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 36 pages, 12 figures, 2 tables, accepted by IEEE TCOM. arXiv admin note: text overlap with arXiv:2404.00265

  6. arXiv:2406.01079  [pdf, other

    cs.CV cs.AI

    Object Aware Egocentric Online Action Detection

    Authors: Joungbin An, Yunsu Park, Hyolim Kang, Seon Joo Kim

    Abstract: Advancements in egocentric video datasets like Ego4D, EPIC-Kitchens, and Ego-Exo4D have enriched the study of first-person human interactions, which is crucial for applications in augmented reality and assisted living. Despite these advancements, current Online Action Detection methods, which efficiently detect actions in streaming videos, are predominantly designed for exocentric views and thus f… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: CVPR First Joint Egocentric Vision Workshop 2024

  7. arXiv:2405.20775  [pdf, other

    cs.CR cs.AI cs.CL cs.MM

    Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models

    Authors: Xijie Huang, Xinyuan Wang, Hantao Zhang, Jiawen Xi, Jingkun An, Hao Wang, Chengwei Pan

    Abstract: Security concerns related to Large Language Models (LLMs) have been extensively explored, yet the safety implications for Multimodal Large Language Models (MLLMs), particularly in medical contexts (MedMLLMs), remain insufficiently studied. This paper delves into the underexplored security vulnerabilities of MedMLLMs, especially when deployed in clinical environments where the accuracy and relevanc… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  8. arXiv:2405.20584  [pdf, other

    cs.CV cs.AI

    Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization

    Authors: Yisu Liu, Jinyang An, Wanqian Zhang, Dayan Wu, Jingzi Gu, Zheng Lin, Weiping Wang

    Abstract: With the development of diffusion-based customization methods like DreamBooth, individuals now have access to train the models that can generate their personalized images. Despite the convenience, malicious users have misused these techniques to create fake images, thereby triggering a privacy security crisis. In light of this, proactive adversarial attacks are proposed to protect users against cu… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Under review

    ACM Class: I.2.10

  9. arXiv:2405.09753  [pdf, other

    cs.IT eess.SP

    Stacked Intelligent Metasurfaces for Holographic MIMO Aided Cell-Free Networks

    Authors: Qingchao Li, Mohammed El-Hajjar, Chao Xu, Jiancheng An, Chau Yuen, Lajos Hanzo

    Abstract: Large-scale multiple-input and multiple-output (MIMO) systems are capable of achieving high date rate. However, given the high hardware cost and excessive power consumption of massive MIMO systems, as a remedy, intelligent metasurfaces have been designed for efficient holographic MIMO (HMIMO) systems. In this paper, we propose a HMIMO architecture based on stacked intelligent metasurfaces (SIM) fo… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  10. arXiv:2404.11465  [pdf, other

    cs.SI

    X-posing Free Speech: Examining the Impact of Moderation Relaxation on Online Social Networks

    Authors: Arvindh Arun, Saurav Chhatani, Jisun An, Ponnurangam Kumaraguru

    Abstract: We investigate the impact of free speech and the relaxation of moderation on online social media platforms using Elon Musk's takeover of Twitter as a case study. By curating a dataset of over 10 million tweets, our study employs a novel framework combining content and network analysis. Our findings reveal a significant increase in the distribution of certain forms of hate content, particularly tar… ▽ More

    Submitted 23 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  11. arXiv:2404.10633  [pdf, other

    cs.CV

    Contextrast: Contextual Contrastive Learning for Semantic Segmentation

    Authors: Changki Sung, Wanhee Kim, Jungho An, Wooju Lee, Hyungtae Lim, Hyun Myung

    Abstract: Despite great improvements in semantic segmentation, challenges persist because of the lack of local/global contexts and the relationship between them. In this paper, we propose Contextrast, a contrastive learning-based semantic segmentation method that allows to capture local/global contexts and comprehend their relationships. Our proposed method comprises two parts: a) contextual contrastive lea… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  12. Learning Deterministic Multi-Clock Timed Automata

    Authors: Yu Teng, Miaomiao Zhang, Jie An

    Abstract: We present an algorithm for active learning of deterministic timed automata with multiple clocks. The algorithm is within the querying framework of Angluin's $L^*$ algorithm and follows the idea proposed in existing work on the active learning of deterministic one-clock timed automata. We introduce an equivalence relation over the reset-clocked language of a timed automaton and then transform the… ▽ More

    Submitted 20 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 20 pages. It is an author version of the paper with the same title accepted by HSCC 2024

  13. arXiv:2404.02126  [pdf, other

    cs.CL cs.IR

    Rematch: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity

    Authors: Zoher Kachwala, Jisun An, Haewoon Kwak, Filippo Menczer

    Abstract: Knowledge graphs play a pivotal role in various applications, such as question-answering and fact-checking. Abstract Meaning Representation (AMR) represents text as knowledge graphs. Evaluating the quality of these graphs involves matching them structurally to each other and semantically to the source text. Existing AMR metrics are inefficient and struggle to capture semantic similarity. We also l… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: To be published in NAACL24 proceedings

  14. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  15. arXiv:2404.00265  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can reshape the characteristics of wireless channels. In this paper, we propose a novel environment-aware codebook protocol for RIS-assisted multi-user multiple-input single-output (MU-MISO) systems. Specifically, we first introduce a channel training protocol which consists of off-line and on-… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures, accepted by VTC2024-Spring

  16. Algorithmic Ways of Seeing: Using Object Detection to Facilitate Art Exploration

    Authors: Louie Søs Meyer, Johanne Engel Aaen, Anitamalina Regitse Tranberg, Peter Kun, Matthias Freiberger, Sebastian Risi, Anders Sundnes Løvlie

    Abstract: This Research through Design paper explores how object detection may be applied to a large digital art museum collection to facilitate new ways of encountering and experiencing art. We present the design and evaluation of an interactive application called SMKExplore, which allows users to explore a museum's digital collection of paintings by browsing through objects detected in the images, as a no… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  17. arXiv:2403.18231  [pdf, ps, other

    cs.IT

    The Dimensions of the Hulls of Conorm Codes from Algebraic Geometry Codes

    Authors: Junmin An, Jon-Lark Kim

    Abstract: Chara et al. introduced conorm codes defined over algebraic geometry codes, but the hulls of conorm codes were not determined yet. In this paper, we study the dimension of the hull of conorm codes using the method introduced by Camps et al. For an algebraic geometry code $\mathcal{C}:=C_\mathscr{L}(D, G)$, we consider the divisor $\gcd(G, H)$, where $H$ is the divisor satisfying \[C_\mathscr{L}(D,… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    MSC Class: 94B27

  18. arXiv:2403.17368  [pdf, other

    cs.CL cs.AI

    ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales?

    Authors: Fan Huang, Haewoon Kwak, Kunwoo Park, Jisun An

    Abstract: As AI becomes more integral in our lives, the need for transparency and responsibility grows. While natural language explanations (NLEs) are vital for clarifying the reasoning behind AI decisions, evaluating them through human judgments is complex and resource-intensive due to subjectivity and the need for fine-grained ratings. This study explores the alignment between ChatGPT and human assessment… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accpeted by LREC-COLING 2024 main conference, long paper

  19. arXiv:2403.13352  [pdf, other

    cs.CV

    AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation

    Authors: Jingkun An, Yinghao Zhu, Zongjian Li, Haoran Feng, Bohua Chen, Yemin Shi, Chengwei Pan

    Abstract: Text-to-Image (T2I) diffusion models have achieved remarkable success in image generation. Despite their progress, challenges remain in both prompt-following ability, image quality and lack of high-quality datasets, which are essential for refining these models. As acquiring labeled data is costly, we introduce AGFSync, a framework that enhances T2I diffusion models through Direct Preference Optim… ▽ More

    Submitted 3 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  20. Stacked Intelligent Metasurface Enabled LEO Satellite Communications Relying on Statistical CSI

    Authors: Shining Lin, Jiancheng An, Lu Gan, Mérouane Debbah, Chau Yuen

    Abstract: Low earth orbit (LEO) satellite communication systems have gained increasing attention as a crucial supplement to terrestrial wireless networks due to their extensive coverage area. This letter presents a novel system design for LEO satellite systems by leveraging stacked intelligent metasurface (SIM) technology. Specifically, the lightweight and energy-efficient SIM is mounted on a satellite to a… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures, accepted by IEEE WCL

  21. Channel Estimation for Stacked Intelligent Metasurface-Assisted Wireless Networks

    Authors: Xianghao Yao, Jiancheng An, Lu Gan, Marco Di Renzo, Chau Yuen

    Abstract: Emerging technologies, such as holographic multiple-input multiple-output (HMIMO) and stacked intelligent metasurface (SIM), are driving the development of wireless communication systems. Specifically, the SIM is physically constructed by stacking multiple layers of metasurfaces and has an architecture similar to an artificial neural network (ANN), which can flexibly manipulate the electromagnetic… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 13 pages, 3 figures, accepted by IEEE WCL

  22. arXiv:2403.00236  [pdf, other

    cs.CL cs.AI cs.LG

    Benchmarking zero-shot stance detection with FlanT5-XXL: Insights from training data, prompting, and decoding strategies into its near-SoTA performance

    Authors: Rachith Aiyappa, Shruthi Senthilmani, Jisun An, Haewoon Kwak, Yong-Yeol Ahn

    Abstract: We investigate the performance of LLM-based zero-shot stance detection on tweets. Using FlanT5-XXL, an instruction-tuned open-source LLM, with the SemEval 2016 Tasks 6A, 6B, and P-Stance datasets, we study the performance and its variations under different prompts and decoding strategies, as well as the potential biases of the model. We show that the zero-shot approach can match or outperform stat… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  23. arXiv:2402.16415  [pdf, other

    cs.IT

    Achievable Rate Optimization for Stacked Intelligent Metasurface-Assisted Holographic MIMO Communications

    Authors: Anastasios Papazafeiropoulos, Jiancheng An, Pandelis Kourtessis, Tharmalingam Ratnarajah, Symeon Chatzinotas

    Abstract: Stacked intelligent metasurfaces (SIM) is a revolutionary technology, which can outperform its single-layer counterparts by performing advanced signal processing relying on wave propagation. In this work, we exploit SIM to enable transmit precoding and receiver combining in holographic multiple-input multiple-output (HMIMO) communications, and we study the achievable rate by formulating a joint op… ▽ More

    Submitted 8 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  24. arXiv:2402.13740  [pdf, other

    cs.CL

    From Text to CQL: Bridging Natural Language and Corpus Search Engine

    Authors: Luming Lu, Jiyuan An, Yujie Wang, Liner yang, Cunliang Kong, Zhenghao Liu, Shuo Wang, Haozhe Lin, Mingwei Fang, Yaping Huang, Erhong Yang

    Abstract: Natural Language Processing (NLP) technologies have revolutionized the way we interact with information systems, with a significant focus on converting natural language queries into formal query languages such as SQL. However, less emphasis has been placed on the Corpus Query Language (CQL), a critical tool for linguistic research and detailed analysis within text corpora. The manual construction… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  25. arXiv:2402.11167  [pdf, other

    cs.CL cs.AI

    Token-Ensemble Text Generation: On Attacking the Automatic AI-Generated Text Detection

    Authors: Fan Huang, Haewoon Kwak, Jisun An

    Abstract: The robustness of AI-content detection models against cultivated attacks (e.g., paraphrasing or word switching) remains a significant concern. This study proposes a novel token-ensemble generation strategy to challenge the robustness of current AI-content detection approaches. We explore the ensemble attack strategy by completing the prompt with the next token generated from random candidate LLMs.… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Submitted to ACL 2024

  26. arXiv:2402.08224  [pdf, ps, other

    cs.IT eess.SP

    Two-Dimensional Direction-of-Arrival Estimation Using Stacked Intelligent Metasurfaces

    Authors: Jiancheng An, Chau Yuen, Yong Liang Guan, Marco Di Renzo, Mérouane Debbah, H. Vincent Poor, Lajos Hanzo

    Abstract: Stacked intelligent metasurfaces (SIM) are capable of emulating reconfigurable physical neural networks by relying on electromagnetic (EM) waves as carriers. They can also perform various complex computational and signal processing tasks. A SIM is fabricated by densely integrating multiple metasurface layers, each consisting of a large number of small meta-atoms that can control the EM waves passi… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 37 pages, 12 figures, and 2 tables. arXiv admin note: text overlap with arXiv:2310.09861

  27. arXiv:2402.01713  [pdf, other

    cs.CL cs.AI cs.LG

    Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data

    Authors: Yinghao Zhu, Zixiang Wang, Junyi Gao, Yuning Tong, Jingkun An, Weibin Liao, Ewen M. Harrison, Liantao Ma, Chengwei Pan

    Abstract: The inherent complexity of structured longitudinal Electronic Health Records (EHR) data poses a significant challenge when integrated with Large Language Models (LLMs), which are traditionally tailored for natural language processing. Motivated by the urgent need for swift decision-making during new disease outbreaks, where traditional predictive models often fail due to a lack of historical data,… ▽ More

    Submitted 10 February, 2024; v1 submitted 25 January, 2024; originally announced February 2024.

  28. arXiv:2401.14008  [pdf, other

    cs.IT eess.SP

    Massive Unsourced Random Access for Near-Field Communications

    Authors: Xinyu Xie, Yongpeng Wu, Jianping An, Derrick Wing Kwan Ng, Chengwen Xing, Wenjun Zhang

    Abstract: This paper investigates the unsourced random access (URA) problem with a massive multiple-input multiple-output receiver that serves wireless devices in the near-field of radiation. We employ an uncoupled transmission protocol without appending redundancies to the slot-wise encoded messages. To exploit the channel sparsity for block length reduction while facing the collapsed sparse structure in t… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE Transactions on Communications

  29. arXiv:2401.11161  [pdf, other

    cs.SE

    BinaryAI: Binary Software Composition Analysis via Intelligent Binary Source Code Matching

    Authors: Ling Jiang, Junwen An, Huihui Huang, Qiyi Tang, Sen Nie, Shi Wu, Yuqun Zhang

    Abstract: While third-party libraries are extensively reused to enhance productivity during software development, they can also introduce potential security risks such as vulnerability propagation. Software composition analysis, proposed to identify reused TPLs for reducing such risks, has become an essential procedure within modern DevSecOps. As one of the mainstream SCA techniques, binary-to-source SCA id… ▽ More

    Submitted 23 January, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: In Proceedings of the 46th International Conference on Software Engineering (ICSE'24)

  30. arXiv:2401.09455  [pdf, other

    cs.NI cs.AI cs.LG eess.SY

    Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

    Authors: Yifeng Lyu, Han Hu, Rongfei Fan, Zhi Liu, Jianping An, Shiwen Mao

    Abstract: The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to sat… ▽ More

    Submitted 22 December, 2023; originally announced January 2024.

  31. arXiv:2401.02414  [pdf, other

    cs.CV

    Bring Metric Functions into Diffusion Models

    Authors: Jie An, Zhengyuan Yang, Jianfeng Wang, Linjie Li, Zicheng Liu, Lijuan Wang, Jiebo Luo

    Abstract: We introduce a Cascaded Diffusion Model (Cas-DM) that improves a Denoising Diffusion Probabilistic Model (DDPM) by effectively incorporating additional metric functions in training. Metric functions such as the LPIPS loss have been proven highly effective in consistency models derived from the score matching. However, for the diffusion counterparts, the methodology and efficacy of adding extra met… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  32. arXiv:2312.17432  [pdf, other

    cs.CV cs.CL

    Video Understanding with Large Language Models: A Survey

    Authors: Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu

    Abstract: With the burgeoning growth of online video platforms and the escalating volume of video content, the demand for proficient video understanding tools has intensified markedly. Given the remarkable capabilities of Large Language Models (LLMs) in language and multimodal tasks, this survey provides a detailed overview of the recent advancements in video understanding harnessing the power of LLMs (Vid-… ▽ More

    Submitted 3 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  33. arXiv:2311.11812  [pdf, other

    cs.AI

    Improving Real Estate Appraisal with POI Integration and Areal Embedding

    Authors: Sumin Han, Youngjun Park, Sonia Sabir, Jisun An, Dongman Lee

    Abstract: Despite advancements in real estate appraisal methods, this study primarily focuses on two pivotal challenges. Firstly, we explore the often-underestimated impact of Points of Interest (POI) on property values, emphasizing the necessity for a comprehensive, data-driven approach to feature selection. Secondly, we integrate road-network-based Areal Embedding to enhance spatial understanding for real… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  34. arXiv:2311.09814  [pdf, ps, other

    cs.IT eess.SP

    Stacked Intelligent Metasurface-Aided MIMO Transceiver Design

    Authors: Jiancheng An, Chau Yuen, Chao Xu, Hongbin Li, Derrick Wing Kwan Ng, Marco Di Renzo, Mérouane Debbah, Lajos Hanzo

    Abstract: Next-generation wireless networks are expected to utilize the limited radio frequency (RF) resources more efficiently with the aid of intelligent transceivers. To this end, we propose a promising transceiver architecture relying on stacked intelligent metasurfaces (SIM). An SIM is constructed by stacking an array of programmable metasurface layers, where each layer consists of a massive number of… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures, 1 table

  35. arXiv:2311.00285  [pdf, ps, other

    cs.CV cs.LG

    Mixture-of-Experts for Open Set Domain Adaptation: A Dual-Space Detection Approach

    Authors: Zhenbang Du, Jiayu An, Yunlu Tu, Jiahao Hong, Dongrui Wu

    Abstract: Open Set Domain Adaptation (OSDA) aims to cope with the distribution and label shifts between the source and target domains simultaneously, performing accurate classification for known classes while identifying unknown class samples in the target domain. Most existing OSDA approaches, depending on the final image feature space of deep models, require manually-tuned thresholds, and may easily miscl… ▽ More

    Submitted 3 July, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  36. arXiv:2310.09861  [pdf, ps, other

    cs.IT eess.SP

    Stacked Intelligent Metasurface Performs a 2D DFT in the Wave Domain for DOA Estimation

    Authors: Jiancheng An, Chau Yuen, Marco Di Renzo, Merouane Debbah, H. Vincent Poor, Lajos Hanzo

    Abstract: Staked intelligent metasurface (SIM) based techniques are developed to perform two-dimensional (2D) direction-of-arrival (DOA) estimation. In contrast to the conventional designs, an advanced SIM in front of the receiving array automatically performs the 2D discrete Fourier transform (DFT) as the incident waves propagate through it. To arrange for the SIM to carry out this task, we design a gradie… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 16 pages, 5 figures, submitted to IEEE ICC 2024

  37. arXiv:2310.09848  [pdf

    cs.CL

    Enhancing Stance Classification with Quantified Moral Foundations

    Authors: Hong Zhang, Prasanta Bhattacharya, Wei Gao, Liang Ze Wong, Brandon Siyuan Loh, Joseph J. P. Simons, Jisun An

    Abstract: This study enhances stance detection on social media by incorporating deeper psychological attributes, specifically individuals' moral foundations. These theoretically-derived dimensions aim to provide a comprehensive profile of an individual's moral concerns which, in recent work, has been linked to behaviour in a range of domains, including society, politics, health, and the environment. In this… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 11 pages, 5 figures

  38. arXiv:2310.07749  [pdf, other

    cs.CV

    OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation

    Authors: Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo

    Abstract: This work investigates a challenging task named open-domain interleaved image-text generation, which generates interleaved texts and images following an input query. We propose a new interleaved generation framework based on prompting large-language models (LLMs) and pre-trained text-to-image (T2I) models, namely OpenLEAF. In OpenLEAF, the LLM generates textual descriptions, coordinates T2I models… ▽ More

    Submitted 3 November, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  39. arXiv:2309.16204  [pdf, other

    cs.IT eess.SP

    Hybrid Digital-Wave Domain Channel Estimator for Stacked Intelligent Metasurface Enabled Multi-User MISO Systems

    Authors: Qurrat-Ul-Ain Nadeem, Jiancheng An, Anas Chaaban

    Abstract: Stacked intelligent metasurface (SIM) is an emerging programmable metasurface architecture that can implement signal processing directly in the electromagnetic wave domain, thereby enabling efficient implementation of ultra-massive multiple-input multiple-output (MIMO) transceivers with a limited number of radio frequency (RF) chains. Channel estimation (CE) is challenging for SIM-enabled communic… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  40. arXiv:2309.09242  [pdf, ps, other

    cs.IT eess.SP

    Toward Beamfocusing-Aided Near-Field Communications: Research Advances, Potential, and Challenges

    Authors: Jiancheng An, Chau Yuen, Linglong Dai, Marco Di Renzo, Merouane Debbah, Lajos Hanzo

    Abstract: Next-generation mobile networks promise to support high throughput, massive connectivity, and improved energy efficiency. To achieve these ambitious goals, extremely large-scale antenna arrays (ELAAs) and terahertz communications constitute a pair of promising technologies. This will result in future wireless communications occurring in the near-field regions. To accurately portray the channel cha… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures, 1 table

  41. arXiv:2309.05015  [pdf, other

    cs.CV cs.DC cs.PF

    DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices

    Authors: Guanyu Xu, Zhiwei Hao, Yong Luo, Han Hu, Jianping An, Shiwen Mao

    Abstract: Recent years have witnessed the great success of vision transformer (ViT), which has achieved state-of-the-art performance on multiple computer vision benchmarks. However, ViT models suffer from vast amounts of parameters and high computation cost, leading to difficult deployment on resource-constrained edge devices. Existing solutions mostly compress ViT models to a compact model but still cannot… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE Transactions on Mobile Computing

  42. arXiv:2309.02687  [pdf, ps, other

    cs.IT eess.SP

    Stacked Intelligent Metasurfaces for Multiuser Downlink Beamforming in the Wave Domain

    Authors: Jiancheng An, Marco Di Renzo, Merouane Debbah, H. Vincent Poor, Chau Yuen

    Abstract: Intelligent metasurface has recently emerged as a promising technology that enables the customization of wireless environments by harnessing large numbers of inexpensive configurable scattering elements. However, prior studies have predominantly focused on single-layer metasurfaces, which have limitations in terms of the number of beam patterns they can steer accurately due to practical hardware r… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 32 pages, 6 figures, submitted to IEEE TWC

  43. arXiv:2308.14099  [pdf, ps, other

    cs.IT eess.SP

    Pilot Power Allocation for Channel Estimation in a Multi-RIS Aided Communication System

    Authors: Jiancheng An, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) is a promising technology that enables the customization of electromagnetic propagation environments in next-generation wireless networks. In this paper, we investigate the optimal pilot power allocation during the channel estimation stage to improve the ergodic channel gain of RIS-assisted systems under practical imperfect channel state information (CSI).… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: 16 pages, 5 figures, accepted by IEEE GLOBECOM 2023. arXiv admin note: substantial text overlap with arXiv:2110.11534

  44. Enhancing Spatiotemporal Traffic Prediction through Urban Human Activity Analysis

    Authors: Sumin Han, Youngjun Park, Minji Lee, Jisun An, Dongman Lee

    Abstract: Traffic prediction is one of the key elements to ensure the safety and convenience of citizens. Existing traffic prediction models primarily focus on deep learning architectures to capture spatial and temporal correlation. They often overlook the underlying nature of traffic. Specifically, the sensor networks in most traffic datasets do not accurately represent the actual road network exploited by… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: CIKM 2023

  45. arXiv:2308.09889  [pdf, other

    cs.CV cs.CR cs.LG

    DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion Customization

    Authors: Xiaoyu Ye, Hao Huang, Jiaqi An, Yongtao Wang

    Abstract: Stable Diffusion (SD) customization approaches enable users to personalize SD model outputs, greatly enhancing the flexibility and diversity of AI art. However, they also allow individuals to plagiarize specific styles or subjects from copyrighted images, which raises significant concerns about potential copyright infringement. To address this issue, we propose an invisible data-free universal adv… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 12 pages, 11 figures

  46. Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation

    Authors: Alexander Martin, Haitian Zheng, Jie An, Jiebo Luo

    Abstract: With a strong understanding of the target domain from natural language, we produce promising results in translating across large domain gaps and bringing skeletons back to life. In this work, we use text-guided latent diffusion models for zero-shot image-to-image translation (I2I) across large domain gaps (longI2I), where large amounts of new visual features and new geometry need to be generated t… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 9 pages, 10 figures, ACM Multimedia 2023

    ACM Class: I.4; I.7

  47. Domain-Scalable Unpaired Image Translation via Latent Space Anchoring

    Authors: Siyu Huang, Jie An, Donglai Wei, Zudi Lin, Jiebo Luo, Hanspeter Pfister

    Abstract: Unpaired image-to-image translation (UNIT) aims to map images between two visual domains without paired training data. However, given a UNIT model trained on certain domains, it is difficult for current methods to incorporate new domains because they often need to train the full model on both existing and new domains. To address this problem, we propose a new domain-scalable UNIT method, termed as… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepeted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). Code is available at https://github.com/siyuhuang/Latent-Space-Anchoring

  48. arXiv:2306.13893  [pdf, other

    eess.SP cs.AI cs.CV

    Radio Generation Using Generative Adversarial Networks with An Unrolled Design

    Authors: Weidong Wang, Jiancheng An, Hongshu Liao, Lu Gan, Chau Yuen

    Abstract: As a revolutionary generative paradigm of deep learning, generative adversarial networks (GANs) have been widely applied in various fields to synthesize realistic data. However, it is challenging for conventional GANs to synthesize raw signal data, especially in some complex cases. In this paper, we develop a novel GAN framework for radio generation called "Radio GAN". Compared to conventional met… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Submitted to IEEE Transactions on Cognitive Communications and Networking on 20-Dec-2022

  49. arXiv:2306.10491  [pdf

    cs.CV cs.RO

    A Study on Quantifying Sim2Real Image Gap in Autonomous Driving Simulations Using Lane Segmentation Attention Map Similarity

    Authors: Seongjeong Park, Jinu Pahk, Lennart Lorenz Freimuth Jahn, Yongseob Lim, Jinung An, Gyeungho Choi

    Abstract: Autonomous driving simulations require highly realistic images. Our preliminary study found that when the CARLA Simulator image was made more like reality by using DCLGAN, the performance of the lane recognition model improved to levels comparable to real-world driving. It was also confirmed that the vehicle's ability to return to the center of the lane after deviating from it improved significant… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

  50. arXiv:2306.06078  [pdf, other

    cs.CV cs.HC cs.LG eess.SP

    Cheating off your neighbors: Improving activity recognition through corroboration

    Authors: Haoxiang Yu, Jingyi An, Evan King, Edison Thomaz, Christine Julien

    Abstract: Understanding the complexity of human activities solely through an individual's data can be challenging. However, in many situations, surrounding individuals are likely performing similar activities, while existing human activity recognition approaches focus almost exclusively on individual measurements and largely ignore the context of the activity. Consider two activities: attending a small grou… ▽ More

    Submitted 27 May, 2023; originally announced June 2023.