Showing 1–50 of 85 results for author: Joo, J

  1. arXiv:2407.12401  [pdf, other]

    cs.LG cs.CV

    Geometric Remove-and-Retrain (GOAR): Coordinate-Invariant eXplainable AI Assessment

    Authors: Yong-Hyun Park, Junghoon Seo, Bomseok Park, Seongsu Lee, Junghyo Jo

    Abstract: Identifying the relevant input features that have a critical influence on the output results is indispensable for the development of explainable artificial intelligence (XAI). Remove-and-Retrain (ROAR) is a widely accepted approach for assessing the importance of individual pixels by measuring changes in accuracy following their removal and subsequent retraining of the modified dataset. However, w…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Accepted in XAI in Action Workshop @ NeurIPS 2023

  2. arXiv:2407.01034  [pdf, other]

    cs.CV cs.GR

    Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert

    Authors: Han EunGi, Oh Hyun-Bin, Kim Sung-Bin, Corentin Nivelet Etcheberry, Suekyeong Nam, Janghoon Joo, Tae-Hyun Oh

    Abstract: Speech-driven 3D facial animation has recently garnered attention due to its cost-effective usability in multimedia production. However, most current advances overlook the intelligibility of lip movements, limiting the realism of facial expressions. In this paper, we introduce a method for speech-driven 3D facial animation to generate accurate lip movements, proposing an audio-visual multimodal pe…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: INTERSPEECH 2024

  3. arXiv:2406.05761  [pdf, other]

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  4. arXiv:2406.00441  [pdf, other]

    physics.chem-ph cs.AI cs.LG

    Neural Polarization: Toward Electron Density for Molecules by Extending Equivariant Networks

    Authors: Bumju Kwak, Jeonghee Jo

    Abstract: Recent SO(3)-equivariant models embedded a molecule as a set of single atoms fixed in the three-dimensional space, which is analogous to a ball-and-stick view. This perspective provides a concise view of atom arrangements; however, the surrounding electron density cannot be represented and its polarization effects may be underestimated. To overcome this limitation, we propose \textit{Neural Polari…

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2405.19630  [pdf]

    cs.RO

    The use of a humanoid robot for older people with dementia in aged care facilities

    Authors: Dongjun Wu, Lihui Pu, Jun Jo, Rene Hexel, Wendy Moyle

    Abstract: This paper presents an interdisciplinary PhD project using a humanoid robot to encourage interactive activities for people with dementia living in two aged care facilities. The aim of the project was to develop software and use technologies to achieve successful robot-led engagement with older people with dementia. This paper outlines the qualitative findings from the project's feasibility stage.…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted for the Second Workshop on Care Robots for Older Adults (CROA), RO-MAN 2023, Busan, Korea

  6. arXiv:2404.04243  [pdf, other]

    cs.CV cs.AI

    Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models

    Authors: Sangwon Jang, Jaehyeong Jo, Kimin Lee, Sung Ju Hwang

    Abstract: Text-to-image diffusion models have shown remarkable success in generating personalized subjects based on a few reference images. However, current methods often fail when generating multiple subjects simultaneously, resulting in mixed identities with combined attributes from different subjects. In this work, we present MuDI, a novel framework that enables multi-subject personalization by effective…

    Submitted 28 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: Preprint. Project page: https://mudi-t2i.github.io/

  7. arXiv:2404.01709  [pdf, other]

    cs.CV cs.AI

    Upsample Guidance: Scale Up Diffusion Models without Training

    Authors: Juno Hwang, Yong-Hyun Park, Junghyo Jo

    Abstract: Diffusion models have demonstrated superior performance across various generative tasks including images, videos, and audio. However, they encounter difficulties in directly generating high-resolution samples. Previously proposed solutions to this issue involve modifying the architecture, further training, or partitioning the sampling process into multiple stages. These methods have the limitation…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 15 pages, 15 figures

  8. arXiv:2403.07255  [pdf, other]

    eess.SP cs.AI cs.LG

    Deep Learning-Assisted Parallel Interference Cancellation for Grant-Free NOMA in Machine-Type Communication

    Authors: Yongjeong Oh, Jaehong Jo, Byonghyo Shim, Yo-Seb Jeon

    Abstract: In this paper, we present a novel approach for joint activity detection (AD), channel estimation (CE), and data detection (DD) in uplink grant-free non-orthogonal multiple access (NOMA) systems. Our approach employs an iterative and parallel interference removal strategy inspired by parallel interference cancellation (PIC), enhanced with deep learning to jointly tackle the AD, CE, and DD problems.…

    Submitted 11 March, 2024; originally announced March 2024.

  9. arXiv:2402.13827  [pdf]

    cs.CV cs.AR

    Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting

    Authors: Joongho Jo, Hyeongwon Kim, Jongsun Park

    Abstract: 3D Gaussian splatting (3D-GS) is a new rendering approach that outperforms the neural radiance field (NeRF) in terms of both speed and image quality. 3D-GS represents 3D scenes by utilizing millions of 3D Gaussians and projects these Gaussians onto the 2D image plane for rendering. However, during the rendering process, a substantial number of unnecessary 3D Gaussians exist for the current view di…

    Submitted 21 February, 2024; originally announced February 2024.

  10. arXiv:2401.10247  [pdf, other]

    cs.CV cs.LG

    Resolution Chromatography of Diffusion Models

    Authors: Juno Hwang, Yong-Hyun Park, Junghyo Jo

    Abstract: Diffusion models generate high-resolution images through iterative stochastic processes. In particular, the denoising method is one of the most popular approaches that predicts the noise in samples and denoises it at each time step. It has been commonly observed that the resolution of generated samples changes over time, starting off blurry and coarse, and becoming sharper and finer. In this paper…

    Submitted 6 December, 2023; originally announced January 2024.

    Comments: 24 pages, 9 figures

  11. arXiv:2310.07216  [pdf, other]

    cs.LG stat.ML

    Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes

    Authors: Jaehyeong Jo, Sung Ju Hwang

    Abstract: Learning the distribution of data on Riemannian manifolds is crucial for modeling data from non-Euclidean space, which is required by many applications in diverse scientific fields. Yet, existing generative models on manifolds suffer from expensive divergence computation or rely on approximations of heat kernel. These limitations restrict their applicability to simple geometries and hinder scalabi…

    Submitted 2 June, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  12. arXiv:2310.00618  [pdf, other]

    cs.LG

    GNRK: Graph Neural Runge-Kutta method for solving partial differential equations

    Authors: Hoyun Choi, Sungyeop Lee, B. Kahng, Junghyo Jo

    Abstract: Neural networks have proven to be efficient surrogate models for tackling partial differential equations (PDEs). However, their applicability is often confined to specific PDEs under certain constraints, in contrast to classical PDE solvers that rely on numerical differentiation. Striking a balance between efficiency and versatility, this study introduces a novel approach called Graph Neural Runge…

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: 14 pages, 6 figures, 1 table

  13. arXiv:2309.05192  [pdf, other]

    cs.CV

    Towards Viewpoint Robustness in Bird's Eye View Segmentation

    Authors: Tzofi Klinghoffer, Jonah Philion, Wenzheng Chen, Or Litany, Zan Gojcic, Jungseock Joo, Ramesh Raskar, Sanja Fidler, Jose M. Alvarez

    Abstract: Autonomous vehicles (AV) require that neural networks used for perception be robust to different viewpoints if they are to be deployed across many types of vehicles without the repeated cost of data collection and labeling for each. AV companies typically focus on collecting data from diverse scenarios and locations, but not camera rig configurations, due to cost. As a result, only a small number…

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: ICCV 2023. Project Page: https://nvlabs.github.io/viewpoint-robustness

  14. arXiv:2308.00558  [pdf, other]

    cs.NE

    Gradient Scaling on Deep Spiking Neural Networks with Spike-Dependent Local Information

    Authors: Seongsik Park, Jeonghee Jo, Jongkil Park, Yeonjoo Jeong, Jaewook Kim, Suyoun Lee, Joon Young Kwak, Inho Kim, Jong-Keuk Park, Kyeong Seok Lee, Gye Weon Hwang, Hyun Jae Jang

    Abstract: Deep spiking neural networks (SNNs) are promising neural networks for their model capacity from deep neural network architecture and energy efficiency from SNNs' operations. To train deep SNNs, recently, spatio-temporal backpropagation (STBP) with surrogate gradient was proposed. Although deep SNNs have been successfully trained with STBP, they cannot fully utilize spike information. In this work,…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: ICML-23 Localized Learning Workshop: Decentralized Model Updates via Non-Global Objectives

  15. arXiv:2308.00282  [pdf, other]

    cs.LG

    ZADU: A Python Library for Evaluating the Reliability of Dimensionality Reduction Embeddings

    Authors: Hyeon Jeon, Aeri Cho, Jinhwa Jang, Soohyun Lee, Jake Hyun, Hyung-Kwon Ko, Jaemin Jo, Jinwook Seo

    Abstract: Dimensionality reduction (DR) techniques inherently distort the original structure of input high-dimensional data, producing imperfect low-dimensional embeddings. Diverse distortion measures have thus been proposed to evaluate the reliability of DR embeddings. However, implementing and executing distortion measures in practice has so far been time-consuming and tedious. To address this issue, we p…

    Submitted 11 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: 2023 IEEE Visualization and Visual Analytics (IEEE VIS 2023) Short paper

  16. arXiv:2307.12868  [pdf, other]

    cs.CV

    Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry

    Authors: Yong-Hyun Park, Mingi Kwon, Jaewoong Choi, Junghyo Jo, Youngjung Uh

    Abstract: Despite the success of diffusion models (DMs), we still lack a thorough understanding of their latent space. To understand the latent space $\mathbf{x}_t \in \mathcal{X}$, we analyze them from a geometrical perspective. Our approach involves deriving the local latent basis within $\mathcal{X}$ by leveraging the pullback metric associated with their encoding feature maps. Remarkably, our discovered…

    Submitted 26 October, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted for NeurIPS 2023

  17. arXiv:2306.16085  [pdf, other]

    cs.LG physics.chem-ph q-bio.QM

    Mass Spectra Prediction with Structural Motif-based Graph Neural Networks

    Authors: Jiwon Park, Jeonghee Jo, Sungroh Yoon

    Abstract: Mass spectra, which are agglomerations of ionized fragments from targeted molecules, play a crucial role across various fields for the identification of molecular structures. A prevalent analysis method involves spectral library searches, where unknown spectra are cross-referenced with a database. The effectiveness of such search-based approaches, however, is restricted by the scope of the existing…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 19 pages, 3 figures

  18. arXiv:2306.15919  [pdf, other]

    cs.CV cs.AI

    Fine-grained 3D object recognition: an approach and experiments

    Authors: Junhyung Jo, Hamidreza Kasaei

    Abstract: Three-dimensional (3D) object recognition technology is being used as a core technology in advanced technologies such as autonomous driving of automobiles. There are two sets of approaches for 3D object recognition: (i) hand-crafted approaches like Global Orthographic Object Descriptor (GOOD), and (ii) deep learning-based approaches such as MobileNet and VGG. However, it is needed to know which of…

    Submitted 28 June, 2023; originally announced June 2023.

  19. arXiv:2306.05732  [pdf, other]

    cs.GT math.OC

    Computing Algorithm for an Equilibrium of the Generalized Stackelberg Game

    Authors: Jaeyeon Jo, Jihwan Yu, Jinkyoo Park

    Abstract: The $1-N$ generalized Stackelberg game (single-leader multi-follower game) is intricately intertwined with the interaction between a leader and followers (hierarchical interaction) and the interaction among followers (simultaneous interaction). However, obtaining the optimal strategy of the leader is generally challenging due to the complex interactions among the leader and followers. Here, we pro…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 37 pages, 10 figures

  20. arXiv:2305.16943  [pdf, other]

    cs.LG

    DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models

    Authors: Sohyun An, Hayeon Lee, Jaehyeong Jo, Seanie Lee, Sung Ju Hwang

    Abstract: Existing NAS methods suffer from an excessive amount of time for repetitive sampling and training of many task-irrelevant architectures. To tackle such limitations of existing NAS methods, we propose a paradigm shift from NAS to a novel conditional Neural Architecture Generation (NAG) framework based on diffusion models, dubbed DiffusionNAG. Specifically, we consider the neural architecture…

    Submitted 24 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ICLR 2024

  21. arXiv:2305.14045  [pdf, other]

    cs.CL cs.AI cs.LG

    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning

    Authors: Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo

    Abstract: Language models (LMs) with less than 100B parameters are known to perform poorly on chain-of-thought (CoT) reasoning in contrast to large LMs when solving unseen tasks. In this work, we aim to equip smaller LMs with the step-by-step reasoning capability by instruction tuning with CoT rationales. In order to achieve this goal, we first introduce a new instruction-tuning dataset called the CoT Colle…

    Submitted 14 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (Main Conference)

  22. arXiv:2304.01515  [pdf, other]

    cs.LG cs.CL cs.CV

    Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models

    Authors: Jaewoong Lee, Sangwon Jang, Jaehyeong Jo, Jaehong Yoon, Yunji Kim, Jin-Hwa Kim, Jung-Woo Ha, Sung Ju Hwang

    Abstract: Token-based masked generative models are gaining popularity for their fast inference time with parallel decoding. While recent token-based approaches achieve competitive performance to diffusion-based models, their generation performance is still suboptimal as they sample multiple tokens simultaneously without considering the dependence among them. We empirically investigate this problem and propo…

    Submitted 3 April, 2023; originally announced April 2023.

    ACM Class: I.5.4; I.2.10; I.4.m

  23. arXiv:2303.05718  [pdf, other]

    cond-mat.stat-mech cs.LG

    Tradeoff of generalization error in unsupervised learning

    Authors: Gilhan Kim, Hojun Lee, Junghyo Jo, Yongjoo Baek

    Abstract: Finding the optimal model complexity that minimizes the generalization error (GE) is a key issue of machine learning. For the conventional supervised learning, this task typically involves the bias-variance tradeoff: lowering the bias by making the model more complex entails an increase in the variance. Meanwhile, little has been studied about whether the same tradeoff exists for unsupervised lear…

    Submitted 12 September, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: 15 pages, 7 figures

    Journal ref: J. Stat. Mech.: Theor. Exp. 2023, 083401 (2023)

  24. arXiv:2303.03628  [pdf, other]

    cs.CL cs.LG

    CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification

    Authors: Seungone Kim, Se June Joo, Yul Jang, Hyungjoo Chae, Jinyoung Yeo

    Abstract: Chain-of-thought (CoT) prompting enables large language models (LLMs) to solve complex reasoning tasks by generating an explanation before the final prediction. Despite its promising ability, a critical downside of CoT prompting is that the performance is greatly affected by the factuality of the generated explanation. To improve the correctness of the explanations, fine-tuning language models wi…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at EACL 2023 Demo

  25. arXiv:2303.02574  [pdf, other]

    cs.RO physics.app-ph

    Sim2Real Neural Controllers for Physics-based Robotic Deployment of Deformable Linear Objects

    Authors: Dezhong Tong, Andrew Choi, Longhui Qin, Weicheng Huang, Jungseock Joo, M. Khalid Jawed

    Abstract: Deformable linear objects (DLOs), such as rods, cables, and ropes, play important roles in daily life. However, manipulation of DLOs is challenging as large geometrically nonlinear deformations may occur during the manipulation process. This problem is made even more difficult as the different deformation modes (e.g., stretching, bending, and twisting) may result in elastic instabilities during ma…

    Submitted 10 December, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: YouTube video: https://youtu.be/OSD6dhOgyMA?feature=shared

  26. arXiv:2302.12469  [pdf, other]

    cs.CV

    Unsupervised Discovery of Semantic Latent Directions in Diffusion Models

    Authors: Yong-Hyun Park, Mingi Kwon, Junghyo Jo, Youngjung Uh

    Abstract: Despite the success of diffusion models (DMs), we still lack a thorough understanding of their latent space. While image editing with GANs builds upon latent space, DMs rely on editing the conditions such as text prompts. We present an unsupervised method to discover interpretable editing directions for the latent variables $\mathbf{x}_t \in \mathcal{X}$ of DMs. Our method adopts Riemannian geomet…

    Submitted 24 February, 2023; originally announced February 2023.

  27. mBEST: Realtime Deformable Linear Object Detection Through Minimal Bending Energy Skeleton Pixel Traversals

    Authors: Andrew Choi, Dezhong Tong, Brian Park, Demetri Terzopoulos, Jungseock Joo, Mohammad Khalid Jawed

    Abstract: Robotic manipulation of deformable materials is a challenging task that often requires realtime visual feedback. This is especially true for deformable linear objects (DLOs) or "rods", whose slender and flexible structures make proper tracking and detection nontrivial. To address this challenge, we present mBEST, a robust algorithm for the realtime detection of DLOs that is capable of producing an…

    Submitted 19 February, 2024; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: IEEE Robotics and Automation Letters (RA-L 2023). YouTube video: https://youtu.be/q84I9i0DOK4

  28. arXiv:2302.03596  [pdf, other]

    cs.LG

    Graph Generation with Diffusion Mixture

    Authors: Jaehyeong Jo, Dongki Kim, Sung Ju Hwang

    Abstract: Generation of graphs is a major challenge for real-world tasks that require understanding the complex nature of their non-Euclidean structures. Although diffusion models have achieved notable success in graph generation recently, they are ill-suited for modeling the topological properties of graphs since learning to denoise the noisy samples does not explicitly learn the graph structures to be gen…

    Submitted 2 June, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: ICML 2024

  29. Learning Neural Force Manifolds for Sim2Real Robotic Symmetrical Paper Folding

    Authors: Andrew Choi, Dezhong Tong, Demetri Terzopoulos, Jungseock Joo, M. Khalid Jawed

    Abstract: Robotic manipulation of slender objects is challenging, especially when the induced deformations are large and nonlinear. Traditionally, learning-based control approaches, such as imitation learning, have been used to address deformable material manipulation. These approaches lack generality and often suffer critical failure from a simple switch of material, geometric, and/or environmental (e.g.,…

    Submitted 19 February, 2024; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: IEEE Transactions on Automation Science and Engineering (T-ASE 2024). First two authors have equal contribution. Supplementary video is available on YouTube: https://youtu.be/k0nexYGy-P4

  30. arXiv:2212.03414  [pdf, other]

    cs.DC cs.LG

    DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads

    Authors: Seah Kim, Hyoukjun Kwon, Jinook Song, Jihyuck Jo, Yu-Hsin Chen, Liangzhen Lai, Vikas Chandra

    Abstract: Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors at various granularities: task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML system since the overall system load is not completely predictable, unlike traditional ML workloads. In addition, RTMM workloads require real-time…

    Submitted 20 September, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 14 pages

  31. arXiv:2211.15880  [pdf, other]

    cs.LG math.OC

    Mirror descent of Hopfield model

    Authors: Hyungjoon Soh, Dongyeob Kim, Juno Hwang, Junghyo Jo

    Abstract: Mirror descent is an elegant optimization technique that leverages a dual space of parametric models to perform gradient descent. While originally developed for convex optimization, it has increasingly been applied in the field of machine learning. In this study, we propose a novel approach for utilizing mirror descent to initialize the parameters of neural networks. Specifically, we demonstrate t…

    Submitted 9 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 3 figures

  32. arXiv:2211.08170  [pdf, other]

    cs.CL cs.DB cs.IR cs.LG

    A Comparative Study of Question Answering over Knowledge Bases

    Authors: Khiem Vinh Tran, Hao Phu Phan, Khang Nguyen Duc Quach, Ngan Luu-Thuy Nguyen, Jun Jo, Thanh Tam Nguyen

    Abstract: Question answering over knowledge bases (KBQA) has become a popular approach to help users extract information from knowledge bases. Although several systems exist, choosing one suitable for a particular application scenario is difficult. In this article, we provide a comparative study of six representative KBQA systems on eight benchmark datasets. In that, we study various question types, propert…

    Submitted 15 November, 2022; originally announced November 2022.

  33. arXiv:2210.09394  [pdf]

    cs.AI cs.LG

    Review Learning: Alleviating Catastrophic Forgetting with Generative Replay without Generator

    Authors: Jaesung Yoo, Sunghyuk Choi, Ye Seul Yang, Suhyeon Kim, Jieun Choi, Dongkyeong Lim, Yaeji Lim, Hyung Joon Joo, Dae Jung Kim, Rae Woong Park, Hyeong-Jin Yoon, Kwangsoo Kim

    Abstract: When a deep learning model is sequentially trained on different datasets, it forgets the knowledge acquired from previous data, a phenomenon known as catastrophic forgetting. It deteriorates performance of the deep learning model on diverse datasets, which is critical in privacy-preserving deep learning (PPDL) applications based on transfer learning (TL). To overcome this, we propose review learni…

    Submitted 17 October, 2022; originally announced October 2022.

  34. Large-scale Text-to-Image Generation Models for Visual Artists' Creative Works

    Authors: Hyung-Kwon Ko, Gwanmo Park, Hyeon Jeon, Jaemin Jo, Juho Kim, Jinwook Seo

    Abstract: Large-scale Text-to-image Generation Models (LTGMs) (e.g., DALL-E), self-supervised deep learning models trained on a huge dataset, have demonstrated the capacity for generating high-quality open-domain images from multi-modal input. Although they can even produce anthropomorphized versions of objects and animals, combine irrelevant concepts in reasonable ways, and give variation to any user-provi…

    Submitted 16 February, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: 15 pages, 3 figures

    Journal ref: 28th International Conference on Intelligent User Interfaces (IUI '23), March 27--31, 2023, Sydney, NSW, Australia

  35. arXiv:2209.00930  [pdf, other]

    cs.CL cs.AI cs.LG

    Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization

    Authors: Seungone Kim, Se June Joo, Hyungjoo Chae, Chaehyeong Kim, Seung-won Hwang, Jinyoung Yeo

    Abstract: In this paper, we propose to leverage the unique characteristics of dialogues sharing commonsense knowledge across participants, to resolve the difficulties in summarizing them. We present SICK, a framework that uses commonsense inferences as additional context. Compared to previous work that solely relies on the input dialogue, SICK uses an external knowledge model to generate a rich set of commo…

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted at COLING 2022

  36. arXiv:2207.10888  [pdf, other]

    cs.CV cs.AI cs.LG

    FairGRAPE: Fairness-aware GRAdient Pruning mEthod for Face Attribute Classification

    Authors: Xiaofeng Lin, Seungbae Kim, Jungseock Joo

    Abstract: Existing pruning techniques preserve deep neural networks' overall ability to make correct predictions but may also amplify hidden biases during the compression process. We propose a novel pruning method, Fairness-aware GRAdient Pruning mEthod (FairGRAPE), that minimizes the disproportionate impacts of pruning on different sub-groups. Our method calculates the per-group importance of each model we…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: To appear in ECCV 2022

  37. arXiv:2207.08098  [pdf, other]

    cs.SI cs.AI cs.LG

    Model-Agnostic and Diverse Explanations for Streaming Rumour Graphs

    Authors: Thanh Tam Nguyen, Thanh Cong Phan, Minh Hieu Nguyen, Matthias Weidlich, Hongzhi Yin, Jun Jo, Quoc Viet Hung Nguyen

    Abstract: The propagation of rumours on social media poses an important threat to societies, so that various techniques for rumour detection have been proposed recently. Yet, existing work focuses on \emph{what} entities constitute a rumour, but provides little support to understand \emph{why} the entities have been classified as such. This prevents an effective evaluation of the detected rumours as well as…

    Submitted 17 July, 2022; originally announced July 2022.

  38. arXiv:2206.07632  [pdf, other]

    q-bio.BM cs.LG physics.chem-ph

    Exploring Chemical Space with Score-based Out-of-distribution Generation

    Authors: Seul Lee, Jaehyeong Jo, Sung Ju Hwang

    Abstract: A well-known limitation of existing molecular generative models is that the generated molecules highly resemble those in the training set. To generate truly novel molecules that may have even better properties for de novo drug discovery, more powerful exploration in the chemical space is necessary. To this end, we propose Molecular Out-Of-distribution Diffusion (MOOD), a score-based diffusion schem…

    Submitted 3 June, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: ICML 2023

  39. arXiv:2206.07578   

    cs.CV cs.LG eess.IV

    E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential Equations

    Authors: Jongwan Kim, DongJin Lee, Byunggook Na, Seongsik Park, Jeonghee Jo, Sungroh Yoon

    Abstract: Event cameras respond to brightness changes in the scene asynchronously and independently for every pixel. Due to the properties, these cameras have distinct features: high dynamic range (HDR), high temporal resolution, and low power consumption. However, the results of event cameras should be processed into an alternative representation for computer vision tasks. Also, they are usually noisy and…

    Submitted 13 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: This submission has been withdrawn by arXiv administrators due to inappropriate text overlap with external sources. Additional information at https://doi.org/10.1109/CVPR52688.2022.01319

    Journal ref: The IEEE / CVF Computer Vision and Pattern Recognition Conference 2022

  40. A Fully Implicit Method for Robust Frictional Contact Handling in Elastic Rods

    Authors: Dezhong Tong, Andrew Choi, Jungseock Joo, M. Khalid Jawed

    Abstract: Accurate frictional contact is critical in simulating the assembly of rod-like structures in the practical world, such as knots, hairs, flagella, and more. Due to their high geometric nonlinearity and elasticity, rod-on-rod contact remains a challenging problem tackled by researchers in both computational mechanics and computer graphics. Typically, frictional contact is regarded as constraints for…

    Submitted 19 February, 2024; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Extreme Mechanics Letters (EML 2023). First two authors have equal contribution. A video summarizing this work is available on YouTube: https://youtu.be/g0rlCFfWJ8U

  41. Uniform Manifold Approximation with Two-phase Optimization

    Authors: Hyeon Jeon, Hyung-Kwon Ko, Soohyun Lee, Jaemin Jo, Jinwook Seo

    Abstract: We introduce Uniform Manifold Approximation with Two-phase Optimization (UMATO), a dimensionality reduction (DR) technique that improves UMAP to capture the global structure of high-dimensional data more accurately. In UMATO, optimization is divided into two phases so that the resulting embeddings can depict the global structure reliably while preserving the local structure with sufficient accurac…

    Submitted 11 July, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: IEEE VIS 2022. Hyeon Jeon and Hyung-Kwon Ko equally contributed to this work

  42. arXiv:2204.04601  [pdf, other]

    cs.CV cs.AI cs.LG

    Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention

    Authors: Yu Yang, Seungbae Kim, Jungseock Joo

    Abstract: Interpretability is an important property for visual models as it helps researchers and users understand the internal mechanism of a complex model. However, generating semantic explanations about the learned representation is challenging without direct supervision to produce such explanations. We propose a general framework, Latent Visual Semantic Explainer (LaViSE), to teach any existing convolut…

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: To appear in CVPR 2022 (oral presentation)

  43. arXiv:2203.00156  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Preemptive Motion Planning for Human-to-Robot Indirect Placement Handovers

    Authors: Andrew Choi, Mohammad Khalid Jawed, Jungseock Joo

    Abstract: As technology advances, the need for safe, efficient, and collaborative human-robot-teams has become increasingly important. One of the most fundamental collaborative tasks in any setting is the object handover. Human-to-robot handovers can take either of two approaches: (1) direct hand-to-hand or (2) indirect hand-to-placement-to-pick-up. The latter approach ensures minimal contact between the hu…

    Submitted 19 February, 2024; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: IEEE International Conference on Robotics and Automation (ICRA 2022). Supplementary videos: https://pmp-human-to-robot.github.io/

  44. arXiv:2202.02514  [pdf, other]

    cs.LG

    Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations

    Authors: Jaehyeong Jo, Seul Lee, Sung Ju Hwang

    Abstract: Generating graph-structured data requires learning the underlying distribution of graphs. Yet, this is a challenging problem, and the previous graph generative methods either fail to capture the permutation-invariance property of graphs or cannot sufficiently model the complex dependency between nodes and edges, which is crucial for generating real-world graphs such as molecules. To overcome such…

    Submitted 15 June, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  45. arXiv:2111.14210  [pdf, other]

    cs.CL cs.CV

    Emergent Graphical Conventions in a Visual Communication Game

    Authors: Shuwen Qiu, Sirui Xie, Lifeng Fan, Tao Gao, Jungseock Joo, Song-Chun Zhu, Yixin Zhu

    Abstract: Humans communicate with graphical sketches apart from symbolic languages. Primarily focusing on the latter, recent studies of emergent communication overlook the sketches; they do not account for the evolution process through which symbolic sign systems emerge in the trade-off between iconicity and symbolicity. In this work, we take the very first step to model and simulate this process via two ne…

    Submitted 23 February, 2023; v1 submitted 28 November, 2021; originally announced November 2021.

  46. arXiv:2110.06620  [pdf, other]

    cs.CL cs.LG

    Maximizing Efficiency of Language Model Pre-training for Learning Representation

    Authors: Junmo Kang, Suwon Shin, Jeonghwan Kim, Jaeyoung Jo, Sung-Hyon Myaeng

    Abstract: Pre-trained language models in the past years have shown exponential growth in model parameters and compute time. ELECTRA is a novel approach for improving the compute efficiency of pre-trained language models (e.g. BERT) based on masked language modeling (MLM) by addressing the sample inefficiency problem with the replaced token detection (RTD) task. Our work proposes adaptive early exit strategy…

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: Published in KSC 2020

  47. arXiv:2109.02914  [pdf, other]

    cs.LG cs.IT physics.data-an

    Scale-invariant representation of machine learning

    Authors: Sungyeop Lee, Junghyo Jo

    Abstract: The success of machine learning has resulted from its structured representation of data. Similar data have close internal representations as compressed codes for classification or emerged labels for clustering. We observe that the frequency of internal codes or labels follows power laws in both supervised and unsupervised learning models. This scale-invariant distribution implies that machine lear…

    Submitted 23 March, 2022; v1 submitted 7 September, 2021; originally announced September 2021.

  48. arXiv:2108.10031  [pdf, other]

    cs.CV

    Realistic Image Synthesis with Configurable 3D Scene Layouts

    Authors: Jaebong Jeong, Janghun Jo, Jingdong Wang, Sunghyun Cho, Jaesik Park

    Abstract: Recent conditional image synthesis approaches provide high-quality synthesized images. However, it is still challenging to accurately adjust image contents such as the positions and orientations of objects, and synthesized images often have geometrically invalid contents. To provide users with rich controllability on synthesized images in the aspect of 3D geometry, we propose a novel approach to r…

    Submitted 24 August, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: paper: 9 pages, supplementary materials: 7 pages

  49. arXiv:2108.08504  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Understanding and Mitigating Annotation Bias in Facial Expression Recognition

    Authors: Yunliang Chen, Jungseock Joo

    Abstract: The performance of a computer vision model depends on the size and quality of its training data. Recent studies have unveiled previously-unknown composition biases in common image datasets which then lead to skewed model outputs, and have proposed methods to mitigate these biases. However, most existing works assume that human-generated annotations can be considered gold-standard and unbiased. In…

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: To appear in ICCV 2021

  50. arXiv:2108.02846  [pdf, other]

    cs.AI cs.CV cs.HC cs.LG cs.RO

    Communicative Learning with Natural Gestures for Embodied Navigation Agents with Human-in-the-Scene

    Authors: Qi Wu, Cheng-Ju Wu, Yixin Zhu, Jungseock Joo

    Abstract: Human-robot collaboration is an essential research topic in artificial intelligence (AI), enabling researchers to devise cognitive AI systems and affords an intuitive means for users to interact with the robot. Of note, communication plays a central role. To date, prior studies in embodied agent navigation have only demonstrated that human languages facilitate communication by instructions in natu…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: To appear in IROS 2021