Showing 1–19 of 19 results for author: Rim, D

  1. arXiv:2407.05734  [pdf, other]

    cs.CL

    Empirical Study of Symmetrical Reasoning in Conversational Chatbots

    Authors: Daniela N. Rim, Heeyoul Choi

    Abstract: This work explores the capability of conversational chatbots powered by large language models (LLMs), to understand and characterize predicate symmetry, a cognitive linguistic function traditionally believed to be an inherent human trait. Leveraging in-context learning (ICL), a paradigm shift enabling chatbots to learn new tasks from prompts without re-training, we assess the symmetrical reasoning…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted in Future Technology Conference (FTC) 2024

  2. arXiv:2407.03631  [pdf, other]

    cs.CE cs.LG physics.data-an physics.geo-ph

    On the performance of sequential Bayesian update for database of diverse tsunami scenarios

    Authors: Reika Nomura, Louise A. Hirao Vermare, Saneiki Fujita, Donsub Rim, Shuji Moriguchi, Randall J. LeVeque, Kenjiro Terada

    Abstract: Although the sequential tsunami scenario detection framework was validated in our previous work, several tasks remain to be resolved from a practical point of view. This study aims to evaluate the performance of the previous tsunami scenario detection framework using a diverse database consisting of complex fault rupture patterns with heterogeneous slip distributions. Specifically, we compare the…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 15 pages, 12 figures

  3. arXiv:2406.14091  [pdf, other]

    cs.CL

    Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models

    Authors: Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo

    Abstract: Although language models (LMs) demonstrate exceptional capabilities on various tasks, they are potentially vulnerable to extraction attacks, which represent a significant privacy risk. To mitigate the privacy concerns of LMs, machine unlearning has emerged as an important research area, which is utilized to induce the LM to selectively forget about some of its training data. While completely retra…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL2024 findings

  4. arXiv:2406.12329  [pdf, other]

    cs.CL

    SNAP: Unlearning Selective Knowledge in Large Language Models with Negative Instructions

    Authors: Minseok Choi, Daniel Rim, Dohyun Lee, Jaegul Choo

    Abstract: Instruction-following large language models (LLMs), such as ChatGPT, have become increasingly popular with the general audience, many of whom are incorporating them into their daily routines. However, these LLMs inadvertently disclose personal or copyrighted information, which calls for a machine unlearning method to remove selective knowledge. Previous attempts sought to forget the link between t…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures

  5. arXiv:2406.05694  [pdf, other]

    math.NA cs.LG

    A Low Rank Neural Representation of Entropy Solutions

    Authors: Donsub Rim, Gerrit Welper

    Abstract: We construct a new representation of entropy solutions to nonlinear scalar conservation laws with a smooth convex flux function in a single spatial dimension. The representation is a generalization of the method of characteristics and possesses a compositional form. While it is a nonlinear representation, the embedded dynamics of the solution in the time variable is linear. This representation is t…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 42 pages, 9 figures

    MSC Class: 68T07; 41A46; 41A25; 65N15; 35L65

  6. Enhanced Labeling Technique for Reddit Text and Fine-Tuned Longformer Models for Classifying Depression Severity in English and Luganda

    Authors: Richard Kimera, Daniela N. Rim, Joseph Kirabira, Ubong Godwin Udomah, Heeyoul Choi

    Abstract: Depression is a global burden and one of the most challenging mental health conditions to control. Experts can detect its severity early using the Beck Depression Inventory (BDI) questionnaire, administer appropriate medication to patients, and impede its progression. Due to the fear of potential stigmatization, many patients turn to social media platforms like Reddit for advice and assistance at…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: In IEEE Proceedings of the 14th International Conference on ICT Convergence (ICTC), Jeju, Korea, October 2023

  7. arXiv:2312.08553  [pdf, other]

    eess.AS cs.SD

    USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

    Authors: Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal

    Abstract: End-to-end automatic speech recognition (ASR) models have seen revolutionary quality gains with the recent development of large-scale universal speech models (USM). However, deploying these massive USMs is extremely expensive due to the enormous memory usage and computational cost. Therefore, model compression is an important research topic to fit USM-based ASR under budget in real-world scenarios…

    Submitted 16 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024. Preprint

  8. arXiv:2310.09528  [pdf, other]

    cs.LG math.NA physics.comp-ph

    Hypernetwork-based Meta-Learning for Low-Rank Physics-Informed Neural Networks

    Authors: Woojin Cho, Kookjin Lee, Donsub Rim, Noseong Park

    Abstract: In various engineering and applied science applications, repetitive numerical simulations of partial differential equations (PDEs) for varying input parameters are often required (e.g., aircraft shape optimization over many design parameters) and solvers are required to perform rapid execution. In this study, we suggest a path that potentially opens up a possibility for physics-informed neural net…

    Submitted 14 October, 2023; originally announced October 2023.

  9. arXiv:2308.08153  [pdf, other]

    cs.CL

    Fast Training of NMT Model with Data Sorting

    Authors: Daniela N. Rim, Kimera Richard, Heeyoul Choi

    Abstract: The Transformer model has revolutionized Natural Language Processing tasks such as Neural Machine Translation, and many efforts have been made to study the Transformer architecture, which increased its efficiency and accuracy. One potential area for improvement is to address the computation of empty tokens that the Transformer computes only to discard them later, leading to an unnecessary computat…

    Submitted 16 August, 2023; originally announced August 2023.

  10. arXiv:2305.15536  [pdf, other]

    eess.AS cs.LG

    RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models

    Authors: David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He

    Abstract: With the rapid increase in the size of neural networks, model compression has become an important area of research. Quantization is an effective technique at decreasing the model size, memory access, and compute load of large models. Despite recent advances in quantization aware training (QAT) technique, most papers present evaluations that are focused on computer vision tasks, which have differen…

    Submitted 24 May, 2023; originally announced May 2023.

  11. arXiv:2305.04720  [pdf, other]

    cs.CL

    DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation

    Authors: ChaeHun Park, Seungil Chad Lee, Daniel Rim, Jaegul Choo

    Abstract: Despite the recent advances in open-domain dialogue systems, building a reliable evaluation metric is still a challenging problem. Recent studies proposed learnable metrics based on classification models trained to distinguish the correct response. However, neural classifiers are known to make overly confident predictions for examples from unseen distributions. We propose DEnsity, which evaluates…

    Submitted 25 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Findings of ACL2023. 13 pages

  12. Building a Parallel Corpus and Training Translation Models Between Luganda and English

    Authors: Richard Kimera, Daniela N. Rim, Heeyoul Choi

    Abstract: Neural machine translation (NMT) has achieved great successes with large datasets, so NMT is more premised on high-resource languages. This continuously underpins the low resource languages such as Luganda due to the lack of high-quality parallel corpora, so even 'Google translate' does not serve Luganda at the time of this writing. In this paper, we build a parallel corpus with 41,070 pairwise se…

    Submitted 6 January, 2023; originally announced January 2023.

    Journal ref: Journal of KIISE, Vol. 49, No. 11, pp. 1009-1016, 2022. 11

  13. arXiv:2210.04958  [pdf, other]

    cs.LG stat.ME

    Mining Causality from Continuous-time Dynamics Models: An Application to Tsunami Forecasting

    Authors: Fan Wu, Sanghyun Hong, Donsub Rim, Noseong Park, Kookjin Lee

    Abstract: Continuous-time dynamics models, such as neural ordinary differential equations, have enabled the modeling of underlying dynamics in time-series data and accurate forecasting. However, parameterization of dynamics using a neural network makes it difficult for humans to identify causal structures in the data. In consequence, this opaqueness hinders the use of these models in the domains where captu…

    Submitted 13 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

  14. arXiv:2109.09075  [pdf, other]

    cs.CL

    Adversarial Training with Contrastive Learning in NLP

    Authors: Daniela N. Rim, DongNyeong Heo, Heeyoul Choi

    Abstract: For years, adversarial training has been extensively studied in natural language processing (NLP) settings. The main goal is to make models robust so that similar inputs derive in semantically similar outcomes, which is not a trivial problem since there is no objective measure of semantic similarity in language. Previous works use an external pre-trained NLP model to tackle this challenge, introdu…

    Submitted 19 September, 2021; originally announced September 2021.

  15. arXiv:2105.11681  [pdf, other]

    cs.LG cs.SD eess.AS

    Deep Neural Networks and End-to-End Learning for Audio Compression

    Authors: Daniela N. Rim, Inseon Jang, Heeyoul Choi

    Abstract: Recent achievements in end-to-end deep learning have encouraged the exploration of tasks dealing with highly structured data with unified deep network models. Having such models for compressing audio signals has been challenging since it requires discrete representations that are not easy to train with end-to-end backpropagation. In this paper, we present an end-to-end deep learning approach that…

    Submitted 13 July, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  16. arXiv:2010.05360  [pdf, other]

    math.NA cs.CV

    A range characterization of the single-quadrant ADRT

    Authors: Weilin Li, Kui Ren, Donsub Rim

    Abstract: This work characterizes the range of the single-quadrant approximate discrete Radon transform (ADRT) of square images. The characterization follows from a set of linear constraints on the codomain. We show that for data satisfying these constraints, the exact and fast inversion formula [Rim, Appl. Math. Lett. 102 106159, 2020] yields a square image in a stable manner. The range characterization is…

    Submitted 22 March, 2022; v1 submitted 11 October, 2020; originally announced October 2020.

    MSC Class: 44A12; 65R10; 92C55; 68U05; 15A04

  17. arXiv:2007.13977  [pdf, other]

    math.NA cs.LG

    Depth separation for reduced deep networks in nonlinear model reduction: Distilling shock waves in nonlinear hyperbolic problems

    Authors: Donsub Rim, Luca Venturi, Joan Bruna, Benjamin Peherstorfer

    Abstract: Classical reduced models are low-rank approximations using a fixed basis designed to achieve dimensionality reduction of large-scale systems. In this work, we introduce reduced deep networks, a generalization of classical reduced models formulated as deep neural networks. We prove depth separation results showing that reduced deep networks approximate solutions of parametrized hyperbolic partial d…

    Submitted 27 July, 2020; originally announced July 2020.

    MSC Class: 68T07; 65M22; 41A46

  18. arXiv:1908.00887  [pdf, ps, other]

    math.NA cs.CC cs.CV

    Exact and fast inversion of the approximate discrete Radon transform from partial data

    Authors: Donsub Rim

    Abstract: We give an exact inversion formula for the approximate discrete Radon transform introduced in [Brady, SIAM J. Comput., 27(1), 107--119] that is of cost $O(N \log N)$ for a square 2D image with $N$ pixels and requires only partial data.

    Submitted 18 May, 2020; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: 4 pages, 1 figure

    MSC Class: 44A12; 65R10; 65F05; 65Q30

    Journal ref: Appl. Math. Lett. 102 106159 (2020)

  19. arXiv:1512.08212  [pdf, other]

    cs.CV

    Improving Facial Analysis and Performance Driven Animation through Disentangling Identity and Expression

    Authors: David Rim, Sina Honari, Md Kamrul Hasan, Chris Pal

    Abstract: We present techniques for improving performance driven facial animation, emotion recognition, and facial key-point or landmark prediction using learned identity invariant representations. Established approaches to these problems can work well if sufficient examples and labels for a particular identity are available and factors of variation are highly controlled. However, labeled examples of facial…

    Submitted 22 May, 2016; v1 submitted 27 December, 2015; originally announced December 2015.

    Comments: to appear in Image and Vision Computing Journal (IMAVIS)