Showing 1–50 of 204 results for author: Chakraborty, T

  1. arXiv:2407.04561  [pdf, other]

    cs.NI eess.SP

    Wireless Spectrum in Rural Farmlands: Status, Challenges and Opportunities

    Authors: Mukaram Shahid, Kunal Das, Taimoor Ul Islam, Christ Somiah, Daji Qiao, Arsalan Ahmad, Jimming Song, Zhengyuan Zhu, Sarath Babu, Yong Guan, Tusher Chakraborty, Suraj Jog, Ranveer Chandra, Hongwei Zhang

    Abstract: Due to factors such as low population density and expansive geographical distances, network deployment falls behind in rural regions, leading to a broadband divide. Wireless spectrum serves as the blood and flesh of wireless communications. Shared white spaces such as those in the TVWS and CBRS spectrum bands offer opportunities to expand connectivity, innovate, and provide affordable access to hi…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2407.04465  [pdf, ps, other]

    stat.AP cs.SI physics.data-an

    Learning Patterns from Biological Networks: A Compounded Burr Probability Model

    Authors: Tanujit Chakraborty, Shraddha M. Naik, Swarup Chattopadhyay, Suchismita Das

    Abstract: Complex biological networks, comprising metabolic reactions, gene interactions, and protein interactions, often exhibit scale-free characteristics with power-law degree distributions. However, empirical studies have revealed discrepancies between observed biological network data and ideal power-law fits, highlighting the need for improved modeling approaches. To address this challenge, we propose…

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2407.04440  [pdf, ps, other]

    cs.LG cs.NE

    Wavelet-based Temporal Attention Improves Traffic Forecasting

    Authors: Yash Jakhmola, Nitish Kumar Mishra, Kripabandhu Ghosh, Tanujit Chakraborty

    Abstract: Spatio-temporal forecasting of traffic flow data represents a typical problem in the field of machine learning, impacting urban traffic management systems. Traditional statistical and machine learning methods cannot adequately handle both the temporal and spatial dependencies in these complex traffic flow datasets. A prevalent approach in the field is to combine graph convolutional networks and mu…

    Submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2407.02268  [pdf, other]

    cs.CR cs.AI

    Footprints of Data in a Classifier Model: The Privacy Issues and Their Mitigation through Data Obfuscation

    Authors: Payel Sadhukhan, Tanujit Chakraborty

    Abstract: The avalanche of AI deployment and its security-privacy concerns are two sides of the same coin. Article 17 of GDPR calls for the Right to Erasure; data has to be obliterated from a system to prevent its compromise. Extant research in this aspect focuses on effacing sensitive data attributes. However, several passive modes of data compromise are yet to be recognized and redressed. The embedding of…

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2407.00453  [pdf, other]

    cs.CL cs.LG

    PerSEval: Assessing Personalization in Text Summarizers

    Authors: Sourish Dasgupta, Ankush Chander, Parth Borad, Isha Motiyani, Tanmoy Chakraborty

    Abstract: Personalized summarization models cater to individuals' subjective understanding of saliency, as represented by their reading history and current topics of attention. Existing personalized text summarizers are primarily evaluated based on accuracy measures such as BLEU, ROUGE, and METEOR. However, a recent study argued that accuracy measures are inadequate for evaluating the degree of personalizat…

    Submitted 29 June, 2024; originally announced July 2024.

  6. arXiv:2406.18812  [pdf, other]

    cs.RO cs.AI

    A Survey on Privacy Attacks Against Digital Twin Systems in AI-Robotics

    Authors: Ivan A. Fernandez, Subash Neupane, Trisha Chakraborty, Shaswata Mitra, Sudip Mittal, Nisha Pillai, Jingdao Chen, Shahram Rahimi

    Abstract: Industry 4.0 has witnessed the rise of complex robots fueled by the integration of Artificial Intelligence/Machine Learning (AI/ML) and Digital Twin (DT) technologies. While these technologies offer numerous benefits, they also introduce potential privacy and security risks. This paper surveys privacy attacks targeting robots enabled by AI and DT models. Exfiltration and data leakage of ML models…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 10 pages, 3 figures, 1 table

  7. arXiv:2406.03953  [pdf, other]

    cs.CL

    Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech

    Authors: Neemesh Yadav, Sarah Masud, Vikram Goyal, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: Employing language models to generate explanations for an incoming implicit hate post is an active area of research. The explanation is intended to make explicit the underlying stereotype and aid content moderators. The training often combines top-k relevant knowledge graph (KG) tuples to provide world knowledge and improve performance on standard metrics. Interestingly, our study presents conflic…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 17 Pages, 5 Figures, 13 Tables, ACL Findings 2024

  8. arXiv:2406.02575  [pdf, other]

    cs.CL cs.CR cs.LG

    Cross-Modal Safety Alignment: Is textual unlearning all you need?

    Authors: Trishna Chakraborty, Erfan Shayegani, Zikui Cai, Nael Abu-Ghazaleh, M. Salman Asif, Yue Dong, Amit K. Roy-Chowdhury, Chengyu Song

    Abstract: Recent studies reveal that integrating new modalities into Large Language Models (LLMs), such as Vision-Language Models (VLMs), creates a new attack surface that bypasses existing safety training techniques like Supervised Fine-tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). While further SFT and RLHF-based safety training can be conducted in multi-modal settings, collecting mu…

    Submitted 27 May, 2024; originally announced June 2024.

  9. arXiv:2405.16616  [pdf, other]

    cs.LG cs.SI

    DPHGNN: A Dual Perspective Hypergraph Neural Networks

    Authors: Siddhant Saxena, Shounak Ghatak, Raghu Kolla, Debashis Mukherjee, Tanmoy Chakraborty

    Abstract: Message passing on hypergraphs has been a standard framework for learning higher-order correlations between hypernodes. Recently-proposed hypergraph neural networks (HGNNs) can be categorized into spatial and spectral methods based on their design choices. In this work, we analyze the impact of change in hypergraph topology on the suboptimal performance of HGNNs and propose DPHGNN, a novel dual-pe…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted in SIGKDD'24 -- Research Track

  10. arXiv:2405.11215  [pdf, other]

    cs.CL cs.CY

    MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing

    Authors: Siddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty

    Abstract: Memes have evolved as a prevalent medium for diverse communication, ranging from humour to propaganda. With the rising popularity of image-focused content, there is a growing need to explore its potential harm from different aspects. Previous studies have analyzed memes in closed settings - detecting harm, applying semantic labels, and offering natural language explanations. To extend this researc…

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: The paper has been accepted in ACL'24 (Findings)

  11. arXiv:2405.10548  [pdf, other]

    cs.CL

    Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks

    Authors: Anwoy Chatterjee, Eshaan Tanwar, Subhabrata Dutta, Tanmoy Chakraborty

    Abstract: Large Language Models (LLMs) have transformed NLP with their remarkable In-context Learning (ICL) capabilities. Automated assistants based on LLMs are gaining popularity; however, adapting them to novel tasks is still challenging. While colossal models excel in zero-shot performance, their computational demands limit widespread use, and smaller language models struggle without context. This paper…

    Submitted 12 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL 2024 Main

  12. arXiv:2405.01858  [pdf, other]

    cs.CL cs.CY

    SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India

    Authors: Salam Michael Singh, Shubhmoy Kumar Garg, Amitesh Misra, Aaditeshwar Seth, Tanmoy Chakraborty

    Abstract: Sexual education aims to foster a healthy lifestyle in terms of emotional, mental and social well-being. In countries like India, where adolescents form the largest demographic group, they face significant vulnerabilities concerning sexual health. Unfortunately, sexual education is often stigmatized, creating barriers to providing essential counseling and information to this at-risk population. Co…

    Submitted 3 May, 2024; originally announced May 2024.

  13. arXiv:2404.05482  [pdf, other]

    cs.LG

    WaveCatBoost for Probabilistic Forecasting of Regional Air Quality Data

    Authors: Jintu Borah, Tanujit Chakraborty, Md. Shahrul Md. Nadzir, Mylene G. Cayetano, Shubhankar Majumdar

    Abstract: Accurate and reliable air quality forecasting is essential for protecting public health, sustainable development, pollution control, and enhanced urban planning. This letter presents a novel WaveCatBoost architecture designed to forecast the real-time concentrations of air pollutants by combining the maximal overlapping discrete wavelet transform (MODWT) with the CatBoost model. This hybrid approa…

    Submitted 8 April, 2024; originally announced April 2024.

  14. arXiv:2404.02255  [pdf, other]

    cs.CL cs.AI

    $\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning

    Authors: Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty

    Abstract: Despite demonstrating emergent reasoning abilities, Large Language Models (LLMs) often lose track of complex, multi-step reasoning. Existing studies show that providing guidance via decomposing the original question into multiple subproblems elicits more robustness in LLM reasoning -- a decomposer generates the subproblems, and a solver solves each of these subproblems. However, these techniques f…

    Submitted 2 April, 2024; originally announced April 2024.

  15. arXiv:2403.16771  [pdf]

    cs.CL cs.LG

    Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation

    Authors: Kartik Kartik, Sanjana Soni, Anoop Kunchukuttan, Tanmoy Chakraborty, Md Shad Akhtar

    Abstract: The widespread online communication in a modern multilingual world has provided opportunities to blend more than one language (aka code-mixed language) in a single utterance. This has resulted in a formidable challenge for the computational models due to the scarcity of annotated data and presence of noise. A potential solution to mitigate the data scarcity problem in low-resource setup is to leverag…

    Submitted 29 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 9 pages, 2 figures, to be published in LREC-COLING 2024

  16. arXiv:2403.10279  [pdf, other]

    cs.CY

    Emotion-Aware Multimodal Fusion for Meme Emotion Detection

    Authors: Shivam Sharma, Ramaneswaran S, Md. Shad Akhtar, Tanmoy Chakraborty

    Abstract: The ever-evolving social media discourse has witnessed an overwhelming use of memes to express opinions or dissent. Besides being misused for spreading malcontent, they are mined by corporations and political parties to glean the public's opinion. Therefore, memes predominantly offer affect-enriched insights towards ascertaining the societal psyche. However, the current approaches are yet to model…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to IEEE Transactions on Affective Computing

  17. arXiv:2403.06999  [pdf]

    cs.LG cs.AI cs.CY

    Survival modeling using deep learning, machine learning and statistical methods: A comparative analysis for predicting mortality after hospital admission

    Authors: Ziwen Wang, Jin Wee Lee, Tanujit Chakraborty, Yilin Ning, Mingxuan Liu, Feng Xie, Marcus Eng Hock Ong, Nan Liu

    Abstract: Survival analysis is essential for studying time-to-event outcomes and providing a dynamic understanding of the probability of an event occurring over time. Various survival analysis techniques, from traditional statistical models to state-of-the-art machine learning algorithms, support healthcare intervention and policy decisions. However, there remains ongoing discussion about their comparative…

    Submitted 4 March, 2024; originally announced March 2024.

  18. arXiv:2403.03876  [pdf, other]

    cs.DC

    A Survey on Adversarial Contention Resolution

    Authors: Ioana Banicescu, Trisha Chakraborty, Seth Gilbert, Maxwell Young

    Abstract: Contention resolution addresses the challenge of coordinating access by multiple processes to a shared resource such as memory, disk storage, or a communication channel. Originally spurred by challenges in database systems and bus networks, contention resolution has endured as an important abstraction for resource sharing, despite decades of technological change. Here, we survey the literature on…

    Submitted 4 July, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  19. arXiv:2402.19052  [pdf]

    cs.CL cs.HC

    Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: A Benchmark Study

    Authors: Prottay Kumar Adhikary, Aseem Srivastava, Shivani Kumar, Salam Michael Singh, Puneet Manuja, Jini K Gopinath, Vijay Krishnan, Swati Kedia, Koushik Sinha Deb, Tanmoy Chakraborty

    Abstract: Comprehensive summaries of sessions enable an effective continuity in mental health counseling, facilitating informed therapy planning. Yet, manual summarization presents a significant challenge, diverting experts' attention from the core counseling process. This study evaluates the effectiveness of state-of-the-art Large Language Models (LLMs) in selectively summarizing various components of ther…

    Submitted 29 February, 2024; originally announced February 2024.

  20. arXiv:2402.18944  [pdf, other]

    cs.CL cs.AI

    SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)

    Authors: Shivani Kumar, Md Shad Akhtar, Erik Cambria, Tanmoy Chakraborty

    Abstract: We present SemEval-2024 Task 10, a shared task centred on identifying emotions and finding the rationale behind their flips within monolingual English and Hindi-English code-mixed dialogues. This task comprises three distinct subtasks - emotion recognition in conversation for code-mixed dialogues, emotion flip reasoning for code-mixed dialogues, and emotion flip reasoning for English dialogues. Pa…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 11 pages, 3 figures, 7 tables

  21. arXiv:2402.18312  [pdf, other]

    cs.CL cs.LG

    How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning

    Authors: Subhabrata Dutta, Joykirat Singh, Soumen Chakrabarti, Tanmoy Chakraborty

    Abstract: Despite superior reasoning prowess demonstrated by Large Language Models (LLMs) with Chain-of-Thought (CoT) prompting, a lack of understanding prevails around the internal mechanisms of the models that facilitate CoT generation. This work investigates the neural sub-structures within LLMs that manifest CoT reasoning from a mechanistic point of view. From an analysis of Llama-2 7B applied to multis…

    Submitted 6 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  22. arXiv:2402.13623  [pdf, other]

    cs.CL cs.SI

    FLAME: Self-Supervised Low-Resource Taxonomy Expansion using Large Language Models

    Authors: Sahil Mishra, Ujjwal Sudev, Tanmoy Chakraborty

    Abstract: Taxonomies represent an arborescence hierarchical structure that establishes relationships among entities to convey knowledge within a specific domain. Each edge in the taxonomy signifies a hypernym-hyponym relationship. Taxonomies find utility in various real-world applications, such as e-commerce search engines and recommendation systems. Consequently, there arises a necessity to enhance these t…

    Submitted 21 February, 2024; originally announced February 2024.

  23. arXiv:2402.03349  [pdf, other]

    physics.geo-ph cs.AI cs.LG physics.ao-ph

    When Geoscience Meets Generative AI and Large Language Models: Foundations, Trends, and Future Challenges

    Authors: Abdenour Hadid, Tanujit Chakraborty, Daniel Busby

    Abstract: Generative Artificial Intelligence (GAI) represents an emerging field that promises the creation of synthetic data and outputs in different modalities. GAI has recently shown impressive results across a large spectrum of applications ranging from biology, medicine, education, legislation, computer science, and finance. As one strives for enhanced safety, efficiency, and sustainability, generative…

    Submitted 25 January, 2024; originally announced February 2024.

  24. arXiv:2402.02144  [pdf, other]

    cs.CL

    Probing Critical Learning Dynamics of PLMs for Hate Speech Detection

    Authors: Sarah Masud, Mohammad Aflah Khan, Vikram Goyal, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: Despite the widespread adoption, there is a lack of research into how various critical aspects of pretrained language models (PLMs) affect their performance in hate speech detection. Through five research questions, our findings and recommendations lay the groundwork for empirically investigating different aspects of PLMs' use in hate speech detection. We deep dive into comparing different pretrai…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 20 pages, 9 figures, 14 tables. Accepted at EACL'24

  25. arXiv:2401.16727  [pdf, other]

    cs.CL

    Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models

    Authors: Ming Shan Hee, Shivam Sharma, Rui Cao, Palash Nandi, Tanmoy Chakraborty, Roy Ka-Wei Lee

    Abstract: In the evolving landscape of online communication, moderating hate speech (HS) presents an intricate challenge, compounded by the multimodal nature of digital content. This comprehensive survey delves into the recent strides in HS moderation, spotlighting the burgeoning role of large language models (LLMs) and large multimodal models (LMMs). Our exploration begins with a thorough analysis of curre…

    Submitted 1 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Preprint; Under-Review

  26. arXiv:2401.13334  [pdf, other]

    cs.LG cs.AI

    Explainable Bayesian Optimization

    Authors: Tanmay Chakraborty, Christin Seifert, Christian Wirth

    Abstract: In industry, Bayesian optimization (BO) is widely applied in the human-AI collaborative parameter tuning of cyber-physical systems. However, BO's solutions may deviate from human experts' actual goal due to approximation errors and simplified objectives, requiring subsequent tuning. The black-box nature of BO limits the collaborative tuning process because the expert does not trust the BO recommen…

    Submitted 24 January, 2024; originally announced January 2024.

  27. arXiv:2401.12995  [pdf, other]

    cs.CL

    Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation in Dialogues

    Authors: Shivani Kumar, Tanmoy Chakraborty

    Abstract: Code-mixing, the blending of multiple languages within a single conversation, introduces a distinctive challenge, particularly in the context of response generation. Capturing the intricacies of code-mixing proves to be a formidable task, given the wide-ranging variations influenced by individual speaking styles and cultural backgrounds. In this study, we explore response generation within code-mi…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 14 pages, 8 figures, 7 tables. Accepted at EACL (findings) 2024

  28. arXiv:2401.10036  [pdf, other]

    cs.CR cs.AI cs.IR cs.LO

    LOCALINTEL: Generating Organizational Threat Intelligence from Global and Local Cyber Knowledge

    Authors: Shaswata Mitra, Subash Neupane, Trisha Chakraborty, Sudip Mittal, Aritran Piplai, Manas Gaur, Shahram Rahimi

    Abstract: Security Operations Center (SoC) analysts gather threat reports from openly accessible global threat databases and customize them manually to suit a particular organization's needs. These analysts also depend on internal repositories, which act as a private local knowledge database for an organization. Credible cyber intelligence, critical operational details, and relevant organizational information…

    Submitted 18 January, 2024; originally announced January 2024.

  29. arXiv:2401.05680  [pdf, other]

    cs.CR cs.AI cs.LG cs.NE

    Use of Graph Neural Networks in Aiding Defensive Cyber Operations

    Authors: Shaswata Mitra, Trisha Chakraborty, Subash Neupane, Aritran Piplai, Sudip Mittal

    Abstract: In an increasingly interconnected world, where information is the lifeblood of modern society, regular cyber-attacks sabotage the confidentiality, integrity, and availability of digital systems and information. Additionally, cyber-attacks differ depending on the objective and evolve rapidly to disguise defensive systems. However, a typical cyber-attack demonstrates a series of stages from attack i…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 35 pages, 9 figures, 8 tables

  30. arXiv:2312.06022  [pdf, other]

    cs.CL

    Exploiting Representation Bias for Data Distillation in Abstractive Text Summarization

    Authors: Yash Kumar Atri, Vikram Goyal, Tanmoy Chakraborty

    Abstract: Abstractive text summarization is surging with the number of training samples to cater to the needs of the deep learning models. These models tend to exploit the training data representations to attain superior performance by improving the quantitative element of the resultant summary. However, increasing the size of the training set may not always be the ideal solution to maximize the performance…

    Submitted 20 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

  31. arXiv:2312.05878  [pdf, other]

    stat.ML cs.LG

    Skew Probabilistic Neural Networks for Learning from Imbalanced Data

    Authors: Shraddha M. Naik, Tanujit Chakraborty, Abdenour Hadid, Bibhas Chakraborty

    Abstract: Real-world datasets often exhibit imbalanced data distribution, where certain class levels are severely underrepresented. In such cases, traditional pattern classifiers have shown a bias towards the majority class, impeding accurate predictions for the minority class. This paper introduces an imbalanced data-oriented approach using probabilistic neural networks (PNNs) with a skew normal probabilit…

    Submitted 10 December, 2023; originally announced December 2023.

  32. arXiv:2312.05571  [pdf, other]

    cs.AI cs.LG

    Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning

    Authors: Subhabrata Dutta, Joykirat Singh, Ishan Pandey, Sunny Manchanda, Soumen Chakrabarti, Tanmoy Chakraborty

    Abstract: Large Language Models (LLMs) exhibit zero-shot mathematical reasoning capacity as a behavior emergent with scale, commonly manifesting as chain-of-thoughts (CoT) reasoning. However, multiple empirical findings suggest that this prowess is exclusive to LLMs with exorbitant sizes (beyond 50 billion parameters). Meanwhile, educational neuroscientists suggest that symbolic algebraic manipulation be int…

    Submitted 19 December, 2023; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  33. arXiv:2311.17446  [pdf, other]

    cs.LG cs.AI

    Uncertainty in Additive Feature Attribution methods

    Authors: Abhishek Madaan, Tanya Chowdhury, Neha Rana, James Allan, Tanmoy Chakraborty

    Abstract: In this work, we explore various topics that fall under the umbrella of Uncertainty in post-hoc Explainable AI (XAI) methods. We in particular focus on the class of additive feature attribution explanation methods. We first describe our specifications of uncertainty and compare various statistical and recent methods to quantify the same. Next, for a particular instance, we study the relationship b…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 14

    ACM Class: I.2.6

  34. arXiv:2311.14359  [pdf, other]

    stat.ML cs.LG stat.AP

    Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study

    Authors: Xueqing Liu, Nina Deliu, Tanujit Chakraborty, Lauren Bell, Bibhas Chakraborty

    Abstract: Mobile health (mHealth) technologies aim to improve distal outcomes, such as clinical conditions, by optimizing proximal outcomes through just-in-time adaptive interventions. Contextual bandits provide a suitable framework for customizing such interventions according to individual time-varying contexts, intending to maximize cumulative proximal outcomes. However, unique challenges such as modeling…

    Submitted 24 November, 2023; originally announced November 2023.

  35. arXiv:2311.09834  [pdf, other]

    cs.CL

    Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection

    Authors: Sarah Masud, Mohammad Aflah Khan, Md. Shad Akhtar, Tanmoy Chakraborty

    Abstract: As hate speech continues to proliferate on the web, it is becoming increasingly important to develop computational methods to mitigate it. Reactively, using black-box models to identify hateful content can perplex users as to why their posts were automatically flagged as hateful. On the other hand, proactive mitigation can be achieved by suggesting rephrasing before a post is made public. However,…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 8 pages, 1 figure, 4 Tables

  36. arXiv:2310.19267  [pdf, other]

    cs.CL

    Overview of the CLAIMSCAN-2023: Uncovering Truth in Social Media through Claim Detection and Identification of Claim Spans

    Authors: Megha Sundriyal, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: A significant increase in content creation and information exchange has been made possible by the quick development of online social media platforms, which has been very advantageous. However, these platforms have also become a haven for those who disseminate false information, propaganda, and fake news. Claims are essential in forming our perceptions of the world, but sadly, they are frequently u…

    Submitted 30 October, 2023; originally announced October 2023.

  37. arXiv:2310.18338  [pdf, other]

    cs.CL cs.AI

    Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

    Authors: Gurusha Juneja, Subhabrata Dutta, Soumen Chakrabarti, Sunny Manchanda, Tanmoy Chakraborty

    Abstract: Large Language Models (LLMs) prompted to generate chain-of-thought (CoT) exhibit impressive reasoning capabilities. Recent attempts at prompt decomposition toward solving complex, multi-step reasoning problems depend on the ability of the LLM to simultaneously decompose and solve the problem. A significant disadvantage is that foundational LLMs are typically not available for fine-tuning, making a…

    Submitted 27 February, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (Typos corrected)

  38. arXiv:2310.14338  [pdf, other]

    cs.CL cs.AI

    From Chaos to Clarity: Claim Normalization to Empower Fact-Checking

    Authors: Megha Sundriyal, Tanmoy Chakraborty, Preslav Nakov

    Abstract: With the rise of social media, users are exposed to many misleading claims. However, the pervasive noise inherent in these posts presents a challenge in identifying precise and prominent claims that require verification. Extracting the important claims from such posts is arduous and time-consuming, yet it is an underexplored problem. Here, we aim to bridge this gap. We introduce a novel task, Clai…

    Submitted 12 February, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted at Findings EMNLP2023

  39. arXiv:2310.14206  [pdf, other]

    cs.CL cs.LG

    Manifold-Preserving Transformers are Effective for Short-Long Range Encoding

    Authors: Ayan Sengupta, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: Multi-head self-attention-based Transformers have shown promise in different learning tasks. Albeit these models exhibit significant improvement in understanding short-term and long-term contexts from sequences, encoders of Transformers and their variants fail to preserve layer-wise contextual information. Transformers usually project tokens onto sparse manifolds and fail to preserve mathematical…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 17 pages, 7 figures, 5 tables, Findings of the Association for Computational Linguistics: EMNLP2023

  40. arXiv:2310.13080  [pdf, other]

    cs.CL cs.AI

    From Multilingual Complexity to Emotional Clarity: Leveraging Commonsense to Unveil Emotions in Code-Mixed Dialogues

    Authors: Shivani Kumar, Ramaneswaran S, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: Understanding emotions during conversation is a fundamental aspect of human communication, driving NLP research for Emotion Recognition in Conversation (ERC). While considerable research has focused on discerning emotions of individual speakers in monolingual dialogues, understanding the emotional dynamics in code-mixed conversations has received relatively less attention. This motivates our under…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Paper accepted in EMNLP 2023. 15 pages, 6 figures, 9 tables

  41. arXiv:2310.05189  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Factuality Challenges in the Era of Large Language Models

    Authors: Isabelle Augenstein, Timothy Baldwin, Meeyoung Cha, Tanmoy Chakraborty, Giovanni Luca Ciampaglia, David Corney, Renee DiResta, Emilio Ferrara, Scott Hale, Alon Halevy, Eduard Hovy, Heng Ji, Filippo Menczer, Ruben Miguez, Preslav Nakov, Dietram Scheufele, Shivam Sharma, Giovanni Zagni

    Abstract: The emergence of tools based on Large Language Models (LLMs), such as OpenAI's ChatGPT, Microsoft's Bing Chat, and Google's Bard, has garnered immense public attention. These incredibly useful, natural-sounding tools mark significant advances in natural language generation, yet they exhibit a propensity to generate false, erroneous, or misleading content -- commonly referred to as "hallucinations.…

    Submitted 9 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Our article offers a comprehensive examination of the challenges and risks associated with Large Language Models (LLMs), focusing on their potential impact on the veracity of information in today's digital landscape

  42. arXiv:2309.11896  [pdf, other]

    cs.CL cs.CY

    Focal Inferential Infusion Coupled with Tractable Density Discrimination for Implicit Hate Speech Detection

    Authors: Sarah Masud, Ashutosh Bajpai, Tanmoy Chakraborty

    Abstract: Although pre-trained large language models (PLMs) have achieved state-of-the-art on many NLP tasks, they lack understanding of subtle expressions of implicit hate speech. Such nuanced and implicit hate is often misclassified as non-hate. Various attempts have been made to enhance the detection of (implicit) hate content by augmenting external context or enforcing label separation via distance-base…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 21 pages, 6 Figures and 9 Tables

  43. arXiv:2309.09274  [pdf, other

    cs.CL

    Leveraging Social Discourse to Measure Check-worthiness of Claims for Fact-checking

    Authors: Megha Sundriyal, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: The expansion of online social media platforms has led to a surge in online content consumption. However, this has also paved the way for disseminating false claims and misinformation. As a result, there is an escalating demand for a substantial workforce to sift through and validate such unverified claims. Currently, these claims are manually verified by fact-checkers. Still, the volume of online…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 28 pages, 2 figures, 8 tables

  44. arXiv:2309.03750  [pdf, other

    cs.CV

    PBP: Path-based Trajectory Prediction for Autonomous Driving

    Authors: Sepideh Afshar, Nachiket Deo, Akshay Bhagat, Titas Chakraborty, Yunming Shao, Balarama Raju Buddharaju, Adwait Deshpande, Henggang Cui

    Abstract: Trajectory prediction plays a crucial role in the autonomous driving stack by enabling autonomous vehicles to anticipate the motion of surrounding agents. Goal-based prediction models have gained traction in recent years for addressing the multimodal nature of future trajectories. Goal-based prediction models simplify multimodal prediction by first predicting 2D goal locations of agents and then p…

    Submitted 2 March, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Published at ICRA 2024; Sepideh Afshar and Nachiket Deo contributed equally

  45. arXiv:2309.02915  [pdf, other

    cs.CL cs.LG

    Persona-aware Generative Model for Code-mixed Language

    Authors: Ayan Sengupta, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: Code-mixing and script-mixing are prevalent across online social networks and multilingual societies. However, a user's preference toward code-mixing depends on the socioeconomic status, demographics of the user, and the local context, which existing generative models mostly ignore while generating code-mixed texts. In this work, we make a pioneering attempt to develop a persona-aware generative m…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 4 tables, 4 figures

  46. arXiv:2309.01618  [pdf, other

    cs.CL

    Critical Behavioral Traits Foster Peer Engagement in Online Mental Health Communities

    Authors: Aseem Srivastava, Tanya Gupta, Alison Cerezo, Sarah Peregrine Lord, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: Online Mental Health Communities (OMHCs), such as Reddit, have witnessed a surge in popularity as go-to platforms for seeking information and support in managing mental health needs. Platforms like Reddit offer immediate interactions with peers, granting users a vital space for seeking mental health assistance. However, the largely unregulated nature of these platforms introduces intricate challen…

    Submitted 4 September, 2023; originally announced September 2023.

  47. arXiv:2308.16316  [pdf, other

    cs.LG cs.CV

    Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art

    Authors: Tanujit Chakraborty, Ujjwal Reddy K S, Shraddha M. Naik, Madhurima Panja, Bayapureddy Manvitha

    Abstract: Since their inception in 2014, Generative Adversarial Networks (GANs) have rapidly emerged as powerful tools for generating realistic and diverse data across various domains, including computer vision and other applied areas. Consisting of a discriminative network and a generative network engaged in a Minimax game, GANs have revolutionized the field of generative modeling. In February 2018, GAN se…

    Submitted 30 August, 2023; originally announced August 2023.

  48. arXiv:2308.04305  [pdf, other

    cs.DC cs.CR cs.DS

    Defending Hash Tables from Subterfuge with Depth Charge

    Authors: Trisha Chakraborty, Jared Saia, Maxwell Young

    Abstract: We consider the problem of defending a hash table against a Byzantine attacker that is trying to degrade the performance of query, insertion and deletion operations. Our defense makes use of resource burning (RB) -- the verifiable expenditure of network resources -- where the issuer of a request incurs some RB cost. Our algorithm, Depth Charge, charges RB costs for operations based on the dept…

    Submitted 8 August, 2023; originally announced August 2023.

  49. arXiv:2307.07255  [pdf, other

    cs.CL cs.AI

    Dialogue Agents 101: A Beginner's Guide to Critical Ingredients for Designing Effective Conversational Systems

    Authors: Shivani Kumar, Sumit Bhatia, Milan Aggarwal, Tanmoy Chakraborty

    Abstract: Sharing ideas through communication with peers is the primary mode of human interaction. Consequently, extensive research has been conducted in the area of conversational AI, leading to an increase in the availability and diversity of conversational tasks, datasets, and methods. However, with numerous tasks being explored simultaneously, the current landscape of conversational AI becomes fragmente…

    Submitted 23 May, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted at the journal of Natural Language Processing (formerly Natural Language Engineering). 21 pages, 3 figures, 3 tables

  50. arXiv:2306.13968  [pdf, other

    cs.CL cs.AI

    Fusing Multimodal Signals on Hyper-complex Space for Extreme Abstractive Text Summarization (TL;DR) of Scientific Contents

    Authors: Yash Kumar Atri, Vikram Goyal, Tanmoy Chakraborty

    Abstract: The realm of scientific text summarization has experienced remarkable progress due to the availability of annotated brief summaries and ample data. However, the utilization of multiple input modalities, such as videos and audio, has yet to be thoroughly explored. At present, scientific multimodal-input-based text summarization systems tend to employ longer target summaries like abstracts, leading…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted to ADS-SIGKDD2023