Showing 1–49 of 49 results for author: Weld, D S

  1. arXiv:2406.10370

    cs.HC

    Let's Get to the Point: LLM-Supported Planning, Drafting, and Revising of Research-Paper Blog Posts

    Authors: Marissa Radensky, Daniel S. Weld, Joseph Chee Chang, Pao Siangliulue, Jonathan Bragg

    Abstract: Research-paper blog posts help scientists disseminate their work to a larger audience, but translating papers into this format requires substantial additional effort. Blog post creation is not simply transforming a long-form article into a short output, as studied in most prior work on human-AI summarization. In contrast, blog posts are typically full-length articles that require a combination of…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 28 pages, 9 figures in main text (not appendix)

  2. arXiv:2312.11681

    cs.HC cs.AI cs.CL

    Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows

    Authors: Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer

    Abstract: LLM chains enable complex tasks by decomposing work into a sequence of subtasks. Similarly, the more established techniques of crowdsourcing workflows decompose complex tasks into smaller tasks for human crowdworkers. Chains address LLM errors analogously to the way crowdsourcing workflows address human error. To characterize opportunities for LLM chaining, we survey 107 papers across the crowdsou…

    Submitted 6 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2310.07581

    cs.HC

    Qlarify: Recursively Expandable Abstracts for Directed Information Retrieval over Scientific Papers

    Authors: Raymond Fok, Joseph Chee Chang, Tal August, Amy X. Zhang, Daniel S. Weld

    Abstract: Navigating the vast scientific literature often starts with browsing a paper's abstract. However, when a reader seeks additional information, not present in the abstract, they face a costly cognitive chasm during their dive into the full text. To bridge this gap, we introduce recursively expandable abstracts, a novel interaction paradigm that dynamically expands abstracts by progressively incorpor…

    Submitted 15 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 21 pages, 10 figures, 4 tables. arXiv admin note: text overlap with arXiv:2305.14314 by other authors

  4. arXiv:2305.07722

    cs.AI cs.HC

    In Search of Verifiability: Explanations Rarely Enable Complementary Performance in AI-Advised Decision Making

    Authors: Raymond Fok, Daniel S. Weld

    Abstract: The current literature on AI-advised decision making -- involving explainable AI systems advising human decision makers -- presents a series of inconclusive and confounding results. To synthesize these findings, we propose a simple theory that elucidates the frequent failure of AI explanations to engender appropriate reliance and complementary decision making performance. We argue explanations are…

    Submitted 1 February, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: 10 pages, 6 figures, 1 table, working paper

  5. arXiv:2303.14334

    cs.HC cs.AI cs.CL

    The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    Authors: Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, et al. (30 additional authors not shown)

    Abstract: Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has chan…

    Submitted 23 April, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  6. arXiv:2303.06264

    cs.HC cs.CL

    An Interactive UI to Support Sensemaking over Collections of Parallel Texts

    Authors: Joyce Zhou, Elena Glassman, Daniel S. Weld

    Abstract: Scientists and science journalists, among others, often need to make sense of a large number of papers and how they compare with each other in scope, focus, findings, or any other important factors. However, with a large corpus of papers, it's cognitively demanding to pairwise compare and contrast them all with each other. Fully automating this review process would be infeasible, because it often…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 13 pages, 12 figures

  7. ScatterShot: Interactive In-context Example Curation for Text Transformation

    Authors: Tongshuang Wu, Hua Shen, Daniel S. Weld, Jeffrey Heer, Marco Tulio Ribeiro

    Abstract: The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when "enough" examples have been included even for kno…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: IUI 2023: 28th International Conference on Intelligent User Interfaces

  8. CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context

    Authors: Joseph Chee Chang, Amy X. Zhang, Jonathan Bragg, Andrew Head, Kyle Lo, Doug Downey, Daniel S. Weld

    Abstract: When reading a scholarly article, inline citations help researchers contextualize the current article and discover relevant prior work. However, it can be challenging to prioritize and make sense of the hundreds of citations encountered during literature reviews. This paper introduces CiteSee, a paper reading tool that leverages a user's publishing, reading, and saving activities to provide person…

    Submitted 14 February, 2023; originally announced February 2023.

  9. arXiv:2301.10140

    cs.DL cs.CL

    The Semantic Scholar Open Data Platform

    Authors: Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin, et al. (23 additional authors not shown)

    Abstract: The volume of scientific output is creating an urgent need for automated tools to help scientists keep up with developments in their field. Semantic Scholar (S2) is an open data platform and website aimed at accelerating science by helping scholars discover and understand scientific literature. We combine public and proprietary data sources using state-of-the-art techniques for scholarly PDF conte…

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 8 pages, 6 figures

  10. FeedLens: Polymorphic Lenses for Personalizing Exploratory Search over Knowledge Graphs

    Authors: Harmanpreet Kaur, Doug Downey, Amanpreet Singh, Evie Yu-Yen Cheng, Daniel S. Weld, Jonathan Bragg

    Abstract: The vast scale and open-ended nature of knowledge graphs (KGs) make exploratory search over them cognitively demanding for users. We introduce a new technique, polymorphic lenses, that improves exploratory search over a KG by obtaining new leverage from the existing preference models that KG-based systems maintain for recommending content. The approach is based on a simple but powerful observation…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: To appear at UIST 2022

  11. arXiv:2205.06982

    cs.CL cs.AI cs.HC

    ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts

    Authors: Sonia K. Murthy, Kyle Lo, Daniel King, Chandra Bhagavatula, Bailey Kuehl, Sophie Johnson, Jonathan Borchardt, Daniel S. Weld, Tom Hope, Doug Downey

    Abstract: Systems that can automatically define unfamiliar terms hold the promise of improving the accessibility of scientific texts, especially for readers who may lack prerequisite background knowledge. However, current systems assume a single "best" description per concept, which fails to account for the many potentially useful ways a concept can be described. We present ACCoRD, an end-to-end system tack…

    Submitted 14 May, 2022; originally announced May 2022.

  12. Scim: Intelligent Skimming Support for Scientific Papers

    Authors: Raymond Fok, Hita Kambhamettu, Luca Soldaini, Jonathan Bragg, Kyle Lo, Andrew Head, Marti A. Hearst, Daniel S. Weld

    Abstract: Researchers need to keep up with immense literatures, though it is time-consuming and difficult to do so. In this paper, we investigate the role that intelligent interfaces can play in helping researchers skim papers, that is, rapidly reviewing a paper to attain a cursory understanding of its contents. After conducting formative interviews and a design probe, we suggest that skimming aids should a…

    Submitted 25 September, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Updated to reflect version published in proceedings of IUI 2023

  13. arXiv:2205.04050

    cs.CL

    Few-shot Mining of Naturally Occurring Inputs and Outputs

    Authors: Mandar Joshi, Terra Blevins, Mike Lewis, Daniel S. Weld, Luke Zettlemoyer

    Abstract: Creating labeled natural language training data is expensive and requires significant human effort. We mine input output examples from large corpora using a supervised mining function trained using a small seed set of only 100 examples. The mining consists of two stages -- (1) a biencoder-based recall-oriented dense search which pairs inputs with potential outputs, and (2) a crossencoder-based fil…

    Submitted 9 May, 2022; originally announced May 2022.

  14. arXiv:2205.02007

    cs.CL cs.CY cs.HC cs.IR

    A Computational Inflection for Scientific Discovery

    Authors: Tom Hope, Doug Downey, Oren Etzioni, Daniel S. Weld, Eric Horvitz

    Abstract: We stand at the foot of a significant inflection in the trajectory of scientific discovery. As society continues on its fast-paced digital transformation, so does humankind's collective scientific knowledge and discourse. We now read and write papers in digitized form, and a great deal of the formal and informal processes of science are captured digitally -- including papers, preprints and books,…

    Submitted 24 May, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to CACM

  15. arXiv:2204.13194

    cs.HC cs.AI cs.LG

    Exploring How Anomalous Model Input and Output Alerts Affect Decision-Making in Healthcare

    Authors: Marissa Radensky, Dustin Burson, Rajya Bhaiya, Daniel S. Weld

    Abstract: An important goal in the field of human-AI interaction is to help users more appropriately trust AI systems' decisions. A situation in which the user may particularly benefit from more appropriate trust is when the AI receives anomalous input or provides anomalous output. To the best of our knowledge, this is the first work towards understanding how anomaly alerts may contribute to appropriate tru…

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: 13 pages, 3 figures

    ACM Class: H.5.2; I.2.1

  16. arXiv:2204.10254

    cs.IR cs.HC cs.SI

    From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks

    Authors: Hyeonsu B. Kang, Rafal Kocielnik, Andrew Head, Jiangjiang Yang, Matt Latzke, Aniket Kittur, Daniel S. Weld, Doug Downey, Jonathan Bragg

    Abstract: The ever-increasing pace of scientific publication necessitates methods for quickly identifying relevant papers. While neural recommenders trained on user interests can help, they still result in long, monotonous lists of suggested papers. To improve the discovery experience we introduce multiple new methods for augmenting recommendations with textual relevance messages that highlight knowledg…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: to be published in ACM SIGCHI 2022

  17. arXiv:2203.08436

    cs.CL

    Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search

    Authors: Daniel King, Zejiang Shen, Nishant Subramani, Daniel S. Weld, Iz Beltagy, Doug Downey

    Abstract: Abstractive summarization systems today produce fluent and relevant output, but often "hallucinate" statements not supported by the source text. We analyze the connection between hallucinations and training data, and find evidence that models hallucinate because they train on target summaries that are unsupported by the source. Based on our findings, we present PINOCCHIO, a new decoding method tha…

    Submitted 17 November, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: 16 pages, 2 figures, 7 tables

  18. arXiv:2109.13301

    cs.IR cs.HC cs.LG

    Exploring The Role of Local and Global Explanations in Recommender Systems

    Authors: Marissa Radensky, Doug Downey, Kyle Lo, Zoran Popović, Daniel S. Weld

    Abstract: Explanations are well-known to improve recommender systems' transparency. These explanations may be local, explaining an individual recommendation, or global, explaining the recommender model in general. Despite their widespread use, there has been little investigation into the relative benefits of these two approaches. Do they provide the same benefits to users, or do they serve different purpose…

    Submitted 27 September, 2021; originally announced September 2021.

  19. arXiv:2108.13751

    cs.CL cs.HC cs.IR

    A Search Engine for Discovery of Scientific Challenges and Directions

    Authors: Dan Lahav, Jon Saad Falcon, Bailey Kuehl, Sophie Johnson, Sravanthi Parasa, Noam Shomron, Duen Horng Chau, Diyi Yang, Eric Horvitz, Daniel S. Weld, Tom Hope

    Abstract: Keeping track of scientific challenges, advances and emerging directions is a fundamental part of research. However, researchers face a flood of papers that hinders discovery of important knowledge. In biomedicine, this directly impacts human lives. To address this problem, we present a novel task of extraction and search of scientific challenges and directions, to facilitate rapid knowledge disco…

    Submitted 19 January, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: AAAI 2022

    Journal ref: AAAI 2022

  20. Goldilocks: Consistent Crowdsourced Scalar Annotations with Relative Uncertainty

    Authors: Quanze Chen, Daniel S. Weld, Amy X. Zhang

    Abstract: Human ratings have become a crucial resource for training and evaluating machine learning systems. However, traditional elicitation methods for absolute and comparative rating suffer from issues with consistency and often do not distinguish between uncertainty due to disagreement between annotators and ambiguity inherent to the item being rated. In this work, we present Goldilocks, a novel crowd r…

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: CSCW '21

  21. arXiv:2106.00676

    cs.CL cs.CV

    VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups

    Authors: Zejiang Shen, Kyle Lo, Lucy Lu Wang, Bailey Kuehl, Daniel S. Weld, Doug Downey

    Abstract: Accurately extracting structured content from PDFs is a critical first step for NLP over scientific papers. Recent work has improved extraction accuracy by incorporating elementary layout information, e.g., each token's 2D position on the page, into language model pretraining. We introduce new methods that explicitly model VIsual LAyout (VILA) groups, i.e., text lines or text blocks, to further im…

    Submitted 5 January, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: To appear in TACL 2022. The arXiv version is a pre-MIT Press publication version. (17 pages, 5 figures, 9 tables)

  22. arXiv:2105.00076

    cs.DL cs.HC

    Improving the Accessibility of Scientific Documents: Current State, User Needs, and a System Solution to Enhance Scientific PDF Accessibility for Blind and Low Vision Users

    Authors: Lucy Lu Wang, Isabel Cachola, Jonathan Bragg, Evie Yu-Yen Cheng, Chelsea Haupt, Matt Latzke, Bailey Kuehl, Madeleine van Zuylen, Linda Wagner, Daniel S. Weld

    Abstract: The majority of scientific papers are distributed in PDF, which pose challenges for accessibility, especially for blind and low vision (BLV) readers. We characterize the scope of this problem by assessing the accessibility of 11,397 PDFs published 2010--2019 sampled across various fields of study, finding that only 2.4% of these PDFs satisfy all of our defined accessibility criteria. We introduce…

    Submitted 30 April, 2021; originally announced May 2021.

    Comments: 44 pages, 11 figures, 10 tables, 4 appendices; accessible PDF is available at https://llwang.net/publications/2021_wang_scia11y.pdf

  23. arXiv:2101.06561

    cs.CL cs.AI

    GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation

    Authors: Daniel Khashabi, Gabriel Stanovsky, Jonathan Bragg, Nicholas Lourie, Jungo Kasai, Yejin Choi, Noah A. Smith, Daniel S. Weld

    Abstract: While often assumed a gold standard, effective human evaluation of text generation remains an important, open area for research. We revisit this problem with a focus on producing consistent evaluations that are reproducible -- over time and across different populations. We study this goal in different stages of the human evaluation pipeline. In particular, we consider design choices for the annota…

    Submitted 31 October, 2022; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: Accepted to EMNLP 2022 main conference, visit our project page at: https://genie.apps.allenai.org

  24. arXiv:2101.00288

    cs.CL

    Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

    Authors: Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer, Daniel S. Weld

    Abstract: While counterfactual examples are useful for analysis and training of NLP models, current generation methods either rely on manual labor to create very few counterfactuals, or only instantiate limited types of perturbations such as paraphrases or word substitutions. We present Polyjuice, a general-purpose counterfactual generator that allows for control over perturbation types and locations, train…

    Submitted 1 June, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: ACL 2021, main conference, long paper

  25. arXiv:2010.05129

    cs.CL

    Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions

    Authors: Dongyeop Kang, Andrew Head, Risham Sidhu, Kyle Lo, Daniel S. Weld, Marti A. Hearst

    Abstract: The task of definition detection is important for scholarly papers, because papers often make use of technical terminology that may be unfamiliar to readers. Despite prior work on definition detection, current approaches are far from being accurate enough to use in real-world applications. In this paper, we first perform in-depth error analysis of the current best performing definition detection s…

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: Workshop on Scholarly Document Processing (SDP), EMNLP 2020

  26. arXiv:2009.14237

    cs.HC cs.AI cs.CL

    Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols

    Authors: Andrew Head, Kyle Lo, Dongyeop Kang, Raymond Fok, Sam Skjonsberg, Daniel S. Weld, Marti A. Hearst

    Abstract: Despite the central importance of research papers to scientific progress, they can be difficult to read. Comprehension is often stymied when the information needed to understand a passage resides somewhere else: in another section, or in another paper. In this work, we envision how interfaces can bring definitions of technical terms and symbols to readers when and where they need them most. We int…

    Submitted 27 April, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: 18 pages, 17 figures, 2 tables. To appear at the 2021 ACM CHI Conference on Human Factors in Computing Systems. For associated video, see https://youtu.be/yYcQf-Yq8B0. v2 changes: expanded discussion of design process and implementation; improved figure design. v3 changes: fixed typo in cell of Table 2; updated HEDDEx and Schwarz-Hearst accuracy in Section 5.3

    ACM Class: H.5.2

  27. arXiv:2006.14779

    cs.AI cs.CL cs.HC cs.LG

    Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance

    Authors: Gagan Bansal, Tongshuang Wu, Joyce Zhou, Raymond Fok, Besmira Nushi, Ece Kamar, Marco Tulio Ribeiro, Daniel S. Weld

    Abstract: Many researchers motivate explainable AI with studies showing that human-AI team performance on decision-making tasks improves when the AI explains its recommendations. However, prior studies observed improvements from explanations only when the AI, alone, outperformed both the human and the best team. Can explanations help lead to complementary performance, where team accuracy is higher than eith…

    Submitted 12 January, 2021; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: CHI'21

  28. High-Precision Extraction of Emerging Concepts from Scientific Literature

    Authors: Daniel King, Doug Downey, Daniel S. Weld

    Abstract: Identification of new concepts in scientific literature can help power faceted search, scientific trend analysis, knowledge-base construction, and more, but current methods are lacking. Manual identification cannot keep up with the torrent of new publications, while the precision of existing automatic techniques is too low for many applications. We present an unsupervised concept extraction method…

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted to SIGIR 2020

    Journal ref: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020) 1549-1552

  29. arXiv:2005.12668

    cs.IR cs.DL cs.HC cs.LG

    SciSight: Combining faceted navigation and research group detection for COVID-19 exploratory scientific search

    Authors: Tom Hope, Jason Portenoy, Kishore Vasan, Jonathan Borchardt, Eric Horvitz, Daniel S. Weld, Marti A. Hearst, Jevin West

    Abstract: The COVID-19 pandemic has sparked unprecedented mobilization of scientists, generating a deluge of papers that makes it hard for researchers to keep track and explore new directions. Search engines are designed for targeted queries, not for discovery of connections across a corpus. In this paper, we present SciSight, a system for exploratory search of COVID-19 research integrating two key capabili…

    Submitted 20 September, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted to EMNLP 2020

  30. arXiv:2005.01583

    cs.IR cs.CV cs.LG

    The Newspaper Navigator Dataset: Extracting And Analyzing Visual Content from 16 Million Historic Newspaper Pages in Chronicling America

    Authors: Benjamin Charles Germain Lee, Jaime Mears, Eileen Jakeway, Meghan Ferriter, Chris Adams, Nathan Yarasavage, Deborah Thomas, Kate Zwaard, Daniel S. Weld

    Abstract: Chronicling America is a product of the National Digital Newspaper Program, a partnership between the Library of Congress and the National Endowment for the Humanities to digitize historic newspapers. Over 16 million pages of historic American newspapers have been digitized for Chronicling America to date, complete with high-resolution images and machine-readable METS/ALTO OCR. Of considerable int…

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: 14 pages, 5 figures

  31. arXiv:2004.15011

    cs.CL

    TLDR: Extreme Summarization of Scientific Documents

    Authors: Isabel Cachola, Kyle Lo, Arman Cohan, Daniel S. Weld

    Abstract: We introduce TLDR generation, a new form of extreme summarization, for scientific papers. TLDR generation involves high source compression and requires expert background knowledge and understanding of complex domain-specific language. To facilitate study on this task, we introduce SciTLDR, a new multi-target dataset of 5.4K TLDRs over 3.2K papers. SciTLDR contains both author-written and expert-de…

    Submitted 8 October, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

  32. arXiv:2004.13102

    cs.AI cs.HC cs.LG

    Is the Most Accurate AI the Best Teammate? Optimizing AI for Teamwork

    Authors: Gagan Bansal, Besmira Nushi, Ece Kamar, Eric Horvitz, Daniel S. Weld

    Abstract: AI practitioners typically strive to develop the most accurate systems, making an implicit assumption that the AI system will function autonomously. However, in practice, AI systems often are used to provide advice to people in domains ranging from criminal justice and finance to healthcare. In such AI-advised decision making, humans and machines form a team, where the human is responsible for mak…

    Submitted 19 February, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: v2

  33. arXiv:2004.10706

    cs.DL cs.CL

    CORD-19: The COVID-19 Open Research Dataset

    Authors: Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Doug Burdick, Darrin Eide, Kathryn Funk, Yannis Katsis, Rodney Kinney, Yunyao Li, Ziyang Liu, William Merrill, Paul Mooney, Dewey Murdick, Devvret Rishi, Jerry Sheehan, Zhihong Shen, Brandon Stilson, Alex Wade, Kuansan Wang, Nancy Xin Ru Wang, Chris Wilhelm, Boya Xie, Douglas Raymond, et al. (3 additional authors not shown)

    Abstract: The COVID-19 Open Research Dataset (CORD-19) is a growing resource of scientific papers on COVID-19 and related historical coronavirus research. CORD-19 is designed to facilitate the development of text mining and information retrieval systems over its rich collection of metadata and structured full text papers. Since its release, CORD-19 has been downloaded over 200K times and has served as the b…

    Submitted 10 July, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: ACL NLP-COVID Workshop 2020

  34. arXiv:2004.07180

    cs.CL

    SPECTER: Document-level Representation Learning using Citation-informed Transformers

    Authors: Arman Cohan, Sergey Feldman, Iz Beltagy, Doug Downey, Daniel S. Weld

    Abstract: Representation learning is a critical ingredient for natural language processing systems. Recent Transformer language models like BERT learn powerful textual representations, but these models are targeted towards token- and sentence-level training objectives and do not leverage information on inter-document relatedness, which limits their document-level representation power. For applications on sc…

    Submitted 20 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  35. arXiv:2003.04315

    cs.IR cs.LG stat.ML

    LIMEADE: From AI Explanations to Advice Taking

    Authors: Benjamin Charles Germain Lee, Doug Downey, Kyle Lo, Daniel S. Weld

    Abstract: Research in human-centered AI has shown the benefits of systems that can explain their predictions. Methods that allow an AI to take advice from humans in response to explanations are similarly useful. While both capabilities are well-developed for transparent learning models (e.g., linear models and GA$^2$Ms), and recent techniques (e.g., LIME and SHAP) can generate explanations for opaque models…

    Submitted 17 January, 2023; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 18 pages, 7 figures

  36. arXiv:1911.02782

    cs.CL cs.DL

    S2ORC: The Semantic Scholar Open Research Corpus

    Authors: Kyle Lo, Lucy Lu Wang, Mark Neumann, Rodney Kinney, Dan S. Weld

    Abstract: We introduce S2ORC, a large corpus of 81.1M English-language academic papers spanning many academic disciplines. The corpus consists of rich metadata, paper abstracts, resolved bibliographic references, as well as structured full text for 8.1M open access papers. Full text is annotated with automatically-detected inline mentions of citations, figures, and tables, each linked to their corresponding…

    Submitted 6 July, 2020; v1 submitted 7 November, 2019; originally announced November 2019.

    Comments: ACL 2020

  37. Pretrained Language Models for Sequential Sentence Classification

    Authors: Arman Cohan, Iz Beltagy, Daniel King, Bhavana Dalvi, Daniel S. Weld

    Abstract: As a step toward better document-level understanding, we explore classification of a sequence of sentences into their corresponding categories, a task that requires understanding sentences in context of the document. Recent successful models for this task have used hierarchical models to contextualize sentence representations, and Conditional Random Fields (CRFs) to incorporate dependencies betwee…

    Submitted 22 September, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019

    Journal ref: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (2019) 3693-3699

  38. arXiv:1908.09091

    cs.CL

    BERT for Coreference Resolution: Baselines and Analysis

    Authors: Mandar Joshi, Omer Levy, Daniel S. Weld, Luke Zettlemoyer

    Abstract: We apply BERT to coreference resolution, achieving strong improvements on the OntoNotes (+3.9 F1) and GAP (+11.5 F1) benchmarks. A qualitative analysis of model predictions indicates that, compared to ELMo and BERT-base, BERT-large is particularly better at distinguishing between related but distinct entities (e.g., President and CEO). However, there is still room for improvement in modeling docum…

    Submitted 22 December, 2019; v1 submitted 24 August, 2019; originally announced August 2019.

    Comments: Fix test set numbers for e2e-coref on GAP

  39. arXiv:1907.10529

    cs.CL cs.LG

    SpanBERT: Improving Pre-training by Representing and Predicting Spans

    Authors: Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, Omer Levy

    Abstract: We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text. Our approach extends BERT by (1) masking contiguous random spans, rather than random tokens, and (2) training the span boundary representations to predict the entire content of the masked span, without relying on the individual token representations within it. SpanBERT consistently outperform…

    Submitted 17 January, 2020; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: Accepted at TACL

  40. arXiv:1810.10733  [pdf, other]

    cs.HC

    Cicero: Multi-Turn, Contextual Argumentation for Accurate Crowdsourcing

    Authors: Quanze Chen, Jonathan Bragg, Lydia B. Chilton, Daniel S. Weld

    Abstract: Traditional approaches for ensuring high quality crowdwork have failed to achieve high-accuracy on difficult problems. Aggregating redundant answers often fails on the hardest problems when the majority is confused. Argumentation has been shown to be effective in mitigating these drawbacks. However, existing argumentation systems only support limited interactions and show workers general justifica…

    Submitted 25 October, 2018; originally announced October 2018.

    Comments: 10 pages

  41. arXiv:1810.08854  [pdf, other]

    cs.CL

    pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference

    Authors: Mandar Joshi, Eunsol Choi, Omer Levy, Daniel S. Weld, Luke Zettlemoyer

    Abstract: Reasoning about implied relationships (e.g., paraphrastic, common sense, encyclopedic) between pairs of words is crucial for many cross-sentence inference problems. This paper proposes new methods for learning and using embeddings of word pairs that implicitly represent background knowledge about such relationships. Our pairwise embeddings are computed as a compositional function on word represent…

    Submitted 5 April, 2019; v1 submitted 20 October, 2018; originally announced October 2018.

    Comments: NAACL camera ready

  42. arXiv:1808.08622  [pdf, other]

    cs.CL

    Semi-Supervised Event Extraction with Paraphrase Clusters

    Authors: James Ferguson, Colin Lockard, Daniel S. Weld, Hannaneh Hajishirzi

    Abstract: Supervised event extraction systems are limited in their accuracy due to the lack of available training data. We present a method for self-training event extraction systems by bootstrapping additional training data. This is done by taking advantage of the occurrence of multiple mentions of the same event instances across newswire articles from multiple sources. If our system can make a highconfide…

    Submitted 26 August, 2018; originally announced August 2018.

    Comments: NAACL 2018

  43. StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow

    Authors: Ziyu Yao, Daniel S. Weld, Wei-Peng Chen, Huan Sun

    Abstract: Stack Overflow (SO) has been a great source of natural language questions and their code solutions (i.e., question-code pairs), which are critical for many tasks including code retrieval and annotation. In most existing research, question-code pairs were collected heuristically and tend to have low quality. In this paper, we investigate a new problem of systematically mining question-code pairs fr…

    Submitted 25 March, 2018; originally announced March 2018.

    Comments: Accepted to the Web Conference 2018 (former WWW 2018), 11 pages, 6 figures

  44. arXiv:1803.04263  [pdf, other]

    cs.AI

    The Challenge of Crafting Intelligible Intelligence

    Authors: Daniel S. Weld, Gagan Bansal

    Abstract: Since Artificial Intelligence (AI) software uses techniques like deep lookahead search and stochastic optimization of huge neural networks to fit mammoth datasets, it often results in complex behavior that is difficult for people to understand. Yet organizations are deploying AI algorithms in many mission-critical settings. To trust their behavior, we must make AI intelligible, either by using inh…

    Submitted 15 October, 2018; v1 submitted 9 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1603.08507 by other authors

  45. arXiv:1705.03551  [pdf, other]

    cs.CL

    TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

    Authors: Mandar Joshi, Eunsol Choi, Daniel S. Weld, Luke Zettlemoyer

    Abstract: We present TriviaQA, a challenging reading comprehension dataset containing over 650K question-answer-evidence triples. TriviaQA includes 95K question-answer pairs authored by trivia enthusiasts and independently gathered evidence documents, six per question on average, that provide high quality distant supervision for answering the questions. We show that, in comparison to other recently introduc…

    Submitted 13 May, 2017; v1 submitted 9 May, 2017; originally announced May 2017.

    Comments: Added references, fixed typos, minor baseline update

  46. arXiv:1608.08724  [pdf, other]

    cs.AI cs.PL

    A Programming Language With a POMDP Inside

    Authors: Christopher H. Lin, Mausam, Daniel S. Weld

    Abstract: We present POAPS, a novel planning system for defining Partially Observable Markov Decision Processes (POMDPs) that abstracts away from POMDP details for the benefit of non-expert practitioners. POAPS includes an expressive adaptive programming language based on Lisp that has constructs for choice points that can be dynamically optimized. Non-experts can use our language to write adaptive programs…

    Submitted 31 August, 2016; originally announced August 2016.

  47. arXiv:1506.06418  [pdf, other]

    cs.CL cs.AI cs.IR

    Extreme Extraction: Only One Hour per Relation

    Authors: Raphael Hoffmann, Luke Zettlemoyer, Daniel S. Weld

    Abstract: Information Extraction (IE) aims to automatically generate a large knowledge base from natural language text, but progress remains slow. Supervised learning requires copious human annotation, while unsupervised and weakly supervised approaches do not deliver competitive accuracy. As a result, most fielded applications of IE, as well as the leading TAC-KBP systems, rely on significant amounts of ma…

    Submitted 21 June, 2015; originally announced June 2015.

    ACM Class: H.2.8, H.3.1, I.2.7, I.5.5

  48. Topological Value Iteration Algorithms

    Authors: Peng Dai, Mausam, Daniel Sabby Weld, Judy Goldsmith

    Abstract: Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs) because it puts the majority of its effort into backing up the entire state space, which turns out to be unnecessary in many cases. In order to overcome this problem, many approaches have been proposed. Among them, ILAO* and variants of RTDP are state-of-the-art ones. These methods use reachability analysi…

    Submitted 16 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 42, pages 181-209, 2011

  49. arXiv:cs/9501102  [pdf, ps]

    cs.AI

    A Domain-Independent Algorithm for Plan Adaptation

    Authors: S. Hanks, D. S. Weld

    Abstract: The paradigms of transformational planning, case-based planning, and plan debugging all involve a process known as plan adaptation - modifying or repairing an old plan so it solves a new problem. In this paper we provide a domain-independent algorithm for plan adaptation, demonstrate that it is sound, complete, and systematic, and compare it to other adaptation algorithms in the literature. Our…

    Submitted 31 December, 1994; originally announced January 1995.

    Comments: See http://www.jair.org/ for any accompanying files

    Journal ref: Journal of Artificial Intelligence Research, Vol 2, (1995), 319-360