Skip to main content

Showing 1–26 of 26 results for author: Miret, S

  1. arXiv:2406.17295  [pdf, other

    cond-mat.mtrl-sci cs.LG

    MatText: Do Language Models Need More than Text & Scale for Materials Modeling?

    Authors: Nawaf Alampara, Santiago Miret, Kevin Maik Jablonka

    Abstract: Effectively representing materials as text has the potential to leverage the vast advancements of large language models (LLMs) for discovering new materials. While LLMs have shown remarkable success in various domains, their application to materials science remains underexplored. A fundamental challenge is the lack of understanding of how to best utilize text-based representations for materials mo… ▽ More

    Submitted 28 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2404.01475  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI physics.chem-ph

    Are large language models superhuman chemists?

    Authors: Adrian Mirza, Nawaf Alampara, Sreekanth Kunchapu, Benedict Emoekabu, Aswanth Krishnan, Mara Wilhelmi, Macjonathan Okereke, Juliane Eberhardt, Amir Mohammad Elahi, Maximilian Greiner, Caroline T. Holick, Tanya Gupta, Mehrdad Asgari, Christina Glaubitz, Lea C. Klepsch, Yannik Köster, Jakob Meyer, Santiago Miret, Tim Hoffmann, Fabian Alexander Kreth, Michael Ringleb, Nicole Roesner, Ulrich S. Schubert, Leanne M. Stafast, Dinga Wonanke , et al. (3 additional authors not shown)

    Abstract: Large language models (LLMs) have gained widespread interest due to their ability to process human language and perform tasks on which they have not been explicitly trained. This is relevant for the chemical sciences, which face the problem of small and diverse datasets that are frequently in the form of text. LLMs have shown promise in addressing these issues and are increasingly being harnessed… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  3. arXiv:2402.05200  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.CL cs.LG

    Are LLMs Ready for Real-World Materials Discovery?

    Authors: Santiago Miret, N M Anoop Krishnan

    Abstract: Large Language Models (LLMs) create exciting possibilities for powerful language processing tools to accelerate research in materials science. While LLMs have great potential to accelerate materials understanding and discovery, they currently fall short in being practical materials science tools. In this position paper, we show relevant failure cases of LLMs in materials science that reveal curren… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2312.07511  [pdf, other

    cs.LG cs.AI q-bio.QM stat.ML

    A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems

    Authors: Alexandre Duval, Simon V. Mathis, Chaitanya K. Joshi, Victor Schmidt, Santiago Miret, Fragkiskos D. Malliaros, Taco Cohen, Pietro Liò, Yoshua Bengio, Michael Bronstein

    Abstract: Recent advances in computational modelling of atomic systems, spanning molecules, proteins, and materials, represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space. In these graphs, the geometric attributes transform according to the inherent physical symmetries of 3D atomic systems, including rotations and translations in Euclidean space, as well as node permutations.… ▽ More

    Submitted 13 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  5. arXiv:2310.14782  [pdf, other

    cs.LG cs.AI

    Towards equilibrium molecular conformation generation with GFlowNets

    Authors: Alexandra Volokhova, Michał Koziarski, Alex Hernández-García, Cheng-Hao Liu, Santiago Miret, Pablo Lemos, Luca Thiede, Zichao Yan, Alán Aspuru-Guzik, Yoshua Bengio

    Abstract: Sampling diverse, thermodynamically feasible molecular conformations plays a crucial role in predicting properties of a molecule. In this paper we propose to use GFlowNet for sampling conformations of small molecules from the Boltzmann distribution, as determined by the molecule's energy. The proposed approach can be used in combination with energy estimation methods of different fidelity and disc… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  6. arXiv:2310.11609  [pdf, other

    cs.LG astro-ph.GA physics.chem-ph

    Reflection-Equivariant Diffusion for 3D Structure Determination from Isotopologue Rotational Spectra in Natural Abundance

    Authors: Austin Cheng, Alston Lo, Santiago Miret, Brooks Pate, Alán Aspuru-Guzik

    Abstract: Structure determination is necessary to identify unknown organic molecules, such as those in natural products, forensic samples, the interstellar medium, and laboratory syntheses. Rotational spectroscopy enables structure determination by providing accurate 3D information about small organic molecules via their moments of inertia. Using these moments, Kraitchman analysis determines isotopic substi… ▽ More

    Submitted 19 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: added software citations

    Journal ref: J. Chem. Phys. 160, 124115 (2024)

  7. arXiv:2310.08511  [pdf, other

    cs.CL cond-mat.mtrl-sci cs.AI

    HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science

    Authors: Yu Song, Santiago Miret, Huan Zhang, Bang Liu

    Abstract: We propose an instruction-based process for trustworthy data curation in materials science (MatSci-Instruct), which we then apply to finetune a LLaMa-based language model targeted for materials science (HoneyBee). MatSci-Instruct helps alleviate the scarcity of relevant, high-quality materials science textual data available in the open literature, and HoneyBee is the first billion-parameter langua… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  8. arXiv:2310.06682  [pdf, other

    cs.LG

    On the importance of catalyst-adsorbate 3D interactions for relaxed energy predictions

    Authors: Alvaro Carbonero, Alexandre Duval, Victor Schmidt, Santiago Miret, Alex Hernandez-Garcia, Yoshua Bengio, David Rolnick

    Abstract: The use of machine learning for material property prediction and discovery has traditionally centered on graph neural networks that incorporate the geometric configuration of all atoms. However, in practice not all this information may be readily available, e.g.~when evaluating the potentially unknown binding of adsorbates to catalyst. In this paper, we investigate whether it is possible to predic… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  9. arXiv:2310.02902  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI

    Searching for High-Value Molecules Using Reinforcement Learning and Transformers

    Authors: Raj Ghugare, Santiago Miret, Adriana Hugessen, Mariano Phielipp, Glen Berseth

    Abstract: Reinforcement learning (RL) over text representations can be effective for finding high-value policies that can search over graphs. However, RL requires careful structuring of the search space and algorithm design to be effective in this challenge. Through extensive experiments, we explore how different design choices for text grammar and algorithmic choices for training can affect an RL policy's… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  10. arXiv:2310.02428  [pdf, other

    cs.LG cond-mat.mtrl-sci

    EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations

    Authors: Vaibhav Bihani, Utkarsh Pratiush, Sajid Mannan, Tao Du, Zhimin Chen, Santiago Miret, Matthieu Micoulaut, Morten M Smedskjaer, Sayan Ranu, N M Anoop Krishnan

    Abstract: Equivariant graph neural networks force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model at… ▽ More

    Submitted 24 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  11. arXiv:2309.05934  [pdf, other

    cond-mat.mtrl-sci cs.AI

    MatSciML: A Broad, Multi-Task Benchmark for Solid-State Materials Modeling

    Authors: Kin Long Kelvin Lee, Carmelo Gonzales, Marcel Nassar, Matthew Spellings, Mikhail Galkin, Santiago Miret

    Abstract: We propose MatSci ML, a novel benchmark for modeling MATerials SCIence using Machine Learning (MatSci ML) methods focused on solid-state materials with periodic crystal structures. Applying machine learning methods to solid-state materials is a nascent field with substantial fragmentation largely driven by the great variety of datasets used to develop machine learning models. This fragmentation ma… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  12. arXiv:2309.03139  [pdf, other

    cs.LG

    Using Multiple Vector Channels Improves E(n)-Equivariant Graph Neural Networks

    Authors: Daniel Levy, Sékou-Oumar Kaba, Carmelo Gonzales, Santiago Miret, Siamak Ravanbakhsh

    Abstract: We present a natural extension to E(n)-equivariant graph neural networks that uses multiple equivariant vectors per node. We formulate the extension and show that it improves performance across different physical systems benchmark tasks, with minimal differences in runtime or number of parameters. The proposed multichannel EGNN outperforms the standard singlechannel EGNN on N-body charged particle… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  13. arXiv:2305.08264  [pdf, other

    cs.CL cond-mat.mtrl-sci cs.AI

    MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling

    Authors: Yu Song, Santiago Miret, Bang Liu

    Abstract: We present MatSci-NLP, a natural language benchmark for evaluating the performance of natural language processing (NLP) models on materials science text. We construct the benchmark from publicly available materials science text data to encompass seven different NLP tasks, including conventional NLP tasks like named entity recognition and relation classification, as well as NLP tasks specific to ma… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

  14. arXiv:2305.05577  [pdf, other

    cs.LG

    FAENet: Frame Averaging Equivariant GNN for Materials Modeling

    Authors: Alexandre Duval, Victor Schmidt, Alex Hernandez Garcia, Santiago Miret, Fragkiskos D. Malliaros, Yoshua Bengio, David Rolnick

    Abstract: Applications of machine learning techniques for materials modeling typically involve functions known to be equivariant or invariant to specific symmetries. While graph neural networks (GNNs) have proven successful in such tasks, they enforce symmetries via the model architecture, which often reduces their expressivity, scalability and comprehensibility. In this paper, we introduce (1) a flexible f… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: Accepted at ICML 2023

  15. arXiv:2301.12040  [pdf, other

    q-bio.BM cs.LG

    ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts

    Authors: Minghao Xu, Xinyu Yuan, Santiago Miret, Jian Tang

    Abstract: Current protein language models (PLMs) learn protein representations mainly based on their sequences, thereby well capturing co-evolutionary information, but they are unable to explicitly acquire protein functions, which is the end goal of protein representation learning. Fortunately, for many proteins, their textual property descriptions are available, where their various functions are also descr… ▽ More

    Submitted 4 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Accpeted by ICML 2023 (Oral), code and data released

  16. arXiv:2212.09146  [pdf, other

    cs.CL

    Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model

    Authors: Parishad BehnamGhader, Santiago Miret, Siva Reddy

    Abstract: Augmenting pretrained language models with retrievers has shown promise in effectively solving common NLP problems, such as language modeling and question answering. In this paper, we evaluate the strengths and weaknesses of popular retriever-augmented language models, namely kNN-LM, REALM, DPR + FiD, Contriever + ATLAS, and Contriever + Flan-T5, in reasoning over retrieved statements across diffe… ▽ More

    Submitted 2 November, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: Accepted in EMNLP2023 Findings

  17. arXiv:2211.13322  [pdf, other

    cs.LG cs.NE physics.chem-ph

    Group SELFIES: A Robust Fragment-Based Molecular String Representation

    Authors: Austin Cheng, Andy Cai, Santiago Miret, Gustavo Malkomes, Mariano Phielipp, Alán Aspuru-Guzik

    Abstract: We introduce Group SELFIES, a molecular string representation that leverages group tokens to represent functional groups or entire substructures while maintaining chemical robustness guarantees. Molecular string representations, such as SMILES and SELFIES, serve as the basis for molecular generation and optimization in chemical language models, deep generative models, and evolutionary methods. Whi… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 11 pages + references and appendix

    Journal ref: Digital Discovery (2023)

  18. arXiv:2211.12020  [pdf, other

    cs.LG physics.comp-ph

    PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

    Authors: Alexandre Duval, Victor Schmidt, Santiago Miret, Yoshua Bengio, Alex Hernández-García, David Rolnick

    Abstract: Mitigating the climate crisis requires a rapid transition towards lower-carbon energy. Catalyst materials play a crucial role in the electrochemical reactions involved in numerous industrial processes key to this transition, such as renewable energy storage and electrofuel synthesis. To reduce the energy spent on such activities, we must quickly discover more efficient catalysts to drive electroch… ▽ More

    Submitted 11 March, 2024; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Journal of Machine Learning Research (JMLR)

  19. arXiv:2210.17484  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI

    The Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science

    Authors: Santiago Miret, Kin Long Kelvin Lee, Carmelo Gonzales, Marcel Nassar, Matthew Spellings

    Abstract: We present the Open MatSci ML Toolkit: a flexible, self-contained, and scalable Python-based framework to apply deep learning models and methods on scientific data with a specific focus on materials science and the OpenCatalyst Dataset. Our toolkit provides: 1. A scalable machine learning workflow for materials science leveraging PyTorch Lightning, which enables seamless scaling across different c… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Paper accompanying Open-Source Software from https://github.com/IntelLabs/matsciml

    Report number: 2835-8856

    Journal ref: Transactions on Machine Learning Research (2023)

  20. arXiv:2210.12765  [pdf, other

    cs.LG stat.ML

    Multi-Objective GFlowNets

    Authors: Moksh Jain, Sharath Chandra Raparthy, Alex Hernandez-Garcia, Jarrid Rector-Brooks, Yoshua Bengio, Santiago Miret, Emmanuel Bengio

    Abstract: We study the problem of generating diverse candidates in the context of Multi-Objective Optimization. In many applications of machine learning such as drug discovery and material design, the goal is to generate candidates which simultaneously optimize a set of potentially conflicting objectives. Moreover, these objectives are often imperfect evaluations of some underlying property of interest, mak… ▽ More

    Submitted 17 July, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 23 pages, 8 figures. ICML 2023. Code at: https://github.com/GFNOrg/multi-objective-gfn

  21. arXiv:2106.07611  [pdf

    cs.NE cs.AI

    Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision Quantization

    Authors: Santiago Miret, Vui Seng Chua, Mattias Marder, Mariano Phielipp, Nilesh Jain, Somdeb Majumdar

    Abstract: Mixed-precision quantization is a powerful tool to enable memory and compute savings of neural network workloads by deploying different sets of bit-width precisions on separate compute operations. In this work, we present a flexible and scalable framework for automated mixed-precision quantization that concurrently optimizes task performance, memory compression, and compute savings through multi-o… ▽ More

    Submitted 1 April, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

  22. arXiv:2010.03694  [pdf, other

    cs.LG cs.AI

    Learning Intrinsic Symbolic Rewards in Reinforcement Learning

    Authors: Hassam Sheikh, Shauharda Khadka, Santiago Miret, Somdeb Majumdar

    Abstract: Learning effective policies for sparse objectives is a key challenge in Deep Reinforcement Learning (RL). A common approach is to design task-related dense rewards to improve task learnability. While such rewards are easily interpreted, they rely on heuristics and domain expertise. Alternate approaches that train neural networks to discover dense surrogate rewards avoid heuristics, but are high-di… ▽ More

    Submitted 9 October, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

  23. arXiv:2010.02846  [pdf, other

    cs.LG cs.AI

    Safety Aware Reinforcement Learning (SARL)

    Authors: Santiago Miret, Somdeb Majumdar, Carroll Wainwright

    Abstract: As reinforcement learning agents become increasingly integrated into complex, real-world environments, designing for safety becomes a critical consideration. We specifically focus on researching scenarios where agents can cause undesired side effects while executing a policy on a primary task. Since one can define multiple tasks for a given environment dynamics, there are two important challenges.… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

  24. arXiv:2007.07298  [pdf, other

    cs.LG cs.AI

    Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning

    Authors: Shauharda Khadka, Estelle Aflalo, Mattias Marder, Avrech Ben-David, Santiago Miret, Shie Mannor, Tamir Hazan, Hanlin Tang, Somdeb Majumdar

    Abstract: For deep neural network accelerators, memory movement is both energetically expensive and can bound computation. Therefore, optimal mapping of tensors to memory hierarchies is critical to performance. The growing complexity of neural networks calls for automated memory mapping instead of manual heuristic approaches; yet the search space of neural network computational graphs have previously been p… ▽ More

    Submitted 15 October, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: Updated manuscript

  25. arXiv:1906.07315  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination

    Authors: Shauharda Khadka, Somdeb Majumdar, Santiago Miret, Stephen McAleer, Kagan Tumer

    Abstract: Many cooperative multiagent reinforcement learning environments provide agents with a sparse team-based reward, as well as a dense agent-specific reward that incentivizes learning basic skills. Training policies solely on the team-based reward is often difficult due to its sparsity. Furthermore, relying solely on the agent-specific reward is sub-optimal because it usually does not capture the team… ▽ More

    Submitted 11 June, 2020; v1 submitted 17 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, PMLR 108, 2020

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, PMLR 119, 2020

  26. arXiv:1905.00976  [pdf, other

    cs.LG cs.AI stat.ML

    Collaborative Evolutionary Reinforcement Learning

    Authors: Shauharda Khadka, Somdeb Majumdar, Tarek Nassar, Zach Dwiel, Evren Tumer, Santiago Miret, Yinyin Liu, Kagan Tumer

    Abstract: Deep reinforcement learning algorithms have been successfully applied to a range of challenging control tasks. However, these methods typically struggle with achieving effective exploration and are extremely sensitive to the choice of hyperparameters. One reason is that most approaches use a noisy version of their operating policy to explore - thereby limiting the range of exploration. In this pap… ▽ More

    Submitted 6 May, 2019; v1 submitted 2 May, 2019; originally announced May 2019.

    Comments: Added link to public Github repo. Minor editorial changes. Order of authors modified to reflect ICML submission

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019