Skip to main content

Showing 1–4 of 4 results for author: Thakkar, V

  1. arXiv:2407.08608  [pdf, other

    cs.LG cs.AI

    FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision

    Authors: Jay Shah, Ganesh Bikshandi, Ying Zhang, Vijay Thakkar, Pradeep Ramani, Tri Dao

    Abstract: Attention, as a core layer of the ubiquitous Transformer architecture, is the bottleneck for large language models and long-context applications. FlashAttention elaborated an approach to speed up attention on GPUs through minimizing memory reads/writes. However, it has yet to take advantage of new capabilities present in recent hardware, with FlashAttention-2 achieving only 35% utilization on the… ▽ More

    Submitted 12 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.01781  [pdf, other

    cs.CV cs.GR cs.LG

    fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence

    Authors: Francis Williams, Jiahui Huang, Jonathan Swartz, Gergely Klár, Vijay Thakkar, Matthew Cong, Xuanchi Ren, Ruilong Li, Clement Fuji-Tsang, Sanja Fidler, Eftychios Sifakis, Ken Museth

    Abstract: We present fVDB, a novel GPU-optimized framework for deep learning on large-scale 3D data. fVDB provides a complete set of differentiable primitives to build deep learning architectures for common tasks in 3D learning such as convolution, pooling, attention, ray-tracing, meshing, etc. fVDB simultaneously provides a much larger feature set (primitives and operators) than established frameworks wi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2202.02824  [pdf

    cs.CY cs.DB

    A Summary of COVID-19 Datasets

    Authors: Syed Raza Bashir, Shaina Raza, Vidhi Thakkar, Usman Naseem

    Abstract: This research presents a review of main datasets that are developed for COVID-19 research. We hope this collection will continue to bring together members of the computing community, biomedical experts, and policymakers in the pursuit of effective COVID-19 treatments and management policies. Many organizations, such as the World Health Organization (WHO), John Hopkins, National Institute of Health… ▽ More

    Submitted 27 July, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: Accepted in CAIML 2022: International Conference on Artificial Intelligence and Machine Learning

  4. arXiv:1806.09905  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Conditioning Deep Generative Raw Audio Models for Structured Automatic Music

    Authors: Rachel Manzelli, Vijay Thakkar, Ali Siahkamari, Brian Kulis

    Abstract: Existing automatic music generation approaches that feature deep learning can be broadly classified into two types: raw audio models and symbolic models. Symbolic models, which train and generate at the note level, are currently the more prevalent approach; these models can capture long-range dependencies of melodic structure, but fail to grasp the nuances and richness of raw audio generations. Ra… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: Presented at the ISMIR 2018 Conference