High-Throughput Edge Inference for BERT Models via Neural Architecture Search and Pipeline
Published In
- General Chairs: Himanshu Thapliyal, Ronald DeMara
- Program Chairs: Inna Partin-Vaisband, Srinivas Katkoori
Publisher
Association for Computing Machinery
New York, NY, United States
Qualifiers
- Short-paper
Funding Sources
- Huawei Technologies Canada Inc.
- Fonds de Recherche du Québec - Nature et Technologies (FRQNT) Postdoctoral Research Scholarship