🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
-
Updated
Aug 2, 2024 - C++
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
FPGA Accelerator for CNN using Vivado HLS
A FPGA Based CNN accelerator, following Google's TPU V1.
同时支持传送TCP与UDP的KCP通道,附带端口跳跃的功能,以及FEC,自带中继服务器支持
A Modeling and Verification Platform for SoCs using ILAs
Advanced Matrix Extensions (AMX) Guide
An example of using Ramulator as memory model in a cycle-accurate SystemC Design
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
Generate an accelerator extension that makes your Antlr parser in Python super-fast!
ImpactX: an s-based beam dynamics code including space charge effects
NeuroSpector: Dataflow and Mapping Optimization of Deep Neural Network Accelerators
Tool to simulate beam dynamics in synchrotron light sources
NATSA is the first near-data-processing accelerator for time series analysis based on the Matrix Profile (SCRIMP) algorithm. NATSA exploits modern 3D-stacked High Bandwidth Memory (HBM) to enable efficient and fast matrix profile computation near memory. Described in ICCD 2020 by Fernandez et al. https://people.inf.ethz.ch/omutlu/pub/NATSA_time-…
Open Source Code for Advanced Radiation Simulation
simulating connection of micro processor and accelerator on a bus context with systemc language
NPUsim: Full-system, Cycle-accurate, Value-aware NPU Simulator
Out-of-the-box CHaiDNN implementation on Zynq ZCU104
C++ wrapper for the Nvidia C libraries (e.g. CUDA driver, nvrtc, cuFFT etc.)
Add a description, image, and links to the accelerator topic page so that developers can more easily learn about it.
To associate your repository with the accelerator topic, visit your repo's landing page and select "manage topics."