End-to-End Speech Processing Toolkit
-
Updated
Aug 1, 2024 - Python
End-to-End Speech Processing Toolkit
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
kaldi-asr/kaldi is the official location of the Kaldi project.
Tools for handling speech data in machine learning projects.
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Command line utility for forced alignment using Kaldi
Service for easy access to speech recognition capabilities of Kaldi using REST API. Simple deployment and usage in couple clicks with Docker containers. Currently supports Russian. Models for other languages may be easily added in case of need.
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
🙊 software for creating speech recognition models.
Speaker Verification using Pytorch
A Python wrapper for Kaldi
Parallelized video speech-to-text converter using ffmpeg and kaldi/vosk
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
A Speech to Text Personal Assist inspired by kaldi2 and joint-bert
Add a description, image, and links to the kaldi topic page so that developers can more easily learn about it.
To associate your repository with the kaldi topic, visit your repo's landing page and select "manage topics."