kaldi

Here are 201 public repositories matching this topic...

espnet / espnet

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Aug 1, 2024
Python

csukuangfj / kaldifeat

Kaldi-compatible online and offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd. Provides C++ and Python APIs.

python cpp pytorch kaldi mfcc plp features-extraction fbank online-feature-extractor streaming-feature-extractor

Updated Jul 31, 2024
C++

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

Updated Jul 31, 2024
Shell

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

audio python data machine-learning ai deep-learning speech pytorch speech-recognition kaldi

Updated Jul 26, 2024
Python

garvys-org / rustfst

Rust re-implementation of OpenFST, a library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Updated Jul 24, 2024
Rust

tue-robotics-graveyard / yapykaldi

Yet another PyKaldi

python python3 kaldi pybind11 wrappers

Updated Jul 22, 2024
Python

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Updated Jul 18, 2024
Jupyter Notebook

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

python kaldi pronunciation-dictionary forced-alignment grapheme-to-phone acoustic-model

Updated Jul 16, 2024
Python

alphacep / vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

python websocket webrtc grpc saas speech-recognition kaldi asr vosk

Updated Jul 5, 2024
Python

Service for easy access to the speech recognition capabilities of Kaldi through a REST API. Simple deployment and usage in a couple of clicks with Docker containers. Currently supports Russian. Models for other languages can easily be added if needed.

api docker-container rest-api speech-recognition kaldi asr kaldi-server kaldi-service