Research on Automatic Speech Recognition for dysarthric speech (Jupyter Notebook, updated Aug 1, 2024)
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
A production-first, production-ready end-to-end speech recognition toolkit.
On-device speech-to-text engine powered by deep learning
On-device streaming speech-to-text engine powered by deep learning
Text To Speech (TTS) and Automatic Speech Recognition (ASR).
Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper ... fast!
An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and inputs the recognized text; supports English, Chinese, Japanese, and more, including mixed-language input.
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Huggingface space:
⚡ TensorFlowASR: Almost state-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports any language that can be tokenized into characters or subwords.
A multilingual automatic speech recognition and video captioning tool using faster-whisper. Supports real-time translation to English and runs on consumer-grade CPUs.
A synthetic data augmentation technique via LLM for Automatic Speech Recognition fine-tuning.
🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
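For context on what such a WER package computes: word error rate is the word-level edit distance (substitutions + insertions + deletions) divided by the number of reference words. A minimal sketch in plain Python follows; it illustrates the metric itself and is not the API of the package above.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: word-level Levenshtein distance divided by
    the number of reference words (each substitution, insertion, or
    deletion costs 1)."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i                       # delete all reference words
    for j in range(len(hyp) + 1):
        dp[0][j] = j                       # insert all hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,         # deletion
                           dp[i][j - 1] + 1,         # insertion
                           dp[i - 1][j - 1] + cost)  # substitution / match
    return dp[len(ref)][len(hyp)] / len(ref)

print(wer("the quick brown fox", "the quick fox"))  # one deletion -> 0.25
```

A dedicated package adds vectorized computation and per-error-type breakdowns on top of this same recurrence.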
Al Ajwad is a graduation project submitted to the Department of Computers and Systems Engineering, Minia University, in partial fulfilment of a B.Sc. degree. It is an ASR model trained to recognize the Tajweed rules of Holy Quran recitation.
Transcribe audio and generate SRT and VTT files using Whisper models and wit.ai.
A modification on the Sharif Emotional Speech Database
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.