vad

Star

Here are 88 public repositories matching this topic...

smacke / ffsubsync

Sponsor

Star

Automagically synchronize subtitles with video.

Updated Mar 18, 2024
Python

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Aug 1, 2024
Python

snakers4 / silero-vad

Star

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Jul 21, 2024
Python

CheshireCC / faster-whisper-GUI

Star

faster_whisper GUI with PySide6

openai vad whisper asr transcribe voice-transcription faster-whisper whisperx

Updated Jun 3, 2024
Python

jtkim-kaist / VAD

Star

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

amsehili / auditok

Star

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Mar 30, 2023
Python

filippogiruzzi / voice_activity_detection

Star

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Mar 24, 2023
Python

Baidu-AIP / speech-vad-demo

Star

集成Webrtc的VAD，用于切分音频文件

webrtc speech vad webrtc-vad

Updated Aug 26, 2020
C

gtreshchev / RuntimeAudioImporter

Star

Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.

Updated Jul 28, 2024
C++

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!