Voice Activity Detection

6 projects

Bindings for many languages · C++

A high-performance C/C++ port of OpenAI's Whisper model for efficient, cross-platform speech recognition.

#transformer #automatic-speech-recognition #openai

Stars: 52.2k

Forks: 5.9k

Last commit: 13 days ago

sherpa-onnx · C++

A next-generation Kaldi-based toolkit for offline speech-to-text, text-to-speech, and audio processing across 12 languages and diverse hardware.

#ios #embedded-systems #aarch64

Stars: 13.7k

Forks: 1.6k

Last commit: 20 hours ago

pyannote.audio · Jupyter Notebook

An open-source Python toolkit for speaker diarization with state-of-the-art pretrained models and pipelines.

#speaker-embedding #python-library #speech-activity-detection

Stars: 10.3k

Forks: 1.1k

Last commit: 3 days ago

whisper-diarization · Jupyter Notebook

A pipeline that combines OpenAI Whisper for speech-to-text with speaker diarization to identify who said what in audio.

#nvidia-nemo #automatic-speech-recognition #asr

Stars: 5.6k

Forks: 503

Last commit

whisper-timestamped · Python

A Python library that extends OpenAI's Whisper to provide accurate word-level timestamps and confidence scores for multilingual speech recognition.

#subtitle-generation #python-library #deep-learning

A Swift SDK for fully local, low-latency audio AI on Apple devices, including transcription, text-to-speech, voice activity detection, and speaker diarization.

#ios #apple-neural-engine #speaker-embedding

Stars: 2.5k

Forks: 356

Last commit: 23 hours ago
