speech-diarization
Here are 12 public repositories matching this topic...
UniSpeech - Large Scale Self-Supervised Learning for Speech
Updated Apr 5, 2024 - Python
The dataset of Speech Recognition
Updated Jan 4, 2026
This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).
Updated Mar 12, 2025 - Python
A voice conversion tool project for Vietnamese speakers
Updated Jun 15, 2026 - Python
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
Updated Jul 31, 2024 - Jupyter Notebook
Template Project For iOS Apps using .onnx Speech Models for Speech Diarization
Updated Sep 28, 2025 - C
A demo of speech diarization (separating the audio of different speakers) and converting it to text using the Google Cloud Speech API.
Updated Mar 30, 2019 - Jupyter Notebook
Vid2Manga is an innovative application designed to bridge the gap between video content and manga-style storytelling. By leveraging advanced video processing and speech-to-text technologies, Vid2Manga extracts audio and visual components from video files to create a foundation for manga generation.
Updated May 31, 2026 - Python
Speech transcription and speech diarization
Updated Mar 31, 2024 - Python
A powerful, local speech-to-text transcription system that combines OpenAI's Whisper for accurate transcription with pyannote.audio for speaker diarization (identifying who spoke when). Perfect for meetings, interviews, podcasts, and any audio/video content that needs accurate transcription with speaker identification.
Updated Aug 13, 2025 - Python
Updated May 7, 2026 - Python