pyannote/pyannote-audio
10,038
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
用于说话人日志的神经网络构建模块,包括语音活动、说话人变化和重叠语音检测
modelscope/FunASR
16,870
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
工业级语音识别工具包,支持50+语言、流式处理和OpenAI兼容API
modelscope/3D-Speaker
2,966
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
单模态和多模态说话人验证、识别与日志记录仓库