urumchi 的 Starred 仓库

A toolkit for speaker diarization.

用于说话人日志的工具包，识别音频中谁在何时说话

2026-01-22

audio-processing speaker-diarization ×toolkit

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

用于说话人日志的神经网络构建模块，包括语音活动、说话人变化和重叠语音检测

2026-01-13

speaker-diarization ×speech-activity-detection speaker-embedding

modelscope/FunASR 16,870

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

工业级语音识别工具包，支持50+语言、流式处理和OpenAI兼容API

2025-12-16

speech-recognition speaker-diarization ×emotion-detection

modelscope/3D-Speaker 2,966

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

单模态和多模态说话人验证、识别与日志记录仓库

2025-02-20

speaker-diarization ×speaker-verification speaker-recognition