urumchi 的 Starred 仓库

devcxl/fcitx5-voice-input 4

Fcitx5 voice input addon — PulseAudio/PipeWire capture, Silero VAD, OpenAI-compatible ASR

Fcitx5语音输入插件，支持音频捕获、语音活动检测和兼容OpenAI的语音识别

C++ 2026-06-29

voice-input asr ×fcitx5

zai-org/GLM-ASR 807

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

鲁棒的开源语音识别模型，含15亿参数

2026-01-15

deep-learning speech-recognition asr ×

FunAudioLLM/Fun-ASR 1,202

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

支持31种语言、方言、歌词、热词、时间戳和说话人日志的端到端语音识别大模型。

2025-12-16

speech-recognition asr ×diarization

FunAudioLLM/SenseVoice 8,411

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

多语言语音理解，支持ASR、情感识别和音频事件检测，速度比Whisper快15倍

2025-02-19

speech-recognition asr ×emotion-recognition