urumchi 的 Starred 仓库

当前筛选： speech-recognition ×text-to-speech × 清除筛选

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

基于Kaldi和ONNX Runtime的离线语音识别、合成、说话人分离与VAD

2026-02-11

text-to-speech ×speech-recognition ×onnx

PaddlePaddle/PaddleSpeech 12,610

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

易用的语音工具包，包含语音识别、语音合成、说话人验证和关键词检测

2025-03-28

text-to-speech ×speech-recognition ×speaker-verification