Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
无需Edge或Windows即可使用微软Edge在线文字转语音的Python库
共 1051 个仓库
备份、整理、重新发现你曾点赞过的每一个 GitHub 仓库。
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
无需Edge或Windows即可使用微软Edge在线文字转语音的Python库
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages
基于Kaldi和ONNX Runtime的离线语音识别、合成、说话人分离与VAD
这是一个部署在 EdgeOne Pages 上的高性能文本转语音(TTS)代理服务。它巧妙地将微软 Edge 强大且自然的语音合成服务,封装成了一个兼容 OpenAI API 格式的接口。这使得开发者可以无缝地将各种现有应用对接到这个免费、高质量的 TTS 服务上。
部署在EdgeOne Pages上的高性能TTS代理,将微软Edge语音合成封装为OpenAI API格式
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
易用的语音工具包,包含语音识别、语音合成、说话人验证和关键词检测
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
基于条件流匹配的快速语音合成架构,ICASSP 2024论文
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
多语言大模型语音生成,提供推理、训练和部署全栈能力
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
5秒克隆声音,实时生成任意语音