netease-youdao/EmotiVoice
An open-source, multi-voice, and prompt-controlled text-to-speech engine capable of generating speech with diverse emotions in English and Chinese.
Core Features
Quick Start
docker run -dp 127.0.0.1:8501:8501 -p 127.0.0.1:8000:8000 syq163/emoti-voice:latest
Detailed Introduction
EmotiVoice is an open-source text-to-speech engine developed by NetEase Youdao. It offers more than 2,000 voices and can synthesize speech in a specified emotion, which is controlled through a prompt. It supports English and Chinese and exposes three interfaces: a web UI, a scripting API, and an OpenAI-compatible API. This makes it suitable for applications ranging from content creation to accessibility tools. Recent updates add voice cloning and a dedicated Mac app.
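As a rough illustration of what calling the OpenAI-compatible API might look like, the sketch below posts a request shaped like OpenAI's speech API to the Quick Start container. Several details are assumptions rather than facts from this page: that port 8000 is the API port, that the route is /v1/audio/speech, that the body uses the fields model, input, voice, and response_format, and that the model label "emoti-voice" is accepted. The helper names build_speech_request and synthesize are hypothetical.

```python
import json
import urllib.request

# Assumption: the Quick Start container is running and port 8000 is the
# OpenAI-compatible API (the page does not say which port serves what).
BASE_URL = "http://127.0.0.1:8000"


def build_speech_request(text, voice, base_url=BASE_URL):
    """Build, without sending, a POST request in the OpenAI speech-API shape."""
    payload = {
        "model": "emoti-voice",    # assumed model label
        "input": text,             # text to synthesize
        "voice": voice,            # speaker ID; valid values depend on the server
        "response_format": "mp3",  # assumed to be supported
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",  # assumed route
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, voice, out_path="speech.mp3"):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(text, voice)
    with urllib.request.urlopen(request, timeout=60) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

With the container running, a call such as synthesize("Hello, world.", voice="8051") would write speech.mp3 to the working directory; the speaker ID 8051 is a placeholder, so check the project's voice list for real values. Building the request separately from sending it lets the payload be inspected without a live server.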