netease-youdao/EmotiVoice
EmotiVoice is an open-source, multi-voice, prompt-controlled text-to-speech engine that supports English and Chinese and can synthesize emotional speech.
Core Features
Quick Start
docker run -dp 127.0.0.1:8501:8501 -p 127.0.0.1:8000:8000 syq163/emoti-voice:latest
Detailed Introduction
EmotiVoice is an open-source text-to-speech engine designed to generate expressive, natural-sounding speech. It offers more than 2,000 voices across English and Chinese and supports prompt-controlled emotional synthesis, so users can give speech an emotion such as happiness, sadness, or anger. Beyond its core TTS capabilities, EmotiVoice provides a user-friendly web interface, a scripting API for batch processing, an OpenAI-compatible API, and a Mac application, which makes it usable from a browser, from scripts, and from existing OpenAI client code.
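As a rough illustration of the OpenAI-compatible API, the sketch below builds a speech request against the container started in Quick Start. Several details are assumptions rather than facts taken from this page: that the API is the service published on port 8000, that it follows the OpenAI speech route /v1/audio/speech with the standard model, input, and voice fields, that "emoti-voice" is an accepted model name, and that the response body is MP3 audio. The speaker ID "8051" is a hypothetical placeholder, and build_speech_request and synthesize are illustrative helper names.

```python
import json
import urllib.request


def build_speech_request(text, voice, base_url="http://127.0.0.1:8000"):
    """Build (but do not send) a POST request for speech synthesis.

    Assumption: the server mirrors the OpenAI speech endpoint, so the
    JSON body uses the fields "model", "input", and "voice".
    """
    payload = {
        "model": "emoti-voice",  # assumed model name
        "input": text,           # text to synthesize
        "voice": voice,          # speaker ID, e.g. the hypothetical "8051"
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",  # assumed route
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, voice, out_path="speech.mp3"):
    """Send the request and write the returned audio bytes to out_path.

    Assumption: the response body is raw MP3 audio.
    """
    request = build_speech_request(text, voice)
    with urllib.request.urlopen(request, timeout=60) as response:
        audio = response.read()
    with open(out_path, "wb") as audio_file:
        audio_file.write(audio)
    return out_path
```

With the container running, calling synthesize("Hello from EmotiVoice.", "8051") would write the returned audio to speech.mp3, provided the assumptions above hold. Keeping request construction separate from the network call makes the payload easy to inspect or adjust if the server expects different fields.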