OSS Alternative - Discover Top Open Source Alternatives to Popular Software

netease-youdao/EmotiVoice

EmotiVoice is an open-source, multi-voice, and prompt-controlled text-to-speech engine supporting English and Chinese with emotional synthesis capabilities.

Core Features

Over 2000 distinct voices with emotional synthesis (happy, sad, angry).

Supports both English and Chinese language synthesis.

Provides an easy-to-use web interface and scripting for batch generation.

Offers an OpenAI-compatible TTS API and a dedicated Mac application.

Includes voice cloning capabilities using personal data.

Quick Start

docker run -dp 127.0.0.1:8501:8501 -p 127.0.0.1:8000:8000 syq163/emoti-voice:latest

Detailed Introduction

EmotiVoice is a powerful open-source text-to-speech engine designed to generate highly expressive and natural-sounding speech. It stands out with its ability to synthesize speech in over 2000 voices across English and Chinese, crucially offering prompt-controlled emotional synthesis. This allows users to infuse speech with a wide range of emotions like happiness, sadness, or anger. Beyond its core TTS capabilities, EmotiVoice provides a user-friendly web interface, a scripting API for batch processing, an OpenAI-compatible API, and even a Mac application, making advanced voice generation accessible for various applications.