OSS Alternative - Discover Top Open Source Alternatives to Popular Software

jianchang512/clone-voice

A user-friendly, open-source tool that clones any human voice to generate speech from text or convert existing audio, featuring a web interface and multi-language support.

Core Features

Clone any human voice for text-to-speech synthesis.

Convert existing audio to a cloned voice (speech-to-speech).

Intuitive web-based user interface for easy operation.

Supports 16 languages including Chinese, English, Japanese, Korean, French, German, Italian.

No dedicated GPU required for basic use, with CUDA acceleration for N-card users.

Quick Start

python app.py

Detailed Introduction

This project offers an accessible voice cloning solution built upon the Coqui XTTSv2 model, enabling users to synthesize text into speech or transform audio using any desired human voice. It provides a simple web interface, making advanced AI audio generation available to users without deep technical expertise or high-end hardware. Supporting a wide array of languages and offering both pre-compiled and source deployment options, it serves as a versatile tool for content creators, developers, and researchers exploring voice synthesis.