Tags: #text-to-speech

Frontend UI Component
javascript
3.6k

OvidijusParsiunas/deep-chat

A highly customizable AI chatbot component designed for easy integration into any website or UI framework.

AI SDK / On-device AI Toolkit
LlamaCPP
10.3k

RunanywhereAI/runanywhere-sdks

A production-ready toolkit enabling developers to integrate private, offline, and fast on-device AI capabilities like LLMs, speech-to-text, and text-to-speech into their applications across various platforms.

Desktop Application
Docker
19.2k

jamiepine/voicebox

The open-source, local-first voice synthesis studio for voice cloning, speech generation, and audio effects.

AI Text-to-Speech System
Python
20.6k

FunAudioLLM/CosyVoice

CosyVoice is an advanced multi-lingual large language model-based text-to-speech system offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.

Voice AI Toolkit
7.8k

moonshine-ai/moonshine

An open-source, on-device AI toolkit for real-time, low-latency speech-to-text, intent recognition, and text-to-speech across multiple platforms.

Audiobook Generation Tool
Docker
18.7k

DrewThomasson/ebook2audiobook

A powerful tool to convert e-books into audiobooks with advanced text-to-speech, voice cloning, and extensive language support.

AI/ML Model Library
pytorch
5.9k

snakers4/silero-models

Silero Models offers a collection of pre-trained, end-to-end text-to-speech models designed for simplicity, speed, and natural-sounding speech generation.

Unified AI Audio Platform
Docker
3.1k

rsxdalv/TTS-WebUI

A single web interface integrating numerous state-of-the-art open-source models for text-to-speech, audio generation, and voice conversion.

Text-to-Speech Library and CLI Tool
python
10.6k

rany2/edge-tts

A Python module and CLI tool to access Microsoft Edge's online text-to-speech service without an API key, Edge browser, or Windows.

AI-powered Text-to-Speech System
20.1k

index-tts/index-tts

IndexTTS2 is an industrial-level, zero-shot text-to-speech system offering precise duration control and disentangled emotional expression for highly natural and controllable speech synthesis.

Deep Learning Framework / AI Tool
Python
59.6k

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time using a three-stage deep learning framework.

Content Creation Tool
Python
4.3k

denizsafak/abogen

Generate high-quality audiobooks and voiceovers from various text formats with synchronized captions.

CLI Tool & Desktop Application
Python
6.0k

santinic/audiblez

Generate high-quality audiobooks in .m4b format from .epub e-books using advanced text-to-speech technology, with both command-line and graphical interfaces.

AI Voice Synthesis Web Application
Python
56.8k

RVC-Boss/GPT-SoVITS

A powerful open-source web UI for few-shot voice conversion and text-to-speech, enabling high-quality voice cloning with minimal audio data.

Text-to-Speech System
4.6k

WhisperSpeech/WhisperSpeech

An open-source, high-performance text-to-speech system built on Whisper, aiming to be a hackable and commercially safe alternative for speech generation.

AI/ML Audio Processing Library
python
6.7k

Blaizzy/mlx-audio

A high-performance library built on Apple's MLX framework, offering efficient text-to-speech, speech-to-text, and speech-to-speech capabilities optimized for Apple Silicon.

AI Speech and Sound Generation Framework
llama.cpp
1.5k

OpenMOSS/MOSS-TTS

An open-source AI model family for high-fidelity, expressive speech and sound generation across diverse real-world applications.

AI/ML Library & SDK
Python
1.4k

edwko/OuteTTS

A versatile interface for OuteTTS models, providing flexible text-to-speech generation capabilities across various AI inference backends and hardware platforms.

AI Chat Client
Git
1.6k

yakGPT/yakGPT

A locally running, hands-free ChatGPT UI that enhances text generation and chat engagement with speech-to-text and text-to-speech capabilities.

Replaces:
Details
Desktop Utility Software
Electron
6.1k

LokerL/tts-vue

A desktop application providing a user-friendly interface for Microsoft's speech synthesis capabilities.

AI Speech Synthesis System
python
6.1k

canopyai/Orpheus-TTS

A state-of-the-art open-source text-to-speech system leveraging LLMs to generate human-like, emotional, and low-latency speech with zero-shot voice cloning capabilities.

Local Web Interface for Text-to-Speech
Python
7.5k

jianchang512/ChatTTS-ui

Provides a local web interface and API for the ChatTTS model, enabling text-to-speech synthesis with support for mixed languages and numbers.

AI Voice Cloning and Synthesis Tool
python
9.0k

jianchang512/clone-voice

A user-friendly web-based tool for voice cloning, text-to-speech, and speech-to-speech conversion, leveraging the Coqui XTTS_v2 model with multi-language support.

Android Utility Application
android
4.3k

jing332/tts-server-android

An advanced Android Text-to-Speech (TTS) application offering Microsoft TTS integration, custom HTTP requests, local engine support, and intelligent dialogue recognition.

Text-to-Speech System
10.8k

rhasspy/piper

A fast, local, neural text-to-speech system for efficient and private voice generation.

AI Voice Synthesis Library
36.2k

myshell-ai/OpenVoice

An AI voice synthesis library offering instant, accurate, and flexible voice cloning with multi-lingual support.

Audio Synthesis Framework
Python
4.8k

MoonInTheRiver/DiffSinger

DiffSinger is an official PyTorch implementation of a singing voice synthesis (SVS) and text-to-speech (TTS) system, leveraging a shallow diffusion mechanism for high-quality audio generation.

Text-to-Speech Library
python
7.3k

myshell-ai/MeloTTS

A high-quality, multi-lingual text-to-speech library supporting real-time CPU inference across various languages and accents.

Deep Learning Library
python
45.1k

coqui-ai/TTS

A deep learning toolkit for Text-to-Speech, offering pretrained models, training tools, and dataset utilities.

AI-powered Text-to-Speech Engine
Docker
8.5k

netease-youdao/EmotiVoice

An open-source, multi-voice, and prompt-controlled text-to-speech engine capable of generating speech with diverse emotions in English and Chinese.

AI/ML Model, Speech Synthesis Library
python
6.2k

yl4579/StyleTTS2

StyleTTS 2 is a text-to-speech model that achieves human-level speech synthesis by leveraging style diffusion and adversarial training with large speech language models.

AI/ML Speech Synthesis Framework
tensorflow
4.0k

TensorSpeech/TensorFlowTTS

TensorFlowTTS provides real-time, state-of-the-art speech synthesis architectures based on TensorFlow 2, supporting multiple languages and optimized for fast inference and deployment on various devices.

Speech Synthesis Library
python
7.9k

jaywalnut310/vits

VITS is an end-to-end text-to-speech model that generates highly natural-sounding audio with diverse rhythms, outperforming traditional two-stage TTS systems.

Speech Synthesis Library
pytorch
10.1k

mozilla/TTS

A deep learning library for advanced Text-to-Speech generation, offering high-quality speech synthesis with pretrained models and multi-language support.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.