Tags: #speech-recognition

AI Model Inference Serving Platform

9.3k

xorbitsai/inference

A unified, production-ready inference API for deploying and serving open-source language, speech, and multimodal AI models on various infrastructures.

llm inference model-serving

Replaces:

OpenAI API

Details

Open-source Large Language Model Framework

Hugging Face

8.3k

LianjiaTech/BELLE

BELLE is an open-source project dedicated to fostering the development of Chinese conversational large language models, aiming to make LLMs accessible to everyone.

llm chinese-nlp open-source

Replaces:

ChatGPT

Details

Speech AI Toolkit

PaddlePaddle

12.6k

PaddlePaddle/PaddleSpeech

An easy-to-use open-source toolkit built on PaddlePaddle, offering state-of-the-art models for diverse speech and audio tasks like ASR, TTS, translation, and speaker verification.

speech recognition text-to-speech speech translation

Details

AI Voice Synthesis WebUI

Python

57.1k

RVC-Boss/GPT-SoVITS

A powerful web-based tool for few-shot voice cloning and text-to-speech, enabling high-quality voice generation from minimal audio data.

text-to-speech voice-cloning few-shot-learning

Replaces:

Commercial Text-to-Speech Services Voice Cloning Software

Details

AI Multimedia Processing Web Application

Gradio

7.2k

abus-aikorea/voice-pro

An AI-powered web application for comprehensive multimedia content creation, offering speech recognition, voice cloning, text-to-speech, and multilingual translation.

ai voice speech recognition text-to-speech

Replaces:

ElevenLabs

Details