Tags: #audio-processing

Deep Learning Library / Machine Learning Framework
159.3k

huggingface/transformers

A unified framework providing state-of-the-art machine learning models for text, vision, audio, and multimodal tasks, optimized for both inference and training.

Deep Learning Toolkit / Multimodal LLM Framework
linux
1.0k

X-LANCE/SLAM-LLM

A deep learning toolkit for training custom multimodal large language models focused on speech, language, audio, and music processing.

AI Voice Conversion Tool
Python
3.2k

IAHispano/Applio

Applio is a powerful, user-friendly, and high-performance open-source tool for high-quality voice transformation.

AI/ML Audio Processing Library
python
6.7k

Blaizzy/mlx-audio

A high-performance library built on Apple's MLX framework, offering efficient text-to-speech, speech-to-text, and speech-to-speech capabilities optimized for Apple Silicon.

Real-time Speech-to-Text Application
Python
4.0k

collabora/WhisperLive

A real-time transcription application leveraging OpenAI's Whisper model for converting live or pre-recorded speech into text with optimized backends.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.