Tags: #speech-processing
Deep Learning Toolkit / Multimodal LLM Framework
linux
1.0k
X-LANCE/SLAM-LLM
A deep learning toolkit for training custom multimodal large language models for speech, language, audio, and music processing.
Speech Processing Toolkit
paddlepaddle
12.6k
PaddlePaddle/PaddleSpeech
An open-source, easy-to-use speech toolkit built on PaddlePaddle, offering state-of-the-art models for various speech and audio tasks.
AI Research Hub
22.1k
microsoft/unilm
A Microsoft research initiative on large-scale self-supervised pre-training, aimed at building foundation models that work across tasks, languages, and modalities.