Speech Processing Toolkit
12.6k 2026-04-18
PaddlePaddle/PaddleSpeech
An open-source, easy-to-use speech toolkit built on PaddlePaddle, offering state-of-the-art models for various speech and audio tasks.
Core Features
Self-Supervised Learning models
State-of-the-art/Streaming Automatic Speech Recognition (ASR)
Streaming Text-to-Speech (TTS) with text frontend
Speaker Verification System
End-to-End Speech Translation
Detailed Introduction
PaddleSpeech is a comprehensive open-source toolkit developed on the PaddlePaddle platform, designed for a wide array of critical speech and audio tasks. It integrates state-of-the-art and influential models, making advanced speech technology accessible. The project covers functionalities from speech recognition and synthesis to translation and speaker verification, and was recognized with the NAACL2022 Best Demo Award for its innovative contributions.