Tags: #speech-to-text
WEIFENG2333/VideoCaptioner
An AI-powered tool for end-to-end video subtitling: speech recognition, subtitle optimization, translation, and video synthesis.
OvidijusParsiunas/deep-chat
A highly customizable AI chatbot component designed for easy integration into any website or UI framework.
RunanywhereAI/runanywhere-sdks
A production-ready toolkit for adding private, offline, fast on-device AI capabilities such as LLMs, speech-to-text, and text-to-speech to applications across multiple platforms.
buxuku/SmartSub
A cross-platform desktop tool for batch video/audio subtitle generation and multi-service translation, supporting offline processing and hardware acceleration.
moonshine-ai/moonshine
An open-source, on-device AI toolkit for real-time, low-latency speech-to-text, intent recognition, and text-to-speech across multiple platforms.
Blaizzy/mlx-audio
A high-performance library built on Apple's MLX framework, offering efficient text-to-speech, speech-to-text, and speech-to-speech capabilities optimized for Apple Silicon.
collabora/WhisperLive
A real-time transcription application leveraging OpenAI's Whisper model for converting live or pre-recorded speech into text with optimized backends.
yakGPT/yakGPT
A locally running, hands-free ChatGPT UI that adds speech-to-text and text-to-speech to chat.