Tags: #voice-ai
pipecat-ai/pipecat
An open-source Python framework for building real-time, voice-first, and multimodal conversational AI agents with composable pipelines.
TEN-framework/ten-framework
An open-source framework for building real-time, multimodal conversational AI agents with advanced features like voice assistance, diarization, and lip-sync.
livekit/agents
A framework for building realtime, programmable voice AI agents that can see, hear, and understand.
xinnan-tech/xiaozhi-esp32-server
A backend service for ESP32 devices, facilitating the rapid deployment of smart device control servers with integrated AI capabilities.
moonshine-ai/moonshine
An open-source, on-device AI toolkit for real-time, low-latency speech-to-text, intent recognition, and text-to-speech across multiple platforms.
fikrikarim/parlor
Parlor is an on-device, real-time multimodal AI that enables natural voice and vision conversations, running entirely on your local machine.