Conversational AI Framework
11.6k 2026-04-26
pipecat-ai/pipecat
An open-source Python framework for building real-time, voice-first, and multimodal conversational AI agents with composable pipelines.
Core Features
Voice-first integration (speech recognition, text-to-speech)
Pluggable architecture for various AI services and tools
Composable pipelines for complex conversational behaviors
Real-time, ultra-low latency interaction with different transports
Support for multimodal interfaces (voice, video, images)
Quick Start
pipecat init quickstartDetailed Introduction
Pipecat is a powerful open-source Python framework designed for developing real-time voice and multimodal conversational AI agents. It simplifies the orchestration of audio, video, AI services, and various communication transports, allowing developers to focus on the unique aspects of their agents. With its voice-first approach, pluggable architecture, and composable pipelines, Pipecat enables the creation of natural, low-latency interactions for applications ranging from voice assistants and AI companions to complex business agents and interactive storytelling.