AI Framework
11.2k 2026-04-13
pipecat-ai/pipecat
An open-source Python framework for building real-time, voice-first, and multimodal conversational AI agents with ultra-low latency.
Core Features
Voice-first architecture with integrated speech recognition and text-to-speech.
Pluggable design supporting diverse AI services and tools.
Composable pipelines for orchestrating complex conversational flows.
Real-time, ultra-low-latency interaction over transports such as WebSockets and WebRTC.
Support for multimodal interfaces including voice, video, and images.
Quick Start
pipecat init quickstart
Detailed Introduction
Pipecat is an open-source Python framework for building real-time voice and multimodal conversational AI agents. It orchestrates audio, video, AI services, and communication transports so that developers can build natural, streaming conversations. Its pluggable architecture and composable pipelines support applications ranging from voice assistants and AI companions to complex business agents and interactive storytelling tools, all with a voice-first design and a focus on ultra-low latency.
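To make the idea of a composable, streaming pipeline concrete, here is a minimal sketch in plain Python asyncio. It does not use Pipecat's API: every name in it (source, fake_stt, fake_llm, fake_tts, compose, run_pipeline) is hypothetical, and the stages are stand-ins for real speech recognition, language model, and text-to-speech services. It only illustrates the general pattern of chaining pluggable stages so that each frame flows through as soon as it is available.

```python
import asyncio
from typing import AsyncIterator, Callable, Iterable, List

# Hypothetical type: a stage turns one stream of frames into another.
Stage = Callable[[AsyncIterator[str]], AsyncIterator[str]]


async def source(chunks: Iterable[str]) -> AsyncIterator[str]:
    """Stand-in for a transport: emits input frames one at a time."""
    for chunk in chunks:
        await asyncio.sleep(0)  # yield control, as real I/O would
        yield chunk


async def fake_stt(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    """Stand-in for speech recognition: normalizes each frame to text."""
    async for frame in frames:
        yield frame.strip().lower()


async def fake_llm(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    """Stand-in for a language model: produces a reply per utterance."""
    async for text in frames:
        yield f"you said: {text}"


async def fake_tts(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    """Stand-in for text-to-speech: wraps the reply as an audio frame."""
    async for text in frames:
        yield f"<audio:{text}>"


def compose(*stages: Stage) -> Stage:
    """Chain stages so the output stream of one feeds the next."""
    def pipeline(frames: AsyncIterator[str]) -> AsyncIterator[str]:
        for stage in stages:
            frames = stage(frames)
        return frames
    return pipeline


async def run_pipeline(chunks: Iterable[str]) -> List[str]:
    """Run the composed pipeline and collect its output frames."""
    pipeline = compose(fake_stt, fake_llm, fake_tts)
    return [out async for out in pipeline(source(chunks))]


if __name__ == "__main__":
    print(asyncio.run(run_pipeline(["  Hello ", "WORLD"])))
```

Because each stage is an async generator, swapping one service for another only means replacing a single function in the compose call, and no stage waits for the whole input before it starts producing output.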