Natural Language Processing Library
33.4k 2026-03-26
explosion/spaCy
An industrial-strength Python library for advanced Natural Language Processing, designed for building real-world applications.
Core Features
Pretrained pipelines supporting over 70 languages.
State-of-the-art neural network models for tasks like named entity recognition, text classification, tagging, and parsing.
Multi-task learning with pretrained transformers such as BERT.
Production-ready training system and easy model packaging and deployment.
Detailed Introduction
spaCy is a high-performance, open-source library for advanced Natural Language Processing in Python and Cython. Built on the latest research, it provides state-of-the-art neural network models for various NLP tasks, including named entity recognition, text classification, and dependency parsing, supporting over 70 languages with pretrained pipelines. Designed for production use, spaCy integrates with transformers like BERT and offers a robust training system for building and deploying sophisticated NLP applications efficiently.