Tags: #nlp

AI Framework

ai-framework semantic-search llm-orchestration

12.4k

neuml/txtai

An all-in-one AI framework for semantic search, LLM orchestration, and language model workflows, powered by an embeddings database.

NLP Library

nlp python machine-learning

33.5k

explosion/spaCy

An industrial-strength Python library for advanced Natural Language Processing, offering state-of-the-art models and a production-ready training system.

AI/ML Framework

32.5k

microsoft/graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system designed to extract structured data from unstructured text using LLMs to enhance reasoning on private data.

rag llm knowledge-graph

AI Data Labeling & Collaboration Platform

4.9k

argilla-io/argilla

Argilla is an open-source collaboration tool for AI engineers and domain experts to build and manage high-quality datasets for various AI models, leveraging human feedback and programmatic workflows.

ai ml data-labeling

Technical Tutorial Repository

27.0k

NirDiamant/RAG_Techniques

A repository showcasing various advanced Retrieval-Augmented Generation (RAG) techniques through detailed notebook tutorials.

rag llm generative-ai

Python Library for Information Extraction

python llm information-extraction

36.2k

google/langextract

A Python library leveraging LLMs to extract structured information from unstructured text with precise source grounding and interactive visualization.

Machine Learning Data Library

ai machine-learning datasets

21.5k

huggingface/datasets

A lightweight library providing one-line dataloaders and efficient pre-processing tools for a vast hub of AI datasets, supporting various ML frameworks.

AI/ML Toolkit

langchain

2.5k

athina-ai/rag-cookbooks

A comprehensive repository offering practical implementations and evaluation guidance for advanced and agentic Retrieval-Augmented Generation (RAG) techniques.

rag llm ai

text-to-sql llm fine-tuning

LLM Fine-tuning Hub

2.0k

eosphoros-ai/DB-GPT-Hub

A specialized hub providing models, datasets, and fine-tuning techniques to enhance Large Language Models' performance in Text-to-SQL, Text-to-NLU, and Text-to-GQL tasks.

multimodal-llm deep-learning speech-processing

Deep Learning Toolkit / Multimodal LLM Framework

linux

1.0k

X-LANCE/SLAM-LLM

A deep learning toolkit for training custom multimodal large language models focused on speech, language, audio, and music processing.

huggingface transformers nlp

Educational Resource / Course Code Repository

pytorch

3.9k

zyds/transformers-code

A comprehensive code repository accompanying a hands-on course for mastering Hugging Face Transformers, covering fundamental concepts through advanced fine-tuning and deployment techniques.

Large Language Model Finetuning Solution

3.8k

mymusise/ChatGLM-Tuning

A cost-effective solution for finetuning ChatGLM-6B with LoRA, enabling personalized large language models.

llm finetuning lora

Replaces:

ChatGPT

Deep Learning Library Extension

nlp deep-learning transfer-learning

2.8k

adapter-hub/adapters

A unified library extending Hugging Face Transformers for parameter-efficient and modular transfer learning in NLP.

llm nlp prompt-engineering

LLM Orchestration Library / NLP Framework

Python 3.9+

4.6k

promptslab/Promptify

A Python library for structured NLP tasks using LLMs, offering Pydantic outputs, multi-provider support, and built-in evaluation.

Benchmarking and Evaluation Framework

3.2k

embeddings-benchmark/mteb

MTEB is a comprehensive benchmark and evaluation framework designed to assess the performance of text embedding models and retrieval systems across a wide range of tasks.

embeddings benchmark nlp

AI/NLP Model Framework

3.1k

dbiir/UER-py

An open-source PyTorch-based framework for NLP pre-training and fine-tuning, offering modularity, reproducibility, and a comprehensive model zoo for various downstream tasks.

nlp pre-training pytorch

Large Language Model Finetuning Toolkit

DeepSpeed

2.8k

liucongg/ChatGLM-Finetuning

A toolkit for finetuning ChatGLM series models (ChatGLM-6B, ChatGLM2-6B, ChatGLM3-6B) using methods such as Freeze, LoRA, P-tuning, and full-parameter training for downstream NLP tasks.

chatglm finetuning llm

text-to-image generative-ai computer-vision

Resource Collection / Awesome List

2.4k

Yutong-Zhou-cv/Awesome-Text-to-Image

A comprehensive curated list of resources, papers, datasets, and projects related to text-to-image generation and manipulation.

foundation-models self-supervised-learning multimodal-ai

Foundation Model Research Hub

22.1k

microsoft/unilm

A comprehensive research hub for large-scale self-supervised pre-training of foundation models across diverse tasks, languages, and modalities.

AI Model / Research Project

2.5k

X-PLUG/mPLUG-Owl

A family of powerful multi-modal large language models (MLLMs) designed to advance AI's understanding and generation capabilities across various data types.

multi-modal llm ai

multimodal llm open-source

Multimodal AI Model Suite

HuggingFace

10.0k

OpenGVLab/InternVL

An open-source multimodal large language model family that aims to match or exceed the performance of commercial models such as GPT-4o and GPT-5.

Replaces:

GPT-4o GPT-5

AI Video Editing Agent

2.0k

FireRedTeam/FireRed-OpenStoryline

FireRed-OpenStoryline is an AI video editing agent that transforms manual editing into intention-driven directing through natural language interaction and LLM-powered planning.

ai video-editing llm

Replaces:

Adobe Premiere Pro DaVinci Resolve...

Large Language Model System / AI Infrastructure

large-language-model nlp aigc

4.1k

IDEA-CCNL/Fengshenbang-LM

Fengshenbang-LM is an open-source large model system by IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.