Tags : # machine-learning

Distributed AI Compute Engine

ai machine-learning distributed-computing

42.3k

ray-project/ray

Ray is a unified framework for scaling AI and Python applications from a laptop to a cluster, simplifying complex ML workloads with a distributed runtime and specialized libraries.

machine-learning deep-learning nlp

AI Framework

159.9k

huggingface/transformers

A comprehensive library providing state-of-the-art pre-trained models for various machine learning tasks across text, vision, audio, and multimodal domains, facilitating both inference and training.

Machine Learning Library

transformers

21.0k

huggingface/peft

A state-of-the-art library for Parameter-Efficient Fine-Tuning (PEFT) of large pretrained models, drastically reducing computational and storage costs.

peft fine-tuning llm

generative-ai google-cloud vertex-ai

Generative AI Development Kit

Google Cloud

16.7k

GoogleCloudPlatform/generative-ai

Provides sample code, notebooks, and resources for building and managing generative AI workflows on Google Cloud using Vertex AI and Gemini.

AI Engineering Platform

25.8k

mlflow/mlflow

An open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing production-quality AI applications, including agents, LLMs, and ML models.

ai engineering llm mlops

AI/ML Fine-tuning Framework

llm fine-tuning machine-learning

70.8k

hiyouga/LlamaFactory

A unified and efficient framework for fine-tuning over 100 large language models (LLMs) and vision-language models (VLMs) with both CLI and Web UI.

Time Series AI Platform

time series forecasting anomaly detection

3.9k

Nixtla/nixtla

A production-ready, pre-trained time series foundation model (TimeGPT) for accurate forecasting and anomaly detection across various domains with minimal code.

research machine-learning artificial-intelligence

Research Code Repository

37.8k

google-research/google-research

A comprehensive repository housing open-source code and datasets officially released by Google Research.

MLOps Framework

mlops machine-learning ai

10.1k

Netflix/metaflow

A human-centric Python framework for building, managing, and deploying real-life AI/ML systems from rapid prototyping to reliable production.

NLP Library

nlp python machine-learning

33.5k

explosion/spaCy

An industrial-strength Python library for advanced Natural Language Processing, offering state-of-the-art models and a production-ready training system.

AI/MLOps Platform

machine-learning mlops experiment-tracking

11.0k

wandb/wandb

An AI developer platform for tracking, visualizing, and managing machine learning models from experimentation to production.

Visualization Tool

neural-network deep-learning machine-learning

32.8k

lutzroeder/netron

A universal viewer for neural network, deep learning, and machine learning models, supporting a wide array of formats.

data versioning mlops experiment tracking

MLOps CLI Tool

git

15.6k

treeverse/dvc

DVC (Data Version Control) is a command-line tool and VS Code extension for managing data, models, and ML experiments, enabling reproducible machine learning projects.

AI/ML Interoperability Standard

machine learning deep learning ai models

20.7k

onnx/onnx

An open standard and format for machine learning models, enabling interoperability across different AI frameworks and hardware.

Educational Course

ai agents llm machine learning

28.2k

huggingface/agents-course

A comprehensive, free online course from Hugging Face designed to teach the fundamentals and advanced techniques of building AI agents.

generative-ai ai-agents tutorials

AI Development Resource

21.6k

NirDiamant/GenAI_Agents

A comprehensive repository offering over 50 tutorials and implementations for building Generative AI agents, from basic conversational bots to complex multi-agent systems.

Technical Tutorial Repository

27.0k

NirDiamant/RAG_Techniques

A repository showcasing various advanced Retrieval-Augmented Generation (RAG) techniques through detailed notebook tutorials.

rag llm generative-ai

ai-agents ai-research machine-learning

AI Research Automation Library

Node.js

7.4k

Orchestra-Research/AI-Research-SKILLs

A comprehensive open-source library providing AI agents with the skills to autonomously conduct the entire AI research lifecycle, from ideation to paper writing.

kubernetes ai inference generative ai

AI Inference Platform

kubernetes

5.4k

kserve/kserve

A standardized, scalable, multi-framework platform for deploying generative and predictive AI models on Kubernetes.

vector database semantic search rag

Vector Database

Docker

16.1k

weaviate/weaviate

An open-source, cloud-native vector database enabling semantic search at scale by combining vector similarity with structured filtering and AI capabilities.

vector database vector search ai

Vector Database & Search Engine

Docker

30.7k

qdrant/qdrant

A high-performance, massive-scale vector database and search engine designed for next-generation AI applications.

kubernetes workflow-engine orchestration

Workflow Orchestration Engine

kubernetes

16.6k

argoproj/argo-workflows

An open-source, container-native workflow engine for Kubernetes, designed to orchestrate parallel jobs and complex multi-step tasks.

data-labeling annotation machine-learning

Data Labeling & Annotation Tool

Docker

27.1k

HumanSignal/label-studio

An open-source, multi-type data labeling and annotation tool with a simple UI and standardized output, designed to prepare and improve data for machine learning models.

awesome-list open-source ai

Curated List / Awesome List

2.0k

alvinunreal/awesome-opensource-ai

A meticulously curated list of battle-tested, production-proven open-source AI projects, models, tools, and infrastructure.

machine-learning mlops kubernetes

ML Workflow Orchestration Platform

Kubernetes

4.1k

kubeflow/pipelines

An open-source platform for building, deploying, and managing end-to-end machine learning workflows on Kubernetes.

mlops machine-learning production

Curated List of MLOps Libraries

20.5k

EthicalML/awesome-production-machine-learning

A comprehensive curated list of open-source libraries for deploying, monitoring, versioning, and scaling machine learning models in production.

amazon sagemaker machine learning jupyter notebooks

Machine Learning Example Repository

aws

10.9k

aws/amazon-sagemaker-examples

A collection of Jupyter notebooks demonstrating how to build, train, and deploy machine learning models using Amazon SageMaker and its new Python SDK, SageMaker-Core.

mlops machine-learning deep-learning

MLOps Platform

Kubernetes

3.7k

polyaxon/polyaxon

A comprehensive MLOps platform for managing, orchestrating, and scaling the machine learning lifecycle with reproducibility and automation.

Machine Learning Feature Store

feature store machine learning mlops

7.0k

feast-dev/feast

An open-source feature store for AI/ML that streamlines the management and serving of features for model training and online inference.

kubernetes ai-training llm-finetuning

Distributed AI Training Platform

Kubernetes

2.1k

kubeflow/trainer

A Kubernetes-native platform for scalable distributed AI model training and LLM fine-tuning across various frameworks.

mlops machine-learning tools

Curated List / Resource Collection

5.1k

kelvins/awesome-mlops

A comprehensive and categorized collection of awesome MLOps tools and resources, designed to help practitioners navigate the complex MLOps ecosystem.

AI/ML Development Tool

machine learning automl llm

2.6k

plexe-ai/plexe

Build machine learning models from natural language prompts using an AI-powered multi-agent system.

MLOps Learning Platform

mlops machine-learning ml-engineering

47.5k

GokuMohandas/Made-With-ML

A comprehensive educational platform teaching developers how to design, develop, deploy, and iterate on production-grade machine learning applications.

AI Agent Training Framework

ai-agents reinforcement-learning agent-training

17.0k

microsoft/agent-lightning

A versatile framework designed to train and optimize AI agents from any framework with minimal code changes, leveraging advanced algorithms like Reinforcement Learning.

AI Data Management Platform

ai machine-learning data-lake

9.1k

activeloopai/deeplake

Deep Lake is an AI data runtime and database optimized for deep learning, offering serverless multimodal data storage, scalable retrieval, and training capabilities.

AI/ML Deep Learning Framework

low-code llm deep-learning

11.7k

ludwig-ai/ludwig

A low-code, declarative framework for building and deploying custom large language models (LLMs) and other deep neural networks with ease and efficiency.

awesome-list generative-ai machine-learning

Awesome List / Resource Collection

11.9k

steven2358/awesome-generative-ai

A comprehensive, curated list of modern Generative Artificial Intelligence projects, services, and learning resources.

Educational Curriculum

ai machine-learning education

46.7k

microsoft/AI-For-Beginners

A 12-week, 24-lesson curriculum from Microsoft to learn Artificial Intelligence for beginners, including practical lessons, quizzes, and labs.

Curated List

5.8k

tensorchord/Awesome-LLMOps

A comprehensive and curated list of the best LLMOps tools, resources, and frameworks for developers working with large language models and other AI models.

llmops awesome-list ai

Machine Learning Toolkit for Recommendation Systems

recommendation-systems machine-learning deep-learning

21.7k

recommenders-team/recommenders

A comprehensive toolkit providing best practices, examples, and state-of-the-art algorithms to assist in prototyping, experimenting with, and operationalizing recommendation systems.

Machine Learning Data Library

ai machine learning datasets

21.5k

huggingface/datasets

A lightweight library providing one-line dataloaders and efficient pre-processing tools for a vast hub of AI datasets, supporting various ML frameworks.

pinecone vector-database jupyter-notebooks

Learning Resources & Code Examples

jupyter notebooks

3.0k

pinecone-io/examples

A comprehensive collection of Jupyter Notebooks and sample applications designed to help developers master Pinecone vector databases and common AI patterns through hands-on practice.

Python Library for Multimodal AI Data

multimodal-data machine-learning data-structure

3.1k

docarray/docarray

A Python library for representing, transmitting, storing, and retrieving multimodal data, designed for AI applications.

AI/ML Toolkit

langchain

2.5k

athina-ai/rag-cookbooks

A comprehensive repository offering practical implementations and evaluation guidance for advanced and agentic Retrieval-Augmented Generation (RAG) techniques.

rag llm ai

LLM Training Framework

2.2k

AI-Hypercomputer/maxtext

A high-performance, scalable JAX-based open-source library for training large language models on Google Cloud TPUs and GPUs.

jax llm deep-learning

diffusion models fine-tuning machine learning

AI/ML Training Platform

DeepSpeed

2.8k

bghira/SimpleTuner

A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.

AI Fine-tuning Tool

multimodal-ai fine-tuning vlm

2.7k

roboflow/maestro

A streamlined tool to accelerate the fine-tuning of popular multimodal models like Florence-2, PaliGemma 2, and Qwen2.5-VL.

LLM Fine-tuning Toolkit

huggingface-transformers

2.9k

ashishpatel26/LLM-Finetuning

A collection of guides and code for efficiently fine-tuning large language models using PEFT (LoRA) and Hugging Face transformers.

llm finetuning peft

llm fine-tuning apple silicon mlx

Machine Learning Library

Apple Silicon

1.2k

ARahim3/mlx-tune

Enables efficient fine-tuning of LLMs, Vision, Audio, and OCR models on Apple Silicon Macs with an Unsloth-compatible API.

Educational Resource / Deep Learning Implementations

deep-learning pytorch machine-learning

66.5k

labmlai/annotated_deep_learning_paper_implementations

A comprehensive collection of PyTorch implementations for over 60 deep learning papers, featuring side-by-side annotated notes for enhanced understanding.

stable-diffusion google-colab ai-art-generation

Cloud-based AI Art Generation Utility

Google Colab

15.9k

camenduru/stable-diffusion-webui-colab

Provides Google Colab notebooks for easily deploying and running Stable Diffusion WebUI, enabling AI-powered image generation and training without local hardware.

Replaces:

Midjourney DALL-E...

AI/ML Training Tool

lora dreambooth stable-diffusion

6.0k

Akegarasu/lora-scripts

A comprehensive GUI and script collection for training LoRA and Dreambooth models for Stable Diffusion, built upon kohya-ss's sd-scripts.

Educational Resource / Technical Textbook

rlhf machine-learning llm

1.8k

natolambert/rlhf-book

A comprehensive open-source textbook and code repository dedicated to Reinforcement Learning from Human Feedback (RLHF) and post-training language models.

rlhf awesome-list machine-learning

Awesome List / Research Resource Collection

4.4k

opendilab/awesome-RLHF

A continually updated, curated list of essential resources for Reinforcement Learning with Human Feedback (RLHF), encompassing research papers, codebases, and datasets.

AI/ML Research Framework

Hugging Face

1.6k

PKU-Alignment/safe-rlhf

A modular open-source framework for training constrained value-aligned Large Language Models (LLMs) using Safe Reinforcement Learning from Human Feedback (RLHF).

llm rlhf safety

Machine Learning Research Toolkit

rlhf reward modeling large language models

1.5k

RLHFlow/RLHF-Reward-Modeling

A comprehensive collection of recipes and code for training various reward models crucial for Reinforcement Learning from Human Feedback (RLHF) in large language models.

AI Data Curation Platform

data-quality data-curation ai

3.5k

Docta-ai/docta

Docta is an advanced data-centric AI platform that detects and rectifies issues in various data types to improve model performance.

Open-Source Financial Large Language Model Project

financial-llm open-source-ai fintech

19.9k

AI4Finance-Foundation/FinGPT

FinGPT democratizes access to large language models tailored for finance, offering cost-effective and rapidly adaptable solutions to overcome the limitations of proprietary financial AI.

Replaces:

BloombergGPT

comfyui video-generation ai-model

ComfyUI Custom Nodes / AI Video Generation Plugin

comfyui

3.5k

Lightricks/ComfyUI-LTXVideo

Extends ComfyUI with advanced custom nodes for the LTX-2 video generation model, enabling powerful text-to-video and image-to-video workflows.

diffusion models generative ai machine learning

Research Paper Collection & Taxonomy

3.3k

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

A comprehensive and continuously updated collection and categorization of research papers on diffusion models.

Text-to-Image Generation Library

3.5k

kuprel/min-dalle

A fast, minimal PyTorch port of DALL·E Mini for efficient text-to-image generation.

pytorch text-to-image ai

Speech Synthesis Library

Google Cloud Text-to-Speech Amazon Polly...

5.9k

snakers4/silero-models

A collection of pre-trained, end-to-end text-to-speech models designed for simplicity, speed, and natural-sounding speech across multiple languages.

text-to-speech tts ai

Replaces:

speech recognition text-to-speech speech translation

Speech AI Toolkit

PaddlePaddle

12.6k

PaddlePaddle/PaddleSpeech

An easy-to-use open-source toolkit built on PaddlePaddle, offering state-of-the-art models for diverse speech and audio tasks like ASR, TTS, translation, and speaker verification.

AI Voice Conversion Software

voice conversion ai audio processing

3.2k

IAHispano/Applio

A user-friendly, high-quality AI-powered tool for transforming voices with a focus on performance and customization.

generative ai machine learning learning roadmap

Educational Resource Hub

2.3k

genieincodebottle/generative-ai

A comprehensive repository offering structured learning paths, practical projects, and career preparation resources for Generative AI and Machine Learning.

Benchmarking and Evaluation Framework

3.2k

embeddings-benchmark/mteb

MTEB is a comprehensive benchmark and evaluation framework designed to assess the performance of text embedding models and retrieval systems across a wide range of tasks.

embeddings benchmark nlp

Machine Learning Library

anomaly-detection python machine-learning

9.8k

yzhao062/pyod

A comprehensive Python library for multi-modal anomaly detection, featuring 60+ algorithms and agentic AI capabilities for scalable, expert-level investigations.

Semantic Search System Toolkit

clip semantic-search embeddings

2.8k

rom1504/clip-retrieval

A comprehensive toolkit for computing CLIP embeddings and building scalable semantic search and retrieval systems for multimodal data.

mlops ai orchestration machine learning

MLOps Platform

Nuclio

1.7k

mlrun/mlrun

An open-source MLOps and AI orchestration platform for building, managing, and automating continuous machine learning and generative AI applications across their entire lifecycle.

postgres machine learning ai

AI/ML Database Extension

Postgres

6.8k

postgresml/postgresml

PostgresML integrates machine learning and AI capabilities, including LLMs and vector search, directly into PostgreSQL, leveraging GPUs for high-performance in-database inference.

Educational Resource Collection

5.4k

ashishps1/learn-ai-engineering

A comprehensive, curated collection of free resources for learning AI, Machine Learning, LLMs, and AI Engineering from scratch.

ai machine-learning llm

AI/ML Model & Speech Synthesis Library

6.2k

yl4579/StyleTTS2

StyleTTS 2 is a cutting-edge text-to-speech model achieving human-level speech synthesis through style diffusion and adversarial training with large speech language models.

text-to-speech tts ai

Deep Learning Library

text-to-speech deep-learning speech-synthesis

10.1k

mozilla/TTS

A deep learning library for advanced, high-quality, and efficient Text-to-Speech (TTS) synthesis, supporting multiple languages and models.

image dataset data processing machine learning

Data Processing CLI Tool

4.4k

rom1504/img2dataset

A highly efficient command-line tool to download, resize, and package large sets of image URLs into machine learning datasets.

generative-ai stable-diffusion tutorials

Educational Resource Hub

2.7k

FurkanGozukara/Stable-Diffusion

A comprehensive repository offering expert-level tutorials, guides, and courses on Generative AI, focusing on Stable Diffusion, SDXL, LoRA, DreamBooth, and related technologies.

AI Video Generation Tool

stable-diffusion ai-video-generation text-to-video

4.7k

nateraw/stable-diffusion-videos

Create dynamic and visually captivating videos by smoothly morphing between different text prompts using Stable Diffusion.

stable diffusion dreambooth colab

AI Image Generation & Training Platform

Google Colab

7.9k

TheLastBen/fast-stable-diffusion

Provides fast, cloud-based (Google Colab) notebooks for Stable Diffusion, ComfyUI, AUTOMATIC1111, and DreamBooth.

data science awesome list learning resources

Curated Resource List

28.8k

academic/awesome-datascience

A comprehensive, open-source repository of Data Science learning resources and tools for real-world problem-solving.

ai machine learning research

Research Paper Curation

12.3k

dair-ai/AI-Papers-of-the-Week

A weekly curated repository highlighting the most impactful machine learning and AI research papers.

Machine Learning Library

python machine-learning time-series

3.1k

tslearn-team/tslearn

A comprehensive Python toolkit for machine learning tasks specifically tailored for time series analysis.

awesome-list open-source ai

Curated List / Awesome List

3.5k

alvinreal/awesome-opensource-ai

A meticulously curated list of battle-tested, production-proven open-source AI projects, models, tools, and infrastructure.

ai engineering machine learning deep learning

AI Engineering Learning Platform

5.1k

rohitg00/ai-engineering-from-scratch

A comprehensive, AI-native learning platform to master AI engineering from foundational math to autonomous agent swarms, building and shipping real tools.

AutoML Platform

automl low-code machine-learning

9.8k

pycaret/pycaret

An open-source, low-code AutoML platform for Python, offering a sklearn-native engine and a React-based control plane for end-to-end machine learning workflows.

Online Course

mlops machine learning education

14.6k

DataTalksClub/mlops-zoomcamp

A free 9-week online course from DataTalks.Club, designed to teach the fundamentals of MLOps, from experimentation to deployment and monitoring.

ai machine-learning deep-learning

Educational Resource Repository

6.4k

SkalskiP/courses

A meticulously curated collection of links to free courses and resources covering various Artificial Intelligence (AI) topics, suitable for all learning levels.

ai avatar-generation web-app

AI Avatar Generator Web Application

Node.js

3.9k

premieroctet/photoshot

An open-source web application that leverages AI to generate personalized avatars from user-provided images.

google cloud bigquery data analytics

Cloud Solutions & Tools Repository

google cloud platform

3.0k

GoogleCloudPlatform/professional-services

A repository of common solutions and tools developed by Google Cloud's Professional Services team to address various challenges on Google Cloud Platform.