Tags : # fine-tuning

AI/ML Fine-tuning and Deployment Framework

13.9k

modelscope/ms-swift

A scalable and lightweight infrastructure for fine-tuning, inference, and deployment of over 1000 large language models (LLMs) and multimodal large language models (MLLMs) using advanced techniques.

llm mllm fine-tuning

Machine Learning Library

transformers

21.0k

huggingface/peft

A state-of-the-art library for Parameter-Efficient Fine-Tuning (PEFT) of large pretrained models, drastically reducing computational and storage costs.

peft fine-tuning llm

AI/ML Fine-tuning Framework

llm fine-tuning machine-learning

70.8k

hiyouga/LlamaFactory

A unified and efficient framework for fine-tuning over 100 large language models (LLMs) and vision-language models (VLMs) with both CLI and Web UI.

llm fine-tuning deep-learning

LLM Fine-tuning Framework

11.8k

axolotl-ai-cloud/axolotl

A free and open-source framework designed for efficient fine-tuning of large language models.

Low-code AI Application Development Platform

lightllm

3.8k

LazyAGI/LazyLLM

LazyLLM simplifies the creation and iterative optimization of multi-agent large language model (LLM) applications with a low-code approach.

multi-agent llm low-code

ai development llm ops evaluation

AI Development Platform

Ollama

4.8k

Kiln-AI/Kiln

A free, all-in-one platform for building, evaluating, and optimizing AI systems, offering tools for RAG, agents, fine-tuning, and synthetic data generation.

Educational Course / MLOps Tutorial

4.3k

decodingai-magazine/llm-twin-course

A free, hands-on course to build a production-ready LLM & RAG system, including a personalized AI replica, applying LLMOps best practices.

llm rag llmops

diffusion models fine-tuning machine learning

AI/ML Training Platform

DeepSpeed

2.8k

bghira/SimpleTuner

A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.

Diffusion Model Training Suite

diffusion models ai training fine-tuning

3.0k

Nerogar/OneTrainer

A comprehensive, one-stop solution for training various diffusion models with advanced features and a user-friendly interface.

LLM Dataset Generation and Evaluation Platform

14.1k

Easy Dataset is a powerful application for creating high-quality datasets for LLM fine-tuning, RAG, and model evaluation, featuring intelligent document processing and a comprehensive evaluation system.

llm fine-tuning rag

AI Development Platform

Docker

4.9k

h2oai/h2o-llmstudio

A no-code GUI and framework for easily fine-tuning state-of-the-art large language models (LLMs).

llm fine-tuning no-code

AI Fine-tuning Tool

multimodal-ai fine-tuning vlm

2.7k

roboflow/maestro

A streamlined tool to accelerate the fine-tuning of popular multimodal models like Florence-2, PaliGemma 2, and Qwen2.5-VL.

diffusion models text-to-image fine-tuning

AI/ML Model Fine-tuning Tool

conda

2.0k

adobe-research/custom-diffusion

Enables fast and efficient multi-concept customization of text-to-image diffusion models like Stable Diffusion using a few images.

text-to-sql llm fine-tuning

LLM Fine-tuning Hub

2.0k

eosphoros-ai/DB-GPT-Hub

A specialized hub providing models, datasets, and fine-tuning techniques to enhance Large Language Models' performance in Text-to-SQL, Text-to-NLU, and Text-to-GQL tasks.

Large Language Model (LLM) Training Framework

6.6k

yangjianxin1/Firefly

Firefly is an open-source, all-in-one tool designed for efficient pre-training, instruction fine-tuning, and DPO of a wide range of mainstream large language models, optimized for resource-constrained environments.

llm fine-tuning qlora

huggingface transformers nlp

Educational Resource / Course Code Repository

pytorch

3.9k

zyds/transformers-code

A comprehensive code repository accompanying a hands-on course for mastering Huggingface Transformers, covering fundamental concepts to advanced fine-tuning and deployment techniques.

LLM Fine-tuning Framework

3.7k

hiyouga/ChatGLM-Efficient-Tuning

An efficient framework for fine-tuning ChatGLM-6B and ChatGLM2-6B models using PEFT methods, including LoRA, QLoRA, and RLHF, with a Web UI.

llm fine-tuning peft

Educational Tutorial

linux

30.2k

datawhalechina/self-llm

A comprehensive Linux-based guide for beginners to quickly fine-tune and deploy open-source LLMs and MLLMs, tailored for Chinese learners.

llm mllm fine-tuning

Large Language Model (LLM) Project

18.9k

ymcui/Chinese-LLaMA-Alpaca

An open-source project providing Chinese LLaMA and instruction-tuned Alpaca large language models, optimized for Chinese NLP and local deployment on CPU/GPU.

llm chinese nlp llama

Machine Learning Library

13.5k

microsoft/LoRA

A Python library implementing LoRA (Low-Rank Adaptation) to efficiently fine-tune large language models by significantly reducing trainable parameters and storage requirements.

lora llm fine-tuning

llm fine-tuning long-context

AI Research Project

huggingface

2.7k

JIA-Lab-research/LongLoRA

LongLoRA is an efficient fine-tuning method and associated models/datasets designed to extend the context window of Large Language Models (LLMs) for processing longer inputs.

lora diffusion models fine-tuning

AI/ML Fine-tuning Tool

diffusers

7.5k

cloneofsimo/lora

Enables rapid and efficient fine-tuning of diffusion models, particularly Stable Diffusion, using Low-rank Adaptation (LoRA) to generate high-quality, custom images with significantly smaller model sizes.

Large Language Model (LLM)

ChatGPT API Commercial LLM APIs

2.5k

wenge-research/YAYI

YaYi is an open-source Chinese Large Language Model, built on LLaMA 2 & BLOOM, designed for secure, reliable, and domain-specific applications through extensive instruction tuning.

llm chinese-nlp llama2

Replaces:

llm alignment fine-tuning rlhf

LLM Alignment Toolkit

DeepSpeed

5.6k

huggingface/alignment-handbook

Provides robust training recipes and scripts to align large language models with human and AI preferences, enhancing helpfulness and safety.

AI/ML Training Framework

multi-modal llm alignment

4.7k

PKU-Alignment/align-anything

A modular framework for aligning any-modality large models with human intentions and values using various fine-tuning and reinforcement learning methods.

chinese-llm llama-2 alpaca-2

Large Language Model (LLM) Development Kit

huggingface-transformers

7.2k

ymcui/Chinese-LLaMA-Alpaca-2

An open-source project providing Chinese LLaMA-2 and Alpaca-2 large language models with expanded Chinese vocabulary, enhanced capabilities, and support for ultra-long contexts up to 64K.

AI Model Fine-tuning Tool

dreambooth stable-diffusion fine-tuning

7.7k

XavierXiao/Dreambooth-Stable-Diffusion

This project implements Google's Dreambooth technique on Stable Diffusion, enabling users to fine-tune a text-to-image model with a few custom examples for personalized image generation.

AI Model Security Tool

llm fingerprinting ai-security

3.5k

sentient-agi/OML-1.0-Fingerprinting

A framework for embedding secret cryptographic fingerprints into Large Language Models (LLMs) via fine-tuning to verify ownership and prevent unauthorized use.

AI/NLP Model Framework

3.1k

dbiir/UER-py

An open-source PyTorch-based framework for NLP pre-training and fine-tuning, offering modularity, reproducibility, and a comprehensive model zoo for various downstream tasks.

nlp pre-training pytorch

large vision-language model reinforcement learning fine-tuning

AI Model Framework

6.0k

om-ai-lab/VLM-R1

VLM-R1 is a stable and generalizable R1-style Large Vision-Language Model that leverages reinforcement learning to significantly improve visual understanding tasks.

AI Model Fine-tuning Library

stable-diffusion lora fine-tuning

2.5k

KohakuBlueleaf/LyCORIS

LyCORIS is a library implementing various parameter-efficient fine-tuning (PEFT) algorithms for Stable Diffusion, extending beyond conventional LoRA methods to enhance model adaptation.