General Introduction
LLaMA-Factory is a unified, efficient fine-tuning framework that supports flexible customization and efficient training of more than 100 large language models (LLMs). Through the built-in LLaMA Board web interface, users can fine-tune models without writing any code. The framework integrates a variety of advanced training methods and practical tricks that significantly improve training speed and GPU memory efficiency.
Function List
- Multi-model support: Supports LLaMA, LLaVA, Mistral, Qwen, and many other language models.
- Multiple training methods: Includes full-parameter fine-tuning, freeze fine-tuning, LoRA, QLoRA, and more, selected through fields in the training YAML as sketched after this list.
- Efficient algorithms: Integrates GaLore, BAdam, Adam-mini, DoRA, and other advanced algorithms.
- Practical tricks: Supports FlashAttention-2, Unsloth, Liger Kernel, and more.
- Experiment monitoring: Provides monitoring tools such as LlamaBoard, TensorBoard, Wandb, MLflow, and more.
- Fast inference: Provides an OpenAI-style API, a Gradio UI, and a CLI interface.
- Dataset support: Download pre-trained models and datasets from HuggingFace, ModelScope, and other platforms.
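In practice, the training method and acceleration tricks are chosen via fields in the training YAML. A minimal, hedged sketch (the flag names finetuning_type, quantization_bit, and flash_attn follow the example configs shipped with the repo; exact names may differ between versions):

finetuning_type: lora    # or: full (full-parameter), freeze (freeze fine-tuning)
quantization_bit: 4      # 4-bit quantized base model, i.e. QLoRA-style training
flash_attn: fa2          # enable FlashAttention-2 kernels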
Usage Guide
Installation process
- Clone the project code:
git clone --depth 1 https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory
- Install the dependencies:
pip install -e ".[torch,metrics]"
Optional extras include: torch, torch-npu, metrics, deepspeed, liger-kernel, bitsandbytes, and more.
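To confirm the installation, invoke the CLI entry point directly; further extras are added the same way (the deepspeed extra below is just one example):

pip install -e ".[torch,metrics,deepspeed]"   # optionally include more extras
llamafactory-cli version                      # should print the installed version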
Data preparation
Please refer to data/README.md to learn more about the dataset file format. You can use datasets from the HuggingFace / ModelScope / Modelers hubs, or load datasets from local disk.
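As a hedged sketch of the local-dataset workflow (the field names below follow the Alpaca format described in data/README.md; treat that file as authoritative), a custom dataset is a JSON file such as data/my_dataset.json:

[
  {
    "instruction": "Summarize the following text.",
    "input": "LLaMA-Factory is a unified fine-tuning framework for LLMs.",
    "output": "A unified, efficient framework for fine-tuning LLMs."
  }
]

It is then registered with an entry in data/dataset_info.json:

"my_dataset": {
  "file_name": "my_dataset.json"
}

After that, it can be referenced as dataset: my_dataset in a training YAML.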
Quick Start
Use the following commands to run LoRA fine-tuning, inference, and adapter merging for the Llama3-8B-Instruct model:
llamafactory-cli train examples/train_lora/llama3_lora_sft.yaml
llamafactory-cli chat examples/inference/llama3_lora_sft.yaml
llamafactory-cli export examples/merge_lora/llama3_lora_sft.yaml
For more advanced usage, see examples/README.md.
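For orientation, the training YAML used above looks roughly like this (an abridged, hedged sketch of examples/train_lora/llama3_lora_sft.yaml; consult the file in the repo for the authoritative version):

model_name_or_path: meta-llama/Meta-Llama-3-8B-Instruct
stage: sft                        # supervised fine-tuning
do_train: true
finetuning_type: lora             # train LoRA adapters instead of all weights
dataset: identity,alpaca_en_demo
template: llama3                  # chat template matching the base model
output_dir: saves/llama3-8b/lora/sft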
Using the LLaMA Board GUI
Fine-tuning can also be performed through the LLaMA Board GUI, which is built on Gradio:
llamafactory-cli webui
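Common launch options (hedged: GRADIO_SHARE is read to create a public Gradio share link, and CUDA_VISIBLE_DEVICES is the standard CUDA device selector):

CUDA_VISIBLE_DEVICES=0 GRADIO_SHARE=1 llamafactory-cli webui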
Docker Deployment
For CUDA users:
cd docker/docker-cuda/
docker compose up -d
docker compose exec llamafactory bash
For Ascend NPU users:
cd docker/docker-npu/
docker compose up -d
docker compose exec llamafactory bash
For AMD ROCm users:
cd docker/docker-rocm/
docker compose up -d
docker compose exec llamafactory bash
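Once inside any of these containers, the same CLI workflow from Quick Start applies, for example:

docker compose exec llamafactory bash
llamafactory-cli train examples/train_lora/llama3_lora_sft.yaml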
API Deployment
Inference using OpenAI-style APIs and vLLM:
API_PORT=8000 llamafactory-cli api examples/inference/llama3_vllm.yaml
Since the API is OpenAI-style, the request and response schema follow the OpenAI chat completions API reference.
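A minimal smoke test of the deployed endpoint (hedged: the /v1/chat/completions path follows the OpenAI-compatible convention; the model field here is a placeholder):

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "llama3", "messages": [{"role": "user", "content": "Hello!"}]}'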
Download models and datasets
If you have trouble downloading models and datasets from Hugging Face, you can use ModelScope:
export USE_MODELSCOPE_HUB=1
Then train by specifying a ModelScope Hub model ID as model_name_or_path, for example LLM-Research/Meta-Llama-3-8B-Instruct.
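Putting both steps together (model_name_or_path is the field used throughout the example YAMLs):

export USE_MODELSCOPE_HUB=1
llamafactory-cli train examples/train_lora/llama3_lora_sft.yaml

with model_name_or_path in the YAML set to LLM-Research/Meta-Llama-3-8B-Instruct.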
Recording Experimental Results with W&B
To log experiment results with Weights & Biases, add the following parameter to the training YAML:

report_to: wandb

The target project and entity can then be set through W&B's standard environment variables:

export WANDB_PROJECT=your_project_name
export WANDB_ENTITY=your_entity_name
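Authenticate with W&B before launching training (standard wandb CLI usage):

wandb login                  # or: export WANDB_API_KEY=your_api_key
llamafactory-cli train examples/train_lora/llama3_lora_sft.yaml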