
Fine-tuning Qwen2.5-VL-3B

News/Updates

  • 2025/08/13
    • Added LoRA (Low-Rank Adaptation) fine-tuning support for more efficient training.
    • Added comprehensive comparison between full fine-tuning and LoRA approaches.
    • Updated all dependencies to the latest versions; the new versions are listed in requirements.txt.
  • 2025/02/08
    • First version of the fine-tuning code is released.

Introduction

In the five months since Qwen2-VL's release, numerous developers have built new models on top of the Qwen2-VL vision-language models, providing valuable feedback. During this period, the Qwen team focused on building more useful vision-language models, and they have now introduced the latest addition to the Qwen family: Qwen2.5-VL.

I personally prefer simple and transparent code, so I wrote a fine-tuning code script for Qwen2.5-VL, hoping to help anyone who likes to write their own training loops.
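
The training loop in the repo follows the standard forward / loss / backward / update pattern. As a toy illustration of that structure only (the real script trains Qwen2.5-VL with PyTorch; the numpy linear-regression model below is just a stand-in), a hand-written loop looks like:

```python
import numpy as np

# Toy stand-in for a hand-written training loop: a linear model trained
# with plain gradient descent. Only the loop structure (forward pass,
# loss, gradient, parameter update) mirrors the real fine-tuning script.
rng = np.random.default_rng(0)
X = rng.normal(size=(64, 3))              # one "batch" of inputs
true_w = np.array([1.5, -2.0, 0.5])
y = X @ true_w                            # noise-free targets, for clarity

w = np.zeros(3)                           # model parameters
lr = 0.1
for step in range(200):
    pred = X @ w                          # forward pass
    loss = np.mean((pred - y) ** 2)       # MSE loss
    grad = 2 * X.T @ (pred - y) / len(X)  # gradient of loss w.r.t. w
    w -= lr * grad                        # SGD update

print(loss)  # loss is near zero after training
```

The real loop differs mainly in scale: the forward pass and loss come from the model, gradients from autograd, and updates from an optimizer, but the skeleton is the same.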

I have a WeChat subscription account, "Backpropagation", where I occasionally publish technical articles, including one about this project ( https://mp.weixin.qq.com/s/mN9Pxpd2Wciw1-IAoFc08A ); you are welcome to follow it.

Quick Start for Fine-tuning or Continued Pre-training of the Qwen2.5-VL 3B Model


%git clone https://github.com/zhangfaen/finetune-Qwen2.5-VL
%cd finetune-Qwen2.5-VL
%conda create --name qwen-vl-2.5 python=3.13
%conda activate qwen-vl-2.5
%pip install -r requirements.txt

Note:

# Running "%pip install -r requirements.txt" installs the "deepspeed" package, which needs the nvcc tool.
# Below is my environment configuration:
%export LD_LIBRARY_PATH=:/usr/local/cuda/lib64
%export CUDA_HOME=/usr/local/cuda
%export PATH=$PATH:/usr/local/cuda/bin

%which nvcc
/usr/local/cuda/bin/nvcc

%nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2024 NVIDIA Corporation
Built on Thu_Mar_28_02:18:24_PDT_2024
Cuda compilation tools, release 12.4, V12.4.131
Build cuda_12.4.r12.4/compiler.34097967_0

You can run the following command to begin:

./finetune_distributed_without_LoRA.sh # Set the CUDA_VISIBLE_DEVICES variable in this file to the GPUs you want to use

If you want to fine-tune with LoRA instead, run:

./finetune_distributed_with_LoRA.sh
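
For background, LoRA freezes the pretrained weight matrices and trains only a pair of small low-rank factors per layer. A numpy sketch of the idea (the shapes below are toy values, not actual Qwen2.5-VL layer sizes):

```python
import numpy as np

# LoRA in a nutshell: keep the pretrained weight W frozen and learn two
# low-rank factors B (d x r) and A (r x k); the effective weight is
# W_eff = W + (alpha / r) * B @ A. Shapes here are illustrative only.
d, k, r, alpha = 2048, 2048, 8, 16

W = np.zeros((d, k))                          # frozen pretrained weight
A = np.random.default_rng(0).normal(size=(r, k)) * 0.01
B = np.zeros((d, r))                          # B starts at zero, so W_eff == W at step 0

W_eff = W + (alpha / r) * B @ A

full_params = d * k                           # parameters updated by full fine-tuning
lora_params = d * r + r * k                   # parameters updated by LoRA
print(full_params, lora_params)               # 4194304 vs 32768, ~128x fewer
```

This parameter reduction is where LoRA's memory savings come from: only A and B need optimizer states and gradients. In practice the repo applies this via a LoRA library rather than by hand.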

Benchmark Comparison with LoRA

We provide benchmarks comparing the full fine-tuning and LoRA approaches to help you choose the best method for your use case.

You can run both approaches and compare their results:

# Compare results using our benchmark script
./Benchmark_comparison_with_LoRA.sh

Test the Fine-tuned Model


%export CUDA_VISIBLE_DEVICES="4"
%python test_on_trained_model_by_us.py # Test our fine-tuned or retrained Qwen2.5-VL 3B model

Note: The test_on_trained_model_by_us.py file defines model_dir. If you have fine-tuned multiple models, modify this file to point model_dir at the checkpoint you want to test.

The test_on_trained_model_by_us.py script generates descriptions for the two pictures under test_data/.
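
For reference, Qwen2.5-VL's processor consumes a chat-message structure that pairs an image entry with a text prompt. A minimal sketch (the image path is hypothetical; substitute a real file from test_data/):

```python
# Sketch of the chat-message structure expected by the Qwen2.5-VL
# processor's apply_chat_template. The image path is a hypothetical
# placeholder, not a file that ships with the repo.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "image": "test_data/example.jpg"},  # hypothetical path
            {"type": "text", "text": "Describe this picture."},
        ],
    }
]

# With transformers installed, inference then looks roughly like:
#   processor = AutoProcessor.from_pretrained(model_dir)
#   text = processor.apply_chat_template(messages, tokenize=False,
#                                        add_generation_prompt=True)
#   ...feed text plus the image to the model's generate().
```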

Overall, the fine-tuned model's general performance does not appear to be significantly degraded. The following picture shows a log excerpt from the fine-tuning process.

It can be seen that the training loss is decreasing, indicating that the model has converged during the training process.

LoRA Fine-tuning Results

The following image shows the loss history during LoRA fine-tuning, demonstrating training with reduced parameter updates:

LoRA fine-tuning provides a more memory-efficient alternative to full fine-tuning while maintaining good performance. The loss curve shows stable convergence during training.
