Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
TS-MODELS-OPT
training
Autonomous-Driving-models
Commits
5ed76316
Commit
5ed76316
authored
Apr 08, 2026
by
雍大凯
Browse files
models add
parent
b2379236
Changes
290
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
957 additions
and
0 deletions
+957
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_kto.yaml
...vl/llama-factory/examples/train_lora/llama3_lora_kto.yaml
+44
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_ppo.yaml
...vl/llama-factory/examples/train_lora/llama3_lora_ppo.yaml
+43
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_pretrain.yaml
...ama-factory/examples/train_lora/llama3_lora_pretrain.yaml
+45
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_reward.yaml
...llama-factory/examples/train_lora/llama3_lora_reward.yaml
+46
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft.sh
...5-vl/llama-factory/examples/train_lora/llama3_lora_sft.sh
+36
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft.yaml
...vl/llama-factory/examples/train_lora/llama3_lora_sft.yaml
+46
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft_ds3.yaml
...lama-factory/examples/train_lora/llama3_lora_sft_ds3.yaml
+47
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft_ray.yaml
...lama-factory/examples/train_lora/llama3_lora_sft_ray.yaml
+61
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_preprocess.yaml
.../llama-factory/examples/train_lora/llama3_preprocess.yaml
+23
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama4_lora_sft_ds3.yaml
...lama-factory/examples/train_lora/llama4_lora_sft_ds3.yaml
+49
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/qwen2_5vl_lora_dpo.yaml
...llama-factory/examples/train_lora/qwen2_5vl_lora_dpo.yaml
+49
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/qwen2_5vl_lora_sft.yaml
...llama-factory/examples/train_lora/qwen2_5vl_lora_sft.yaml
+47
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_aqlm.yaml
...ma-factory/examples/train_qlora/llama3_lora_sft_aqlm.yaml
+44
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_awq.yaml
...ama-factory/examples/train_qlora/llama3_lora_sft_awq.yaml
+44
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_bnb_npu.yaml
...factory/examples/train_qlora/llama3_lora_sft_bnb_npu.yaml
+47
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_gptq.yaml
...ma-factory/examples/train_qlora/llama3_lora_sft_gptq.yaml
+44
-0
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_otfq.yaml
...ma-factory/examples/train_qlora/llama3_lora_sft_otfq.yaml
+46
-0
docker-hub/qwen2.5-vl/llama-factory/pyproject.toml
docker-hub/qwen2.5-vl/llama-factory/pyproject.toml
+95
-0
docker-hub/qwen2.5-vl/llama-factory/requirements.txt
docker-hub/qwen2.5-vl/llama-factory/requirements.txt
+36
-0
docker-hub/qwen2.5-vl/llama-factory/scripts/api_example/test_image.py
...wen2.5-vl/llama-factory/scripts/api_example/test_image.py
+65
-0
No files found.
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_kto.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
trust_remote_code
:
true
### method
stage
:
kto
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
pref_beta
:
0.1
### dataset
dataset
:
kto_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/kto
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
5.0e-6
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_ppo.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
reward_model
:
saves/llama3-8b/lora/reward
trust_remote_code
:
true
### method
stage
:
ppo
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/ppo
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-5
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
### generate
max_new_tokens
:
512
top_k
:
0
top_p
:
0.9
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_pretrain.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
trust_remote_code
:
true
### method
stage
:
pt
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
c4_demo
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/pretrain
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# eval_dataset: c4_demo
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_reward.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
trust_remote_code
:
true
### method
stage
:
rm
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
dpo_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/reward
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# eval_dataset: dpo_en_demo
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft.sh
0 → 100644
View file @
5ed76316
#!/bin/bash
set
-x
MODEL_PATH
=
meta-llama/Meta-Llama-3-8B-Instruct
llamafactory-cli train
\
--model_name_or_path
${
MODEL_PATH
}
\
--trust_remote_code
\
--stage
sft
\
--do_train
\
--finetuning_type
lora
\
--lora_rank
8
\
--lora_target
all
\
--dataset
identity,alpaca_en_demo
\
--template
llama3
\
--cutoff_len
2048
\
--max_samples
1000
\
--overwrite_cache
\
--preprocessing_num_workers
16
\
--dataloader_num_workers
4
\
--output_dir
saves/llama3-8b/lora/sft
\
--logging_steps
10
\
--save_steps
500
\
--plot_loss
\
--overwrite_output_dir
\
--save_only_model
false
\
--report_to
none
\
--per_device_train_batch_size
1
\
--gradient_accumulation_steps
8
\
--learning_rate
1e-4
\
--num_train_epochs
3.0
\
--lr_scheduler_type
cosine
\
--warmup_ratio
0.1
\
--bf16
\
--ddp_timeout
180000000
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# eval_dataset: alpaca_en_demo
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft_ds3.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
deepspeed
:
examples/deepspeed/ds_z3_config.json
# choices: [ds_z0_config.json, ds_z2_config.json, ds_z3_config.json]
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
2
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# eval_dataset: alpaca_en_demo
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_lora_sft_ray.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
# or use local absolute path
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
dataset_dir
:
REMOTE:llamafactory/demo_data
# or use local absolute path
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
tmp_dir
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### ray
ray_run_name
:
llama3_8b_sft_lora
ray_storage_path
:
./saves
ray_num_workers
:
4
# Number of GPUs to use.
placement_strategy
:
PACK
resources_per_worker
:
GPU
:
1
# ray_init_kwargs:
# runtime_env:
# env_vars:
# <YOUR-ENV-VAR-HERE>: "<YOUR-ENV-VAR-HERE>"
# pip:
# - emoji
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# eval_dataset: alpaca_en_demo
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama3_preprocess.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
tokenized_path
:
saves/llama3-8b/dataset/sft
### output
output_dir
:
saves/llama3-8b/lora/sft
overwrite_output_dir
:
true
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/llama4_lora_sft_ds3.yaml
0 → 100644
View file @
5ed76316
# pip install git+https://github.com/hiyouga/transformers.git@llama4_train
### model
model_name_or_path
:
meta-llama/Llama-4-Scout-17B-16E-Instruct
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
deepspeed
:
examples/deepspeed/ds_z3_config.json
# choices: [ds_z0_config.json, ds_z2_config.json, ds_z3_config.json]
### dataset
dataset
:
mllm_demo,identity,alpaca_en_demo
template
:
llama4
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama4-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
2
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# eval_dataset: alpaca_en_demo
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/qwen2_5vl_lora_dpo.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
Qwen/Qwen2.5-VL-7B-Instruct
image_max_pixels
:
262144
video_max_pixels
:
16384
trust_remote_code
:
true
### method
stage
:
dpo
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
pref_beta
:
0.1
pref_loss
:
sigmoid
# choices: [sigmoid (dpo), orpo, simpo]
### dataset
dataset
:
rlhf_v
template
:
qwen2_vl
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/qwen2_5vl-7b/lora/dpo
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
5.0e-6
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_lora/qwen2_5vl_lora_sft.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
Qwen/Qwen2.5-VL-7B-Instruct
image_max_pixels
:
262144
video_max_pixels
:
16384
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
mllm_demo,identity,alpaca_en_demo
# video: mllm_video_demo
template
:
qwen2_vl
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/qwen2_5vl-7b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
resume_from_checkpoint
:
null
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_aqlm.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
ISTA-DASLab/Meta-Llama-3-8B-Instruct-AQLM-2Bit-1x16
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_awq.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
TechxGenus/Meta-Llama-3-8B-Instruct-AWQ
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_bnb_npu.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
quantization_bit
:
4
quantization_method
:
bnb
double_quantization
:
false
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_gptq.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
TechxGenus/Meta-Llama-3-8B-Instruct-GPTQ
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/examples/train_qlora/llama3_lora_sft_otfq.yaml
0 → 100644
View file @
5ed76316
### model
model_name_or_path
:
meta-llama/Meta-Llama-3-8B-Instruct
quantization_bit
:
4
# choices: [8 (bnb/hqq/eetq), 4 (bnb/hqq), 3 (hqq), 2 (hqq)]
quantization_method
:
bnb
# choices: [bnb, hqq, eetq]
trust_remote_code
:
true
### method
stage
:
sft
do_train
:
true
finetuning_type
:
lora
lora_rank
:
8
lora_target
:
all
### dataset
dataset
:
identity,alpaca_en_demo
template
:
llama3
cutoff_len
:
2048
max_samples
:
1000
overwrite_cache
:
true
preprocessing_num_workers
:
16
dataloader_num_workers
:
4
### output
output_dir
:
saves/llama3-8b/lora/sft
logging_steps
:
10
save_steps
:
500
plot_loss
:
true
overwrite_output_dir
:
true
save_only_model
:
false
report_to
:
none
# choices: [none, wandb, tensorboard, swanlab, mlflow]
### train
per_device_train_batch_size
:
1
gradient_accumulation_steps
:
8
learning_rate
:
1.0e-4
num_train_epochs
:
3.0
lr_scheduler_type
:
cosine
warmup_ratio
:
0.1
bf16
:
true
ddp_timeout
:
180000000
### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
docker-hub/qwen2.5-vl/llama-factory/pyproject.toml
0 → 100644
View file @
5ed76316
[build-system]
requires
=
["setuptools>=61.0"]
build-backend
=
"setuptools.build_meta"
[project]
name
=
"llamafactory"
dynamic
=
[
"version"
,
"dependencies"
,
"optional-dependencies"
,
"requires-python"
,
"scripts"
,
"authors"
,
"description"
,
"readme"
,
"license"
,
"keywords"
,
"classifiers"
]
[tool.ruff]
target-version
=
"py39"
line-length
=
119
indent-width
=
4
[tool.ruff.lint]
ignore
=
[
"C408"
,
# collection
"C901"
,
# complex
"E501"
,
# line too long
"E731"
,
# lambda function
"E741"
,
# ambiguous var name
"D100"
,
# no doc public module
"D101"
,
# no doc public class
"D102"
,
# no doc public method
"D103"
,
# no doc public function
"D104"
,
# no doc public package
"D105"
,
# no doc magic method
"D107"
,
# no doc __init__
]
extend-select
=
[
"C"
,
# complexity
"E"
,
# error
"F"
,
# pyflakes
"I"
,
# isort
"W"
,
# warning
"UP"
,
# pyupgrade
"D"
,
# pydocstyle
"PT009"
,
# pytest assert
"RUF022"
,
# sort __all__
]
[tool.ruff.lint.isort]
lines-after-imports
=
2
known-first-party
=
["llamafactory"]
known-third-party
=
[
"accelerate"
,
"datasets"
,
"gradio"
,
"numpy"
,
"peft"
,
"torch"
,
"transformers"
,
"trl"
,
]
[tool.ruff.lint.pydocstyle]
convention
=
"google"
[tool.ruff.format]
quote-style
=
"double"
indent-style
=
"space"
docstring-code-format
=
true
skip-magic-trailing-comma
=
false
line-ending
=
"auto"
[tool.uv]
conflicts
=
[
[
{
extra
=
"torch-npu"
}
,
{
extra
=
"aqlm"
}
,
],
[
{
extra
=
"torch-npu"
}
,
{
extra
=
"liger-kernel"
}
,
],
[
{
extra
=
"torch-npu"
}
,
{
extra
=
"vllm"
}
,
],
[
{
extra
=
"sglang"
}
,
{
extra
=
"minicpm_v"
}
,
],
]
docker-hub/qwen2.5-vl/llama-factory/requirements.txt
0 → 100644
View file @
5ed76316
# core deps
transformers>=4.49.0,<=4.52.4,!=4.52.0; sys_platform != 'darwin'
transformers>=4.49.0,<=4.51.3,!=4.52.0; sys_platform == 'darwin'
datasets>=2.16.0,<=3.6.0
accelerate>=1.3.0,<=1.7.0
peft>=0.14.0,<=0.15.2
trl>=0.8.6,<=0.9.6
tokenizers>=0.19.0,<=0.21.1
# gui
gradio>=4.38.0,<=5.31.0
matplotlib>=3.7.0
tyro<0.9.0
# ops
einops
numpy<2.0.0
pandas>=2.0.0
scipy
# model and tokenizer
sentencepiece
tiktoken
modelscope>=1.14.0
hf-transfer
# python
fire
omegaconf
packaging
protobuf
pyyaml
pydantic<=2.10.6
# api
uvicorn
fastapi
sse-starlette
# media
av
librosa
docker-hub/qwen2.5-vl/llama-factory/scripts/api_example/test_image.py
0 → 100644
View file @
5ed76316
# Copyright 2025 the LlamaFactory team.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
import
os
from
openai
import
OpenAI
from
transformers.utils.versions
import
require_version
require_version
(
"openai>=1.5.0"
,
"To fix: pip install openai>=1.5.0"
)
def
main
():
client
=
OpenAI
(
api_key
=
"{}"
.
format
(
os
.
getenv
(
"API_KEY"
,
"0"
)),
base_url
=
"http://localhost:{}/v1"
.
format
(
os
.
getenv
(
"API_PORT"
,
8000
)),
)
messages
=
[]
messages
.
append
(
{
"role"
:
"user"
,
"content"
:
[
{
"type"
:
"text"
,
"text"
:
"Output the color and number of each box."
},
{
"type"
:
"image_url"
,
"image_url"
:
{
"url"
:
"https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen2-VL/boxes.png"
},
},
],
}
)
result
=
client
.
chat
.
completions
.
create
(
messages
=
messages
,
model
=
"test"
)
messages
.
append
(
result
.
choices
[
0
].
message
)
print
(
"Round 1:"
,
result
.
choices
[
0
].
message
.
content
)
# The image shows a pyramid of colored blocks with numbers on them. Here are the colors and numbers of ...
messages
.
append
(
{
"role"
:
"user"
,
"content"
:
[
{
"type"
:
"text"
,
"text"
:
"What kind of flower is this?"
},
{
"type"
:
"image_url"
,
"image_url"
:
{
"url"
:
"https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen2-VL/flowers.jpg"
},
},
],
}
)
result
=
client
.
chat
.
completions
.
create
(
messages
=
messages
,
model
=
"test"
)
messages
.
append
(
result
.
choices
[
0
].
message
)
print
(
"Round 2:"
,
result
.
choices
[
0
].
message
.
content
)
# The image shows a cluster of forget-me-not flowers. Forget-me-nots are small ...
if
__name__
==
"__main__"
:
main
()
Prev
1
2
3
4
5
6
7
8
9
10
…
15
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment