"test/git@developer.sourcefind.cn:change/sglang.git" did not exist on "70645f4d7d1447d2c5aa0667b88af92a20018b17"
Unverified Commit b5d888e2 authored by ShuoZhang2003's avatar ShuoZhang2003 Committed by GitHub
Browse files

Update README.md

parent 141de0ba
...@@ -143,9 +143,9 @@ We also offer Monkey's model definition and training code, which you can explore ...@@ -143,9 +143,9 @@ We also offer Monkey's model definition and training code, which you can explore
**ATTENTION:** Specify the path to your training data, which should be a json file consisting of a list of conversations. **ATTENTION:** Specify the path to your training data, which should be a json file consisting of a list of conversations.
Inspired by Qwen-VL, we freeze the Large Language Model (LLM) and introduce LoRA into four linear layers ```"c_attn", "attn.c_proj", "w1", "w2"``` for training. This step makes it possible to train Monkey using 8 NVIDIA 3090 GPUs. The specific implementation code is in ```model_qwen_new.py```. Inspired by Qwen-VL, we freeze the Large Language Model (LLM) and introduce LoRA into four linear layers ```"c_attn", "attn.c_proj", "w1", "w2"``` for training. This step makes it possible to train Monkey using 8 NVIDIA 3090 GPUs. The specific implementation code is in ```model_qwen_nvdia3090.py```.
- Add LoRA: You need to replace the contents of ```model_qwen.py``` with the contents of ```model_qwen_new.py```. - Add LoRA: You need to replace the contents of ```model_qwen.py``` with the contents of ```model_qwen_nvdia3090.py```.
- Freeze LLM: You need to freeze other modules except LoRA and Resampler modules in ```finetune_multitask.py```. - Freeze LLM: You need to freeze other modules except LoRA and Resampler modules in ```finetune_multitask.py```.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment