Update README.md

b5d888e2 · ShuoZhang2003 · GitHub · 141de0ba · b5d888e2
Unverified Commit b5d888e2 authored Dec 16, 2023 by ShuoZhang2003 Committed by GitHub Dec 16, 2023
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

README.md README.md +2 -2

No files found.
--- a/README.md
+++ b/README.md
@@ -143,9 +143,9 @@ We also offer Monkey's model definition and training code, which you can explore
 **ATTENTION:** Specify the path to your training data, which should be a json file consisting of a list of conversations.
-Inspired by Qwen-VL, we freeze the Large Language Model (LLM) and introduce LoRA into four linear layers ```"c_attn", "attn.c_proj", "w1", "w2"``` for training. This step makes it possible to train Monkey using 8 NVIDIA 3090 GPUs. The specific implementation code is in ```model_qwen_new.py```.
+Inspired by Qwen-VL, we freeze the Large Language Model (LLM) and introduce LoRA into four linear layers ```"c_attn", "attn.c_proj", "w1", "w2"``` for training. This step makes it possible to train Monkey using 8 NVIDIA 3090 GPUs. The specific implementation code is in ```model_qwen_nvdia3090.py```.
- - Add LoRA: You need to replace the contents of ```model_qwen.py``` with the contents of ```model_qwen_new.py```.
+ - Add LoRA: You need to replace the contents of ```model_qwen.py``` with the contents of ```model_qwen_nvdia3090.py```.
 - Freeze LLM: You need to freeze other modules except LoRA and Resampler modules in ```finetune_multitask.py```.