Commit 8bcc3dc7 authored by chenych

Merge branch 'master' into 'master'

Verified the KTO method for the glm4 and chatglm3-6b models

See merge request !3
parents b0501784 99259580
@@ -18,6 +18,8 @@ LLaMA Factory is a framework for training and running inference with large language models, supporting ModelScope
| Model | Model size | Template |
| ------------------------------------------------------------ | -------------------------------- | --------- |
| [Baichuan 2](https://huggingface.co/baichuan-inc) | 7B/13B | baichuan2 |
| [ChatGLM3](https://huggingface.co/THUDM) | 6B | chatglm3 |
| [GLM-4](https://huggingface.co/THUDM) | 9B | glm4 |
| [Gemma 2](https://huggingface.co/google) | 2B/9B | gemma |
| [Llama 2](https://huggingface.co/meta-llama) | 7B/13B/70B | llama2 |
| [Llama 3/Llama 3.1](https://huggingface.co/meta-llama) | 8B/70B | llama3 |
@@ -25,6 +27,7 @@ LLaMA Factory is a framework for training and running inference with large language models, supporting ModelScope
| [XVERSE](https://hf-mirror.com/xverse) | 7B/13B | xverse |
| [OLMo](https://hf-mirror.com/allenai) | 1B/7B | olmo |

More models are being added on an ongoing basis...

> For all "base" models, the `template` parameter can be any value, such as `default`, `alpaca`, or `vicuna`. For "instruct/chat" models, however, be sure to use the **corresponding template**.
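
The template rule in the note above can be sketched as a small lookup. This is only an illustration built from the table rows visible in this diff: `CHAT_TEMPLATES` and `pick_template` are hypothetical names, not part of LLaMA Factory, and the choice of `default` for base models is just one of the values the note allows.

```python
# Hypothetical helper, not a LLaMA Factory API: maps the model names shown
# in the README table to the template each chat model is listed with.
CHAT_TEMPLATES = {
    "Baichuan 2": "baichuan2",
    "ChatGLM3": "chatglm3",
    "GLM-4": "glm4",
    "Gemma 2": "gemma",
    "Llama 2": "llama2",
    "Llama 3/Llama 3.1": "llama3",
    "XVERSE": "xverse",
    "OLMo": "olmo",
}


def pick_template(model_name: str, is_chat: bool) -> str:
    """Return a template name following the rule stated in the note.

    Base models may use any template, so this sketch simply picks
    "default". Chat models must use the template listed for them.
    """
    if not is_chat:
        return "default"
    if model_name not in CHAT_TEMPLATES:
        # Refuse to guess: a wrong template on a chat model is the
        # mistake the note warns against.
        raise ValueError(f"no template listed for chat model {model_name!r}")
    return CHAT_TEMPLATES[model_name]


if __name__ == "__main__":
    print(pick_template("GLM-4", is_chat=True))
    print(pick_template("GLM-4", is_chat=False))
```

Raising on an unknown chat model, rather than falling back to `default`, is a deliberate choice in this sketch, since the note requires chat models to use their matching template.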