Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
pariskang
CMLM-ZhongJing
Commits
acd549d4
Unverified
Commit
acd549d4
authored
Feb 21, 2024
by
pariskang
💬
Committed by
GitHub
Feb 21, 2024
Browse files
Update README.md
parent
9d6cf7a7
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
0 deletions
+1
-0
README.md
README.md
+1
-0
No files found.
README.md
View file @
acd549d4
...
...
@@ -10,6 +10,7 @@
# 训练及推理声明
我们开源了针对Qwen1.5-1.8B-Chat模型的微调权重,在一张Tesla T4显卡即可实现高速推理。通过在我们专有医疗数据集上进行多次迭代训练确保模型在中医药领域具备较强理解和生成能力。模型权重可在
[
https://huggingface.co/CMLL/ZhongJing-2-1_8b
](
https://huggingface.co/CMLL/ZhongJing-2-1_8b
)
下载。
推荐使用
[
colab
](
https://colab.research.google.com/drive/1DCPomUsfTxqkqxKpK-AIGvBSPbkOm7R3#scrollTo=jsn4szdjdtmF
)
免费GPU推理。
## 1.指令数据构建:
目前大多如Alpaca、Belle等工作基于self-instruct思路。self-instruct思路可以很好的调用大语言模型的知识,生成多样和具有创造性的指令,在常规问答场景可以快速构造海量指令实现指令调优。但在一些专业知识容错率较低的领域,比如医疗和法律场景,幻觉输出会导致噪声指令数据从而影响模型的准确性。典型的情况是比如不当的诊断及处方建议甚至影响患者生命,事实性错误的法律条文和法理的引用会造成权益人的败诉。因此,如何快速调用OpenAI API且不牺牲指令数据的专业性成为指令数据构造及标注等场景的重要研究方向。以下将简述我们的初步实验探索。
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment