Update url.md

78f552f2 · chenzk · ffcd4891 · 78f552f2
Commit 78f552f2 authored Apr 17, 2025 by chenzk

Showing with 1 addition and 3 deletions

README.md +1 -3

--- a/README.md
+++ b/README.md
@@ -85,7 +85,7 @@ chmod +x /usr/local/lib/python3.10/site-packages/gradio/frpc_linux_amd64_v0.3
 ```

 ## Datasets
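-[ShareGPT4V](http://113.200.138.88:18080/aidatasets/project-dependency/lin-chen/ShareGPT4V.git), [coco2017](http://113.200.138.88:18080/aidatasets/coco2017.git), [LLaVA-Pretrain](http://113.200.138.88:18080/aidatasets/liuhaotian/llava-pretrain.git), `sam`, `web-celebrity`, and others are the required public datasets; for the later ones in this list, ask the paper's authors for a download source. The self-built dataset `custom` is one that users must create themselves to fine-tune for their own application scenario; `input_wavs` holds the audio files and `input_imgs` the image files that custom needs, and both are used for prompts. None of the datasets above affect inference.
+[ShareGPT4V](https://huggingface.co/datasets/Lin-Chen/ShareGPT4V), [coco2017](https://cocodataset.org/#home), [LLaVA-Pretrain](https://huggingface.co/datasets/liuhaotian/LLaVA-Pretrain), `sam`, `web-celebrity`, and others are the required public datasets; for the later ones in this list, ask the paper's authors for a download source. The self-built dataset `custom` is one that users must create themselves to fine-tune for their own application scenario; `input_wavs` holds the audio files and `input_imgs` the image files that custom needs, and both are used for prompts. None of the datasets above affect inference.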

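 1. To fine-tune for their own application scenario, users prepare the dataset as a JSON file, `custom.json`, in the following way. The data in the JSON file is paired multimodal data, where set: `sharegpt4` is the keyword that signals image or video data should be loaded.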
 ```
@@ -202,8 +202,6 @@ Accuracy on DCU matches GPU; inference framework: pytorch.
    └── InternViT-300M-448px
 ```

-Fast download center for pretrained weights: [SCNet AIModels](http://113.200.138.88:18080/aimodels). The pretrained weights used in this project can be downloaded through the fast download channel: [VITA/VITA_ckpt](http://113.200.138.88:18080/aimodels/vita-mllm/VITA.git), [InternViT-300M-448px](http://113.200.138.88:18080/aimodels/opengvlab/InternViT-300M-448px.git).
-
 Hugging Face download links: [VITA/VITA_ckpt](https://huggingface.co/VITA-MLLM/VITA), [InternViT-300M-448px](https://huggingface.co/OpenGVLab/InternViT-300M-448px).
 ## Source Repository and Issue Feedback
 - http://developer.sourcefind.cn/codes/modelzoo/vita_pytorch.git
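
This commit swaps links to the `113.200.138.88:18080` host for public ones, so a quick scan for leftovers can be useful when reviewing similar changes. The following is a minimal sketch, not part of the commit: the function name `find_old_links` and the constant `OLD_HOST` are hypothetical, and the regular expression only handles simple inline markdown links of the form `[label](url)`.

```python
import re

# Assumption: the host being retired is the one removed in this diff.
OLD_HOST = "113.200.138.88:18080"

# Matches simple inline markdown links: [label](url)
LINK_RE = re.compile(r"\[([^\]]+)\]\(([^)\s]+)\)")


def find_old_links(markdown_text, old_host=OLD_HOST):
    """Return (line_number, label, url) for each link whose URL contains old_host."""
    hits = []
    for lineno, line in enumerate(markdown_text.splitlines(), start=1):
        for label, url in LINK_RE.findall(line):
            if old_host in url:
                hits.append((lineno, label, url))
    return hits
```

Passing the contents of `README.md` to `find_old_links` returns one tuple per remaining link to the old host, and an empty list means no such links were found by this pattern.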