Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
vision_transformer_jax
Commits
d0748ce3
Commit
d0748ce3
authored
Oct 31, 2024
by
suily
Browse files
Update README.md
parent
eb099bf1
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
19 additions
and
7 deletions
+19
-7
README.md
README.md
+19
-7
No files found.
README.md
View file @
d0748ce3
...
@@ -29,7 +29,7 @@ ViT主要包括patch embeding、transformer encoder、MLP head三部分:以图
...
@@ -29,7 +29,7 @@ ViT主要包括patch embeding、transformer encoder、MLP head三部分:以图
### Docker(方法一)
### Docker(方法一)
```
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/jax:0.4.23-ubuntu20.04-dtk24.04.1-py3.10
docker pull image.sourcefind.cn:5000/dcu/admin/base/jax:0.4.23-ubuntu20.04-dtk24.04.1-py3.10
docker run -it --network=host --privileged=true --name=vit --device=/dev/kfd --device=/dev/dri --group-add video --shm-size=32G --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v /path/your_code_data/:/path/your_code_data/ <imageID> /bin/bash # <imageID>为以上拉取的docker的镜像ID替换
docker run -it --network=host --privileged=true --name=vit --device=/dev/kfd --device=/dev/dri --group-add video --shm-size=32G --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v /path/your_code_data/:/path/your_code_data/
-v /opt/hyhal/:/opt/hyhal/:ro
<imageID> /bin/bash # <imageID>为以上拉取的docker的镜像ID替换
cd /your_code_path/vision_transformer
cd /your_code_path/vision_transformer
pip install flax==0.6.9 # flax会强制安装某版本ai包
pip install flax==0.6.9 # flax会强制安装某版本ai包
...
@@ -46,7 +46,7 @@ pip install jaxlib-0.4.23+das1.1.git387bd43.abi1.dtk2404-cp310-cp310-manylinux_2
...
@@ -46,7 +46,7 @@ pip install jaxlib-0.4.23+das1.1.git387bd43.abi1.dtk2404-cp310-cp310-manylinux_2
### Dockerfile(方法二)
### Dockerfile(方法二)
```
```
docker build --no-cache -t vit:latest .
docker build --no-cache -t vit:latest .
docker run -it --network=host --privileged=true --name=vit --device=/dev/kfd --device=/dev/dri --group-add video --shm-size=32G --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v /path/your_code_data/:/path/your_code_data/ vit /bin/bash
docker run -it --network=host --privileged=true --name=vit --device=/dev/kfd --device=/dev/dri --group-add video --shm-size=32G --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v /path/your_code_data/:/path/your_code_data/
-v /opt/hyhal/:/opt/hyhal/:ro
vit /bin/bash
cd /your_code_path/vision_transformer
cd /your_code_path/vision_transformer
pip install flax==0.6.9 # flax会强制安装某版本ai包
pip install flax==0.6.9 # flax会强制安装某版本ai包
...
@@ -105,8 +105,7 @@ vim /usr/local/lib/python3.10/site-packages/tensorflow_datasets/core/utils/gcs_u
...
@@ -105,8 +105,7 @@ vim /usr/local/lib/python3.10/site-packages/tensorflow_datasets/core/utils/gcs_u
│ └── label.labels.txt
│ └── label.labels.txt
```
```
### 推理数据集
### 推理数据集
推理所用图片和文件可根据scnet或以下代码进行下载:
推理所用图片和文件可根据
[
scnet
](
)或以下代码进行下载:
-
http://113.200.138.88:18080/aidatasets/project-dependency/vision_transformer_jax
```
```
# ./dataset是存储地址,可自订
# ./dataset是存储地址,可自订
wget https://storage.googleapis.com/bit_models/ilsvrc2012_wordnet_lemmas.txt -P ./dataset
wget https://storage.googleapis.com/bit_models/ilsvrc2012_wordnet_lemmas.txt -P ./dataset
...
@@ -119,7 +118,7 @@ wget https://picsum.photos/384 -O ./dataset/picsum.jpg # 将图片调整为384
...
@@ -119,7 +118,7 @@ wget https://picsum.photos/384 -O ./dataset/picsum.jpg # 将图片调整为384
│ └── picsum.jpg
│ └── picsum.jpg
```
```
## 训练
## 训练
检查点可通过scnet或以下方式进行下载:
检查点可通过
[
scnet
](
http://113.200.138.88:18080/aimodels/findsource-dependency/vision_transformer_jax/-/tree/master/imagenet21k?ref_type=heads
)
或以下方式进行下载:
```
```
cd /your_code_path/vision_transformer/test_result # test_result为检查点下载地址,可自订
cd /your_code_path/vision_transformer/test_result # test_result为检查点下载地址,可自订
wget https://storage.googleapis.com/vit_models/imagenet21k/ViT-B_16.npz
wget https://storage.googleapis.com/vit_models/imagenet21k/ViT-B_16.npz
...
@@ -141,7 +140,7 @@ sh test.sh
...
@@ -141,7 +140,7 @@ sh test.sh
# config.optim_dtype='bfloat16' # 精度
# config.optim_dtype='bfloat16' # 精度
```
```
## 推理
## 推理
检查点可通过scnet或以下方式进行下载:
检查点可通过
[
scnet
](
http://113.200.138.88:18080/aimodels/findsource-dependency/vision_transformer_jax/-/tree/master/imagenet21k+imagenet2012?ref_type=heads
)
或以下方式进行下载:
```
```
cd /your_code_path/vision_transformer/test_result # test_result为检查点下载地址,可自订
cd /your_code_path/vision_transformer/test_result # test_result为检查点下载地址,可自订
wget https://storage.googleapis.com/vit_models/imagenet21k+imagenet2012/ViT-B_16.npz -O ViT-B_16_imagenet2012.npz
wget https://storage.googleapis.com/vit_models/imagenet21k+imagenet2012/ViT-B_16.npz -O ViT-B_16_imagenet2012.npz
...
@@ -207,7 +206,20 @@ gpu推理结果:
...
@@ -207,7 +206,20 @@ gpu推理结果:
0.00004 : sandbar, sand_bar
0.00004 : sandbar, sand_bar
```
```
### 精度
### 精度
无
k800
*
1(1410Mhz,80G,cuda11.8):
| 参数 | acc | loss |
| -------------------------------- | ------- | -------- |
| model_datasets='b16,cifar10'
<br>
config.batch=512
<br/>
config.total_steps=500
<br/>
config.optim_dtype = 'bfloat16' | 0.98047 | 0.428023 |
| model_datasets='b16,cifar100'
<br/>
config.batch=512
<br/>
config.total_steps=500
<br/>
config.optim_dtype = 'bfloat16' | 0.89206 | 1.25078 |
| model_datasets='l16,cifar10'
<br/>
config.batch=512
<br/>
config.total_steps=500
<br/>
config.optim_dtype = 'bfloat16' | 0.98890 | 0.348941 |
| model_datasets='l16,cifar100'
<br/>
config.batch=512
<br/>
config.total_steps=500
<br/>
config.optim_dtype = 'bfloat16' | 0.91375 | 1.05141 |
k100
*
1(1270Mhz,64G,dtk24.04.1):
| 参数 | acc | loss |
| ------------------------------------------------------------ | ------- | -------- |
| model_datasets='b16,cifar10'
<br/>
config.batch=512
<br/>
config.total_steps=500
<br/>
config.optim_dtype = 'bfloat16' | 0.98037 | 0.43239 |
| model_datasets='b16,cifar100'
<br/>
config.batch=512
<br/>
config.total_steps=500
<br/>
config.optim_dtype = 'bfloat16' | 0.89001 | 1.2273 |
| model_datasets='l16,cifar10'
<br/>
config.batch=512
<br/>
config.total_steps=500
<br/>
config.optim_dtype = 'bfloat16' | 0.98921 | 0.306221 |
## 应用场景
## 应用场景
### 算法类别
### 算法类别
`图像分类`
`图像分类`
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment