Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
donut_pytorch
Commits
3c6d818d
Commit
3c6d818d
authored
Aug 10, 2022
by
Geewook Kim
Browse files
chore: add yaml files, related to #7
parent
787d918b
Changes
3
Show whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
66 additions
and
0 deletions
+66
-0
config/train_docvqa.yaml
config/train_docvqa.yaml
+22
-0
config/train_rvlcdip.yaml
config/train_rvlcdip.yaml
+22
-0
config/train_zhtrainticket.yaml
config/train_zhtrainticket.yaml
+22
-0
No files found.
config/train_docvqa.yaml
0 → 100644
View file @
3c6d818d
resume_from_checkpoint_path
:
null
result_path
:
"
./result"
pretrained_model_name_or_path
:
"
naver-clova-ix/donut-base"
dataset_name_or_paths
:
[
"
./dataset/docvqa"
]
# should be prepared from https://rrc.cvc.uab.es/?ch=17
sort_json_key
:
True
train_batch_sizes
:
[
2
]
val_batch_sizes
:
[
4
]
input_size
:
[
2560
,
1920
]
max_length
:
128
align_long_axis
:
False
num_nodes
:
8
seed
:
2022
lr
:
3e-5
warmup_steps
:
10000
num_training_samples_per_epoch
:
39463
max_epochs
:
300
max_steps
:
null
num_workers
:
8
val_check_interval
:
1.0
check_val_every_n_epoch
:
1
gradient_clip_val
:
0.25
verbose
:
True
config/train_rvlcdip.yaml
0 → 100644
View file @
3c6d818d
resume_from_checkpoint_path
:
null
result_path
:
"
./result"
pretrained_model_name_or_path
:
"
naver-clova-ix/donut-base"
dataset_name_or_paths
:
[
"
./dataset/rvlcdip"
]
# should be prepared from https://www.cs.cmu.edu/~aharley/rvl-cdip/
sort_json_key
:
True
train_batch_sizes
:
[
2
]
val_batch_sizes
:
[
4
]
input_size
:
[
2560
,
1920
]
max_length
:
8
align_long_axis
:
False
num_nodes
:
8
seed
:
2022
lr
:
2e-5
warmup_steps
:
10000
num_training_samples_per_epoch
:
320000
max_epochs
:
100
max_steps
:
null
num_workers
:
8
val_check_interval
:
1.0
check_val_every_n_epoch
:
1
gradient_clip_val
:
1.0
verbose
:
True
config/train_zhtrainticket.yaml
0 → 100644
View file @
3c6d818d
resume_from_checkpoint_path
:
null
result_path
:
"
./result"
pretrained_model_name_or_path
:
"
naver-clova-ix/donut-base"
dataset_name_or_paths
:
[
"
./dataset/zhtrainticket"
]
# should be prepared from https://github.com/beacandler/EATEN
sort_json_key
:
True
train_batch_sizes
:
[
8
]
val_batch_sizes
:
[
1
]
input_size
:
[
960
,
1280
]
max_length
:
256
align_long_axis
:
False
num_nodes
:
1
seed
:
2022
lr
:
3e-5
warmup_steps
:
300
num_training_samples_per_epoch
:
1368
max_epochs
:
10
max_steps
:
null
num_workers
:
8
val_check_interval
:
1.0
check_val_every_n_epoch
:
1
gradient_clip_val
:
1.0
verbose
:
True
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment