ModelZoo / textmonkey_pytorch
Commit 63f5186c (unverified)
Authored Nov 09, 2023 by Melos; committed by GitHub, Nov 09, 2023
Update README.md
Parent: 30f7a82d
Changes: 1 changed file with 5 additions and 5 deletions
README.md (+5 -5)
# Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models


<div align="center">
Zhang Li*, Biao Yang*, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu†, Xiang Bai
...
@@ -22,7 +22,7 @@ Zhang Li*, Biao Yang*, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun,
## performance


## Demo
...
@@ -30,13 +30,13 @@ We have a demo open for everyone to play.[Demo](https://74a00f7621c2ecf691.gradi
## Cases
-Our model is able to accurately describe almost all the details in the image.
+Our model can accurately describe almost all the details in the image.


Besides, our model has also demonstrated some capabilities in fine-grained question answering and even answering questions involving world knowledge.


With the power of large-scale architecture, we have also achieved impressive performance on document-based tasks.
...