• NielsRogge's avatar
    Add MarkupLM (#19198) · f3d2f7a6
    NielsRogge authored
    
    
    * First draft
    
    * Make basic test work
    
    * Fix most tokenizer tests
    
    * More improvements
    
    * Make more tests pass
    
    * Fix more tests
    
    * Fix some code quality
    
    * Improve truncation
    
    * Implement feature extractor
    
    * Improve feature extractor and add tests
    
    * Improve feature extractor tests
    
    * Fix pair_input test partly
    
    * Add fast tokenizer
    
    * Improve implementation
    
    * Fix rebase
    
    * Fix rebase
    
    * Fix most of the tokenizer tests.
    
    * propose solution for fast
    
    * add: integration test for fasttokenizer, warning for decode, fix template in slow tokenizer
    
    * add: modify markuplmconverter
    
    * add: some modify on converter and tokenizerfast
    
    * Fix style, copies
    
    * Make fixup
    
    * Update tokenization_markuplm.py
    
    * Update test_tokenization_markuplm.py
    
    * Update markuplm related
    
    * Improve processor, add integration test
    
    * Add processor test file
    
    * Improve processor
    
    * Improve processor tests
    
    * Fix more processor tests
    
    * Fix processor tests
    
    * Update docstrings
    
    * Add Copied from statements
    
    * Add more Copied from statements
    
    * Add code examples
    
    * Improve code examples
    
    * Add model to doc tests
    
    * Adding dependency check
    
    * Add dummy file
    
    * Add requires_backends
    
    * Add model to toctree
    
    * Fix more things, disable dependency check for now
    
    * Apply more suggestions
    
    * Add soft dependency
    
    * Add annotators to tests
    
    * Fix style
    
    * Remove from_slow=True
    
    * Remove print statements
    
    * Add sanity check
    
    * Fix processor test
    
    * Fix processor tests, add more docs
    
    * Add doc tests for mdx file
    
    * Add more tips
    
    * Apply suggestions
    Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
    Co-authored-by: default avatarlockon-n <45759388+lockon-n@users.noreply.github.com>
    Co-authored-by: default avatarSaulLu <lucilesaul.com@gmail.com>
    Co-authored-by: default avatarlockon-n <dd098309@126.com>
    f3d2f7a6
test_feature_extraction_markuplm.py 3.56 KB