• Suraj Patil's avatar
    VisionTextDualEncoder (#13511) · fc1d97f2
    Suraj Patil authored
    
    
    * init vision_text_dual_encoder
    
    * fix merge
    
    * remove extra heads
    
    * fix tests
    
    * remove VISION_TEXT_DUAL_ENCODER_PRETRAINED_CONFIG_ARCHIVE_MAP
    
    * remove archive map
    
    * fix imports
    
    * fix more imports
    
    * fix init
    
    * delete tokenizers
    
    * fix imports
    
    * clean
    
    * support clip's vision model
    
    * handle None config
    
    * begin tests
    
    * more test and few fixes
    
    * warn about newly init weights
    
    * more tests
    
    * add loss to model
    
    * remove extra classes from doc
    
    * add processor
    
    * doc and small fixes
    
    * add start docstr
    
    * update flax model
    
    * flax tests
    
    * more flax tests
    
    * doc
    
    * quality
    
    * doc and quality
    
    * fix doc
    
    * doc
    
    * remove comments
    
    * update warning
    
    * quality
    
    * fix docs
    
    * Apply suggestions from code review
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    
    * replace asserts, fix imports
    
    * update imports
    
    * fix import
    
    * address some review comments
    
    * fix check
    
    * reduce tolerance
    
    * fix test
    
    * add flax integration test
    
    * Apply suggestions from code review
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * address Sylvain's comments
    
    * fix style
    
    * add pt_flax_equivalence test in PT tests
    
    * add pt integration test
    
    * update test
    
    * use pre-trained checkpoint in examples
    Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    fc1d97f2
test_modeling_vision_text_dual_encoder.py 21 KB