1. 15 Aug, 2023 21 commits
    • Bin Jia's avatar
      [test] Hotfix/fix some model test and refactor check util api (#4369) · 5c6f1831
      Bin Jia authored
      * fix llama test
      
      * fix test bug of bert, blip2, bloom, gpt2
      
      * fix llama test
      
      * fix opt test
      
      * fix sam test
      
      * fix sam test
      
      * fix t5 test
      
      * fix vit test
      
      * fix whisper test
      
      * fix whisper test
      
      * polish code
      
      * adjust allclose parameter
      
      * Add mistakenly deleted code
      
      * addjust allclose
      
      * change loss function for some base model
      5c6f1831
    • FoolPlayer's avatar
      [shardformer] support Blip2 (#4243) · 879301d0
      FoolPlayer authored
      * support base blip2
      
      * add support for downstream blip2 model
      
      * update readme
      
      * add forward injection
      
      * skip not compatible models test
      
      * fix test for gemini and low_level_zero_pugin
      879301d0
    • klhhhhh's avatar
    • klhhhhh's avatar
      [shardformer] pre-commit check files · 4da05052
      klhhhhh authored
      4da05052
    • klhhhhh's avatar
      [shardformer] ChatGLM support layernorm sharding · f155ae89
      klhhhhh authored
      f155ae89
    • klhhhhh's avatar
      [shardformer] delete some file · 00f6ef15
      klhhhhh authored
      00f6ef15
    • klhhhhh's avatar
      [shardformer] support chatglm without layernorm · dad00c42
      klhhhhh authored
      dad00c42
    • klhhhhh's avatar
      [shardformer] polish code · cbb54d32
      klhhhhh authored
      cbb54d32
    • klhhhhh's avatar
      6ee4c9ee
    • klhhhhh's avatar
      import chatglm · 7377be7a
      klhhhhh authored
      7377be7a
    • Kun Lin's avatar
      Feature/chatglm (#4240) · ed34bb13
      Kun Lin authored
      * [shardformer] added tests
      
      * [shardformer] vit test finish and support
      
      * [shardformer] chatglm ready
      
      * import chatglm
      
      * [shardformer] add test kit in model zoo for chatglm
      
      * [sharformer] add first version of policy of chatglm
      
      * [shardformer] polish chatglm code
      
      * [shardformer] polish code
      
      * [shardformer] support chatglm without layernorm
      
      * [shardformer] chatglm shard without mlp sharding
      
      * [shardformer] delete some file
      
      * [shardformer] ChatGLM support layernorm sharding
      
      * [shardformer] register without auto policy
      
      * [shardformer] pre-commit check files
      
      * [shardformer] fix chatglm configuration with pre-commit
      ed34bb13
    • FoolPlayer's avatar
      [shardformer] support whisper (#4212) · 9ee4ebea
      FoolPlayer authored
      * support whisper
      
      * fix bug in vocabembedding
      
      * support downstream model of whisper
      
      * update readme
      9ee4ebea
    • FoolPlayer's avatar
      [shardformer] support SAM (#4231) · dd2bf026
      FoolPlayer authored
      * 1.support sam 2.add fused qkv for nn.Linear
      
      * update utils support set element in list
      
      * overtwrite SamVisionAttention foward to use DropoutForParallelInput
      
      * remove unused code
      dd2bf026
    • Baizhou Zhang's avatar
      [pipeline] support fp32 for HybridPlugin/merge shardformer test and pipeline... · 0ceec8f9
      Baizhou Zhang authored
      [pipeline] support fp32 for HybridPlugin/merge shardformer test and pipeline test into one file (#4354)
      
      * add naive optimizer for 3DPlugin/refactor gpt2 shardformer test
      
      * merge tests of PP/DP/TP combinations into one test file
      
      * fix bug when sync grad for dp in HybridPlugin
      
      * update supported precisions for 3DPlugin/fix bug when shifting tp_degree
      
      * improve the passing of lazy_init
      
      * modify lazy_init/use sync_shared_params
      0ceec8f9
    • FoolPlayer's avatar
      [shardformer] support pipeline base vit model (#4284) · b3f5d7a3
      FoolPlayer authored
      
      
      * Feature/vit support (#4182)
      
      * [shardformer] added tests
      
      * [shardformer] vit test finish and support
      
      * fix attention dropout
      
      * support base vit pipeline
      
      * support vit downstream model
      
      * fix vit shard test
      
      * modify hidden states return type
      
      ---------
      Co-authored-by: default avatarKun Lin <81014421+klhhhhh@users.noreply.github.com>
      b3f5d7a3
    • Baizhou Zhang's avatar
      [pipeline] add pipeline support for T5Stack/T5EncoderModel (#4300) · 36e546b2
      Baizhou Zhang authored
      * modify t5 policy & add test
      
      * pipeline stage distribution for t5
      
      * complete t5 base policy
      
      * t5 stack: halfway
      
      * modify gpt2 pipeline test
      
      * complete pipeline forward for T5Stack/T5EncoderModel
      
      * fix docstring
      
      * move t5 util tests to test_pipeline
      36e546b2
    • Baizhou Zhang's avatar
      [pipeline] support shardformer for GPT2ForQuestionAnswering & complete... · 2a2eacfa
      Baizhou Zhang authored
      [pipeline] support shardformer for GPT2ForQuestionAnswering & complete pipeline support for GPT2 (#4245)
      
      * change for transformers loggers
      
      * add forward for GPT2ForQuestionAnswering
      
      * fix assert
      
      * fix torchrec test
      2a2eacfa
    • Jianghai's avatar
      [pipeline] All bert models (#4233) · e7cc62d7
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * Revert "bloom policy"
      
      This reverts commit 8dee68a0a22568dbeed6d4563372b25e1e825fb0.
      
      This policy should be revert and copied to feature/bloom
      
      * revert the bloom changes
      
      * cancel unneeded inputs
      
      * gpt
      
      * finish llama
      
      * causal lm and sequence classification
      
      * revision
      
      * add pure pipeline test
      
      * finish some bert models
      
      * finish all bert models
      
      * finish bert tests
      
      * fix bugs
      
      * fix bugs
      
      * fix test pipeline
      
      * fix data gen for qa
      
      * update the set pipeline forward
      
      * shared params
      
      * fix bugs
      e7cc62d7
    • Jianghai's avatar
      [pipeline] add bloom model pipeline (#4210) · 37d22f68
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * finish bloom model
      
      * test shard gpt2
      
      * clear cache
      37d22f68
    • Jianghai's avatar
      [pipeline] Llama causal lm and llama for sequence classification pipeline (#4208) · 31bcf867
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * Revert "bloom policy"
      
      This reverts commit 8dee68a0a22568dbeed6d4563372b25e1e825fb0.
      
      This policy should be revert and copied to feature/bloom
      
      * revert the bloom changes
      
      * cancel unneeded inputs
      
      * gpt
      
      * finish llama
      
      * causal lm and sequence classification
      
      * revision
      31bcf867
    • Jianghai's avatar
      [pipeline] Llama pipeline (#4205) · 16220310
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * Revert "bloom policy"
      
      This reverts commit 8dee68a0a22568dbeed6d4563372b25e1e825fb0.
      
      This policy should be revert and copied to feature/bloom
      
      * revert the bloom changes
      
      * cancel unneeded inputs
      
      * gpt
      16220310
  2. 04 Jul, 2023 5 commits
  3. 09 Jun, 2023 1 commit
  4. 26 Apr, 2023 1 commit
    • Hongxin Liu's avatar
      [booster] add low level zero plugin (#3594) · 4b3240cb
      Hongxin Liu authored
      * [booster] add low level zero plugin
      
      * [booster] fix gemini plugin test
      
      * [booster] fix precision
      
      * [booster] add low level zero plugin test
      
      * [test] fix booster plugin test oom
      
      * [test] fix booster plugin test oom
      
      * [test] fix googlenet and inception output trans
      
      * [test] fix diffuser clip vision model
      
      * [test] fix torchaudio_wav2vec2_base
      
      * [test] fix low level zero plugin test
      4b3240cb
  5. 22 Mar, 2023 1 commit
    • YuliangLiu0306's avatar
      [FX] refactor experimental tracer and adapt it with hf models (#3157) · f57d3495
      YuliangLiu0306 authored
      * pass gpt trace and meta_prop
      
      * pass t5 trace and meta_prop
      
      * [FX] refactor experimental tracer and adapt it with hf models
      
      * pass all mainstream model zoo
      
      * fix CI
      
      * fix CI
      
      * fix CI
      
      * fix CI
      
      * fix CI
      
      * fix CI
      
      * fix CI
      
      * fix CI
      
      * skip tests
      
      * fix CI
      
      * using packaging version
      
      * polish
      f57d3495
  6. 20 Mar, 2023 2 commits
  7. 17 Mar, 2023 2 commits
    • ver217's avatar
      [lazyinit] add correctness verification (#3147) · 6ae8ed04
      ver217 authored
      * [lazyinit] fix shared module
      
      * [tests] add lazy init test utils
      
      * [tests] add torchvision for lazy init
      
      * [lazyinit] fix pre op fn
      
      * [lazyinit] handle legacy constructor
      
      * [tests] refactor lazy init test models
      
      * [tests] refactor lazy init test utils
      
      * [lazyinit] fix ops don't support meta
      
      * [tests] lazy init test timm models
      
      * [lazyinit] fix set data
      
      * [lazyinit] handle apex layers
      
      * [tests] lazy init test transformers models
      
      * [tests] lazy init test torchaudio models
      
      * [lazyinit] fix import path
      
      * [tests] lazy init test torchrec models
      
      * [tests] update torch version in CI
      
      * [tests] revert torch version in CI
      
      * [tests] skip lazy init test
      6ae8ed04
    • Frank Lee's avatar
      [booster] implemented mixed precision class (#3151) · ed192905
      Frank Lee authored
      * [booster] implemented mixed precision class
      
      * polish code
      ed192905
  8. 15 Mar, 2023 4 commits
  9. 14 Mar, 2023 2 commits