1. 15 Aug, 2023 15 commits
    • [shardformer] support inplace sharding (#4251) · d921ce83
      Hongxin Liu authored
      * [shardformer] embedding support inplace sharding
      
      * [shardformer] linear support inplace sharding
      
      * [shardformer] layernorm support inplace sharding
      
      * [shardformer] qkv support inplace sharding
      
      * [test] update shardformer layer test
      
      * [shardformer] fix shared param sharding
      
      * [shardformer] fix bert policy
      
      * [shardformer] fix bloom policy
      
      * [shardformer] fix llama policy
      
      * [shardformer] fix opt policy
      
      * [shardformer] fix t5 policy
      
      * [shardformer] fix fused qkv linear
      
      * [shardformer] fix bugs
      
      * force sync
      
      * [test] fix bugs
      
      * [test] fix transformer version
    • [pipeline] support shardformer for GPT2ForQuestionAnswering & complete pipeline support for GPT2 (#4245) · 2a2eacfa
      Baizhou Zhang authored
      * change for transformers loggers
      
      * add forward for GPT2ForQuestionAnswering
      
      * fix assert
      
      * fix torchrec test
    • [bugs] hot fix some testing bugs for new models (#4268) · d9be0472
      Jianghai authored
      * hot fix
      
      * hot fx tracer
    • [pipeline] finish bloom models pipeline and tests (#4223) · 34f0e34a
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * finish bloom model
      
      * test shard gpt2
      
      * clear cache
      
      * support all bloom models
      
      * add bloom models policies
      
      * finish bloom pipeline and tests
      
      * add set pipeline
      
      * finish bloom
    • [pipeline] All bert models (#4233) · e7cc62d7
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * Revert "bloom policy"
      
      This reverts commit 8dee68a0a22568dbeed6d4563372b25e1e825fb0.
      
      This policy should be reverted and copied to feature/bloom
      
      * revert the bloom changes
      
      * cancel unneeded inputs
      
      * gpt
      
      * finish llama
      
      * causal lm and sequence classification
      
      * revision
      
      * add pure pipeline test
      
      * finish some bert models
      
      * finish all bert models
      
      * finish bert tests
      
      * fix bugs
      
      * fix bugs
      
      * fix test pipeline
      
      * fix data gen for qa
      
      * update the set pipeline forward
      
      * shared params
      
      * fix bugs
    • [pipeline] add pipeline forward for variants of gpt2 (#4238) · a14d3520
      Baizhou Zhang authored
      * add forward for GPT2LMHeadModel
      
      * add test for gpt_lm
      
      * arranging get_held_layers method
      
      * arrange forward replacement
      
      * add forward for GPT2ForTokenClassification
      
      * add forward for GPT2ForSequenceClassification
      
      * fix test_shard_gpt2.py
      
      * add GPT2DoubleHeadsModel & fix bugs
      
      * add id checking in get_shared_params
    • [pipeline] Add Pipeline Forward for GPT2Model Shardformer (#4224) · 208ac8f2
      Baizhou Zhang authored
      * fix typehint & docstring in sharder.py
      
      * update pipeline forward for GPT2Model
      
      * add test for pipeline forward of GPT2Model
      
      * add cache cleaning in gpt2 test
      
      * change assert to raise command
    • [pipeline] add bloom model pipeline (#4210) · 37d22f68
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * finish bloom model
      
      * test shard gpt2
      
      * clear cache
    • [pipeline] Llama causal lm and llama for sequence classification pipeline (#4208) · 31bcf867
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * Revert "bloom policy"
      
      This reverts commit 8dee68a0a22568dbeed6d4563372b25e1e825fb0.
      
      This policy should be reverted and copied to feature/bloom
      
      * revert the bloom changes
      
      * cancel unneeded inputs
      
      * gpt
      
      * finish llama
      
      * causal lm and sequence classification
      
      * revision
    • [pipeline] Llama pipeline (#4205) · 16220310
      Jianghai authored
      * bloom policy
      
      * llama pipeline forward and tests
      
      * fix the output and attention_mask
      
      * fix name
      
      * bind argument to policy
      
      * Revert "bloom policy"
      
      This reverts commit 8dee68a0a22568dbeed6d4563372b25e1e825fb0.
      
      This policy should be reverted and copied to feature/bloom
      
      * revert the bloom changes
      
      * cancel unneeded inputs
      
      * gpt
    • [pipeline] Bert pipeline for shardformer and its tests (#4197) · 1094e0f0
      Jianghai authored
      * add pipeline forward
      
      * complete pipeline forward check
      
      * fix bert forward without pipeline
      
      * fix comments
      
      * discard useless line
      
      * add todo
      
      * clean prints
      
      * fix distribute layers
    • [shardformer] support lazy init (#4202) · 890774b2
      Hongxin Liu authored
      * [shardformer] support lazy init
      
      * [shardformer] linear support lazy init
      
      * [shardformer] embedding support lazy init
      
      * [shardformer] norm support lazy init
      
      * [shardformer] fused linear support lazy init
      
      * [test] update shardformer test layer
      
      * [test] shardformer with lazy init fit ddp
      
      * [lazy] hotfix deepcopy of param
      
      * [shardformer] fix bert policy and update test
      
      * [shardformer] fix bloom policy and update test
      
      * [shardformer] fix opt policy and update test
      
      * [shardformer] fix t5 policy and update test
      
      * [shardformer] fix gpt2 policy and update test
      
      * [shardformer] fix llama policy and update test
    • [pipeline] move bert related pipeline components to shardformer (#4187) · f3bcc292
      Jianghai authored
      * move bert related pipeline components to shardformer
      
      * fix bugs
      
      * revision
      
      * fix bert model tests
      
      * fix bert_lm_head model tests
      
      * fix tests
      
      * fix tests
      
      * done checks
      
      * skip bloom
    • [test] add shard util tests · 5fc60a3a
      ver217 authored
    • [test] update shardformer tests · 2d6cc07f
      ver217 authored
  2. 04 Jul, 2023 25 commits