Commits · 184b992dcb2a0890adaa19eb9b541c3e4f9d2a08 · gaoqiong / flash-attention

29 Jul, 2023 1 commit
- [GPT] Implement parallel LLaMa · 184b992d
  Tri Dao authored Jul 28, 2023
26 Jul, 2023 2 commits
- [GPT] Add LLaMa-13B to test · 56ccaff1
  Tri Dao authored Jul 26, 2023
- [Rotary] Fix tests when loading state dict with rotary inv_freqs · 8e9820a5
  Tri Dao authored Jul 26, 2023
02 Jul, 2023 1 commit
- [Rotary] Make sure frequency calculation is in fp32 · 62e98144
  Tri Dao authored Jul 02, 2023
05 May, 2023 1 commit
- [LLaMa] Fix last norm layer to use RMSNorm instead of LayerNorm · a9a4b4e4
  Tri Dao authored May 04, 2023
19 Apr, 2023 1 commit
- Implement LLaMa · 96d10f65
  Tri Dao authored Apr 18, 2023