gaoqiong / flash-attention · Repository
tests/models/test_gpt_generation_parallel.py (6.38 KB)
Commit 78b7a1dc1869e03a39cf3d2e2d9e5dbb1f669810: [OPT] Load fp16 weights on CPU before moving to GPU
Tri Dao authored Jan 22, 2023
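
The commit message describes a common pattern for large fp16 checkpoints: deserialize the weights onto CPU first, then move the model to GPU, so peak GPU memory during loading stays at a single copy of the parameters. A minimal sketch of that pattern in plain PyTorch follows; the model and checkpoint path here are hypothetical placeholders for illustration, not the repository's actual loading code:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in model; the commit itself concerns OPT weights.
model = nn.Linear(4096, 4096, dtype=torch.float16)

# Deserialize the fp16 checkpoint onto CPU (map_location="cpu") so no GPU
# memory is consumed while the state dict is being read from disk.
state_dict = torch.load("checkpoint_fp16.pt", map_location="cpu")  # hypothetical path
model.load_state_dict(state_dict)

# Only after the weights are materialized on CPU, move the model to GPU.
# Peak GPU usage is then one fp16 copy of the parameters.
model = model.to("cuda")
```

Loading with `map_location="cuda"` instead would stage tensors through GPU memory during deserialization, which can exhaust memory for multi-billion-parameter models.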