-
Daniel Hiltgen authored
On the smaller GPUs, the initial model load of llama2 took over 30s (the default timeout for the DoGenerate helper)
73e2c8f6
On the smaller GPUs, the initial model load of llama2 took over 30s (the default timeout for the DoGenerate helper)