"server/text_generation_server/models/gpt_neox.py" did not exist on "32a253063dae768e71a0b0aa099cfbbe962032d1"
-
Daniel Hiltgen authored
Now that we call the GPU discovery routines many times to update memory, this splits initial discovery from free memory updating.
43ed358f