"examples/python_rs/llm/vllm_nixl/worker.py" did not exist on "dd31a322ae4311f899d8c8950a2bbc8c1a9f47da"
support more flexible setting of conv head; slice inputs when batch size is...
support more flexible setting of conv head; slice inputs when batch size is too large in PFNLayer to avoid bugs (#124) * support more flexible setting * slice inputs of nn.Linear when batch size is too large
Showing
Please register or sign in to comment