Commit f6f392c8, authored by Iman Tabrizian, committed by GitHub

Remove link to the fix for disagg + eagle3 for TRT-LLM example (#2006)


Signed-off-by: Iman Tabrizian <10105175+Tabrizian@users.noreply.github.com>
parent cc90ca62
@@ -32,7 +32,6 @@ For advanced control over how requests are routed between prefill and decode wor
## Notes
* To run Eagle Speculative Decoding with Llama 4, ensure the container meets the following criteria:
  * Built with a version of TensorRT-LLM based on the 0.21 release [Link](https://github.com/NVIDIA/TensorRT-LLM/tree/release/0.21)
  * The TensorRT-LLM build includes the changes from this PR [Link](https://github.com/NVIDIA/TensorRT-LLM/pull/5975)
* If you need to download model weights off huggingface, make sure you run the command `huggingface-cli login` and have access to the necessary gated models.