Docs: Quick fix for Speculative_decoding doc (#3228)

Co-authored-by: Chayenne <zhaochenyang@ucla.edu> Co-authored-by: Chayenne <zhaochen20@outlook.com>

Docs: Quick fix for Speculative_decoding doc (#3228)
Co-authored-by: Chayenne <zhaochenyang@ucla.edu> Co-authored-by: Chayenne <zhaochen20@outlook.com>
656f7fc1 · Jhin · GitHub · cf0f7eaf · 656f7fc1
Unverified Commit 656f7fc1 authored Jan 31, 2025 by Jhin Committed by GitHub Jan 31, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 4 additions and 3 deletions

docs/backend/speculative_decoding.ipynb docs/backend/speculative_decoding.ipynb +4 -3

No files found.
--- a/docs/backend/speculative_decoding.ipynb
+++ b/docs/backend/speculative_decoding.ipynb
@@ -8,10 +8,11 @@
    "\n",
    "SGLang now provides an EAGLE-based speculative decoding option. The implementation aims to maximize speed and efficiency and is considered to be among the fastest in open-source LLM engines.\n",
    "\n",
+    "**Note:** Currently, Speculative Decoding in SGLang does not support radix cache.\n",
+    "\n",
    "To run the following tests or benchmarks, you also need to install [**cutex**](https://pypi.org/project/cutex/):  \n",
-    "> ```bash\n",
-    "> pip install cutex\n",
-    "> ```\n",
+    "\n",
+    "`pip install cutex`\n",
    "\n",
    "### Performance Highlights\n",
    "\n",