Unverified commit 3a302150, authored by Atream, committed by GitHub

Merge pull request #795 from hybcloud/main

fix minor typo
parents f46b3fd5 6b551a6e
@@ -86,7 +86,7 @@ Memory: standard DDR5-4800 server DRAM (1 TB), each socket with 8×DDR5-4800
 #### Change Log
 - Longer Context (from 4K to 8K for 24GB VRAM) and Slightly Faster Speed (+15%):<br>
 Integrated the highly efficient Triton MLA Kernel from the fantastic sglang project, enable much longer context length and slightly faster prefill/decode speed
-- We suspect that some of the improvements come from the change of hardwre platform (4090D->4090)
+- We suspect that some of the improvements come from the change of hardware platform (4090D->4090)
 #### Benchmark Results
......