TUNING.md 407 Bytes
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
13
## Tuning SGLang Infer System with AMD GPUs
This AppNote describes the SGLang performance tuning technical, code harness and running steps for systems with AMD Instinct GPUs.
Harness code, examples and steps are provided in detail, to facilitate easy reproduce & use to tune performance towards workloads.
Three primary runtime areas are covered:
- Triton Kernels


- Torch Tunable Ops 


- Torch Compile