14 lines
407 B
Markdown
14 lines
407 B
Markdown
## Tuning SGLang Infer System with AMD GPUs
|
|
This AppNote describes the SGLang performance tuning technical, code harness and running steps for systems with AMD Instinct GPUs.
|
|
Harness code, examples and steps are provided in detail, to facilitate easy reproduce & use to tune performance towards workloads.
|
|
Three primary runtime areas are covered:
|
|
- Triton Kernels
|
|
|
|
|
|
- Torch Tunable Ops
|
|
|
|
|
|
- Torch Compile
|
|
|
|
|