Chayenne
|
7d1485d376
|
Add get weights by parameter name for llama (#2266)
|
2024-11-29 23:36:38 -08:00 |
|
Lianmin Zheng
|
dd5eba4c88
|
Remove fused_moe_grok (#2223)
|
2024-11-27 14:28:55 -08:00 |
|
kk
|
a9ca297d76
|
[3rdparty, document] Updated Documentation that for triton fused_moe kernel tuning for AMD Instinct GPUs (#2191)
Co-authored-by: wunhuang <wunhuang@amd.com>
Co-authored-by: HAI <hixiao@gmail.com>
|
2024-11-28 02:23:10 +08:00 |
|
leishaoSC
|
d1150e9a00
|
Updated Instructions on Profiling SGLang Infer System with AMD GPUs (#1966)
Co-authored-by: wunhuang <wunhuang@amd.com>
|
2024-11-08 23:19:03 -08:00 |
|
Xuehai Pan
|
a5e0defb5a
|
minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926)
|
2024-11-06 13:46:04 +00:00 |
|
jacky.cheng
|
d59a47828c
|
[3rdparty, document] Updated Documentation that covers performance tuning techniques for AMD Instinct GPUs. (#1871)
Co-authored-by: root <root@dell300x-pla-t10-23.pla.dcgpu>
|
2024-11-01 12:12:59 -07:00 |
|
HAI
|
5010e0d2ca
|
[3rdparty, document] Add 3rdparty/amd, with profiling and tuning instructions to be added (#1822)
|
2024-10-29 10:51:02 -07:00 |
|
Cody Yu
|
26c3494152
|
[Submodule] Change FlashInfer to import (#156)
|
2024-02-06 19:28:29 -08:00 |
|
Liangsheng Yin
|
08ab2a1655
|
Json Decode && Mutl-Turns (#4)
|
2024-01-15 00:49:29 -08:00 |
|
Liangsheng Yin
|
ead5b39f82
|
Add flashinfer && Oultines (#1)
|
2024-01-08 08:26:18 -08:00 |
|