Logo
Explore Help
Register Sign In
EngineX-Hygon/sglang
5
0
Fork 0
You've already forked sglang
Code Issues Pull Requests Actions 7 Projects Releases Wiki Activity
1,694 Commits 8 Branches 0 Tags
41d7e5b7e68f3aa0ef741d8774333c3518522d5e
Commit Graph

8 Commits

Author SHA1 Message Date
Lianmin Zheng
8a6906127a Improve linear.py to load sharded weights & remove the dependency of Parameters from vllm (#2784)
Co-authored-by: SangBin Cho rkooo567@gmail.com
2025-01-07 23:29:10 -08:00
Chayenne
7d1485d376 Add get weights by parameter name for llama (#2266) 2024-11-29 23:36:38 -08:00
Lianmin Zheng
dd5eba4c88 Remove fused_moe_grok (#2223) 2024-11-27 14:28:55 -08:00
kk
a9ca297d76 [3rdparty, document] Updated Documentation that for triton fused_moe kernel tuning for AMD Instinct GPUs (#2191)
Co-authored-by: wunhuang <wunhuang@amd.com>
Co-authored-by: HAI <hixiao@gmail.com>
2024-11-28 02:23:10 +08:00
leishaoSC
d1150e9a00 Updated Instructions on Profiling SGLang Infer System with AMD GPUs (#1966)
Co-authored-by: wunhuang <wunhuang@amd.com>
2024-11-08 23:19:03 -08:00
Xuehai Pan
a5e0defb5a minor: Add basic editorconfig and pre-commit hooks to enforce style for whitespaces (#1926) 2024-11-06 13:46:04 +00:00
jacky.cheng
d59a47828c [3rdparty, document] Updated Documentation that covers performance tuning techniques for AMD Instinct GPUs. (#1871)
Co-authored-by: root <root@dell300x-pla-t10-23.pla.dcgpu>
2024-11-01 12:12:59 -07:00
HAI
5010e0d2ca [3rdparty, document] Add 3rdparty/amd, with profiling and tuning instructions to be added (#1822) 2024-10-29 10:51:02 -07:00
Powered by Gitea Version: 1.24.3 Page: 339ms Template: 8ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API