Commit Graph

29 Commits

Author SHA1 Message Date
Yineng Zhang
4ab43cfb3e chore: bump v0.4.2 (#3180) 2025-01-27 21:42:05 +08:00
Yineng Zhang
e94fb7cb10 chore: bump v0.4.1.post7 (#3009) 2025-01-20 21:50:55 +08:00
Yineng Zhang
b3e99dfb22 chore: bump v0.4.1.post6 (#2899) 2025-01-15 16:23:42 +08:00
kk
b8cd09f27a update ROCm docker for layernorm kernel optimization (#2885)
Co-authored-by: wunhuang <wunhuang@amd.com>
2025-01-14 16:59:43 +08:00
kk
e808c1df3e Integrate ROCm ater package for ck moe function feasibility (#2854)
Co-authored-by: wunhuang <wunhuang@amd.com>
Co-authored-by: Lin, Soga <soga.lin@amd.com>
2025-01-13 08:23:07 +00:00
sogalin
a18ab81ddd Update base image for ROCm (#2852)
Co-authored-by: HAI <hixiao@gmail.com>
2025-01-13 14:39:44 +08:00
Yineng Zhang
f624901cdd chore: bump v0.4.1.post5 (#2840) 2025-01-11 23:10:02 +08:00
Yineng Zhang
2f0d386496 chore: bump v0.4.1.post4 (#2713) 2025-01-06 01:29:54 +08:00
kk
148254d4db Improve moe reduce sum kernel performance (#2705)
Co-authored-by: wunhuang <wunhuang@amd.com>
2025-01-02 01:11:06 -08:00
kk
b6e0cfb5e1 ROCm base image update (#2692)
Co-authored-by: wunhuang <wunhuang@amd.com>
2025-01-01 12:12:19 +08:00
Lianmin Zheng
03d5fbfd44 Release 0.4.1.post3 - upload the config.json to PyPI (#2647) 2024-12-29 14:25:53 -08:00
Yineng Zhang
3ccf566b0d chore: bump v0.4.1.post2 (#2643) 2024-12-30 00:11:46 +08:00
Yineng Zhang
ef5b0ff90b chore: bump v0.4.1.post1 (#2616) 2024-12-28 00:11:06 +08:00
kk
b438a2e512 Fix triton kernel performance regression (#2611)
Co-authored-by: wunhuang <wunhuang@amd.com>
2024-12-27 15:54:38 +08:00
Yineng Zhang
efc52f85e2 chore: bump v0.4.1 (#2582) 2024-12-26 07:14:51 +08:00
Yineng Zhang
8f4d04e540 chore: bump v0.4.0.post2 (#2525) 2024-12-21 21:16:34 +08:00
Lianmin Zheng
e5f227c0ee Release v0.4.0.post1 (#2375) 2024-12-06 06:08:19 -08:00
Yineng Zhang
f8b0326934 chore: bump v0.4.0 (#2338) 2024-12-03 11:55:41 -08:00
HAI
0639bf15d1 ROCm Container: set SGLANG_SET_CPU_AFFINITY=1 (#2328) 2024-12-02 23:20:33 -08:00
Yineng Zhang
fae4e5e99a chore: bump v0.3.6.post3 (#2259) 2024-11-30 01:41:16 +08:00
HAI
b79fffdcb5 Update Install Method 2. From source (#2232) 2024-11-27 22:46:55 -08:00
Lianmin Zheng
fed4c6946a Release v0.3.6.post2 (#2214)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
2024-11-27 03:35:30 -08:00
Lianmin Zheng
ac5a0f0488 Release v0.3.6.post1 (#2189) 2024-11-25 17:31:37 -08:00
Yineng Zhang
9a00e6f453 chore: bump v0.3.6 (#2120) 2024-11-22 19:27:30 +08:00
Lianmin Zheng
32c9a7ec11 Release v0.3.5.post2 (#2046) 2024-11-15 06:54:00 -08:00
Lianmin Zheng
f407fcf9ef Release v0.3.5.post1 (#2022) 2024-11-13 10:27:12 -08:00
HAI
d32fba2a4d [ENV, ROCm] update environment settings (#1939) 2024-11-07 18:24:36 -08:00
Lianmin Zheng
65859754f1 Release v0.3.5 (#1908) 2024-11-03 13:48:11 -08:00
HAI
d8e9d61f86 [Build, ROCm] Dockerfile.rocm for Instinct GPUs, with package updates (#1861) 2024-10-31 16:38:16 -07:00