Yineng Zhang
|
4ab43cfb3e
|
chore: bump v0.4.2 (#3180)
|
2025-01-27 21:42:05 +08:00 |
|
Yineng Zhang
|
e94fb7cb10
|
chore: bump v0.4.1.post7 (#3009)
|
2025-01-20 21:50:55 +08:00 |
|
Yineng Zhang
|
b3e99dfb22
|
chore: bump v0.4.1.post6 (#2899)
|
2025-01-15 16:23:42 +08:00 |
|
kk
|
b8cd09f27a
|
update ROCm docker for layernorm kernel optimization (#2885)
Co-authored-by: wunhuang <wunhuang@amd.com>
|
2025-01-14 16:59:43 +08:00 |
|
kk
|
e808c1df3e
|
Integrate ROCm ater package for ck moe function feasibility (#2854)
Co-authored-by: wunhuang <wunhuang@amd.com>
Co-authored-by: Lin, Soga <soga.lin@amd.com>
|
2025-01-13 08:23:07 +00:00 |
|
sogalin
|
a18ab81ddd
|
Update base image for ROCm (#2852)
Co-authored-by: HAI <hixiao@gmail.com>
|
2025-01-13 14:39:44 +08:00 |
|
Yineng Zhang
|
f624901cdd
|
chore: bump v0.4.1.post5 (#2840)
|
2025-01-11 23:10:02 +08:00 |
|
Yineng Zhang
|
2f0d386496
|
chore: bump v0.4.1.post4 (#2713)
|
2025-01-06 01:29:54 +08:00 |
|
kk
|
148254d4db
|
Improve moe reduce sum kernel performance (#2705)
Co-authored-by: wunhuang <wunhuang@amd.com>
|
2025-01-02 01:11:06 -08:00 |
|
kk
|
b6e0cfb5e1
|
ROCm base image update (#2692)
Co-authored-by: wunhuang <wunhuang@amd.com>
|
2025-01-01 12:12:19 +08:00 |
|
Lianmin Zheng
|
03d5fbfd44
|
Release 0.4.1.post3 - upload the config.json to PyPI (#2647)
|
2024-12-29 14:25:53 -08:00 |
|
Yineng Zhang
|
3ccf566b0d
|
chore: bump v0.4.1.post2 (#2643)
|
2024-12-30 00:11:46 +08:00 |
|
Yineng Zhang
|
ef5b0ff90b
|
chore: bump v0.4.1.post1 (#2616)
|
2024-12-28 00:11:06 +08:00 |
|
kk
|
b438a2e512
|
Fix triton kernel performance regression (#2611)
Co-authored-by: wunhuang <wunhuang@amd.com>
|
2024-12-27 15:54:38 +08:00 |
|
Yineng Zhang
|
efc52f85e2
|
chore: bump v0.4.1 (#2582)
|
2024-12-26 07:14:51 +08:00 |
|
Yineng Zhang
|
8f4d04e540
|
chore: bump v0.4.0.post2 (#2525)
|
2024-12-21 21:16:34 +08:00 |
|
Lianmin Zheng
|
e5f227c0ee
|
Release v0.4.0.post1 (#2375)
|
2024-12-06 06:08:19 -08:00 |
|
Yineng Zhang
|
f8b0326934
|
chore: bump v0.4.0 (#2338)
|
2024-12-03 11:55:41 -08:00 |
|
HAI
|
0639bf15d1
|
ROCm Container: set SGLANG_SET_CPU_AFFINITY=1 (#2328)
|
2024-12-02 23:20:33 -08:00 |
|
Yineng Zhang
|
fae4e5e99a
|
chore: bump v0.3.6.post3 (#2259)
|
2024-11-30 01:41:16 +08:00 |
|
HAI
|
b79fffdcb5
|
Update Install Method 2. From source (#2232)
|
2024-11-27 22:46:55 -08:00 |
|
Lianmin Zheng
|
fed4c6946a
|
Release v0.3.6.post2 (#2214)
Co-authored-by: Yineng Zhang <me@zhyncs.com>
|
2024-11-27 03:35:30 -08:00 |
|
Lianmin Zheng
|
ac5a0f0488
|
Release v0.3.6.post1 (#2189)
|
2024-11-25 17:31:37 -08:00 |
|
Yineng Zhang
|
9a00e6f453
|
chore: bump v0.3.6 (#2120)
|
2024-11-22 19:27:30 +08:00 |
|
Lianmin Zheng
|
32c9a7ec11
|
Release v0.3.5.post2 (#2046)
|
2024-11-15 06:54:00 -08:00 |
|
Lianmin Zheng
|
f407fcf9ef
|
Release v0.3.5.post1 (#2022)
|
2024-11-13 10:27:12 -08:00 |
|
HAI
|
d32fba2a4d
|
[ENV, ROCm] update environment settings (#1939)
|
2024-11-07 18:24:36 -08:00 |
|
Lianmin Zheng
|
65859754f1
|
Release v0.3.5 (#1908)
|
2024-11-03 13:48:11 -08:00 |
|
HAI
|
d8e9d61f86
|
[Build, ROCm] Dockerfile.rocm for Instinct GPUs, with package updates (#1861)
|
2024-10-31 16:38:16 -07:00 |
|