Yineng Zhang | 80002562a8 | docs: update README (#2878) | 2025-01-14 12:48:17 +08:00
Yineng Zhang | 41d7e5b7e6 | docs: update link (#2857) | 2025-01-13 18:40:48 +08:00
Lianmin Zheng | 72c7776355 | Fix linear.py and improve weight loading (#2851); Co-authored-by: SangBin Cho <rkooo567@gmail.com> | 2025-01-13 01:39:14 -08:00
Yineng Zhang | 197cbf9bab | docs: update README (#2841) | 2025-01-11 23:11:38 +08:00
Yineng Zhang | f624901cdd | chore: bump v0.4.1.post5 (#2840) | 2025-01-11 23:10:02 +08:00
Rodrigo Garcia | a990daff9c | Included multi-node DeepSeekv3 example (#2707) | 2025-01-02 22:17:03 +08:00
Lianmin Zheng | ad20b7957e | Eagle speculative decoding part 3: small modifications to the general scheduler (#2709); Co-authored-by: kavioyu <kavioyu@tencent.com> | 2025-01-02 02:09:08 -08:00
Lianmin Zheng | 8c3b420eec | [Docs] clean up structured outputs docs (#2654) | 2024-12-29 23:57:16 -08:00
Yineng Zhang | 098d659c0e | docs: update README (#2651) | 2024-12-30 13:33:29 +08:00
Lzhang-hub | 76d14f8cb9 | add 2*h20 node serving example for deepseek v3 (#2650); Co-authored-by: Yineng Zhang <me@zhyncs.com> | 2024-12-30 13:04:38 +08:00
Lianmin Zheng | 03d5fbfd44 | Release 0.4.1.post3 - upload the config.json to PyPI (#2647) | 2024-12-29 14:25:53 -08:00
Yineng Zhang | 763dd55d17 | docs: update README (#2644) | 2024-12-30 01:24:06 +08:00
Ke Bao | 8a2681e26a | Update readme (#2625) | 2024-12-28 13:39:56 +08:00
Yineng Zhang | d9e6ee382b | docs: update README (#2618) | 2024-12-28 00:21:53 +08:00
Lianmin Zheng | f46f394f4d | Update README.md (#2605) | 2024-12-26 10:58:49 -08:00
Lianmin Zheng | 773951548d | Fix logprob_start_len for multi modal models (#2597); Co-authored-by: libra <lihu723@gmail.com>; Co-authored-by: fzyzcjy <ch271828n@outlook.com>; Co-authored-by: Wang, Haoyu <haoyu.wang@intel.com> | 2024-12-26 06:27:45 -08:00
fsygd | 637de9e8ce | update readme of DeepSeek V3 (#2596) | 2024-12-26 21:31:56 +08:00
Yineng Zhang | 635a042623 | docs: update deepseek v3 example (#2592) | 2024-12-26 17:43:37 +08:00
Yineng Zhang | 75ad0a143f | docs: add deepseek v3 launch instructions (#2589) | 2024-12-25 23:26:54 -08:00