11 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Shenggui Li | c6a4852136 | [docs] added torch.compile cache to dpsk manual (#3737) | 2025-02-21 00:11:40 -08:00 |
| Baizhou Zhang | ac05310098 | [Docs] Modify ep related server args and remove cublas part of deepseek (#3732) | 2025-02-21 03:37:56 +08:00 |
| Baizhou Zhang | 67fc595bb8 | [Feature] Apply Cublas Grouped Gemm kernel (#3629) | 2025-02-18 15:18:31 +08:00 |
| Jhin | bf2a70872e | Update DeepSeek V3 Doc (#3541) | 2025-02-12 23:15:37 -08:00 |
| Didier Durand | eefcbdd353 | fix deepseek_v3 typo (#3497) | 2025-02-12 02:58:36 +08:00 |
| Shi Shuai | 591e751e07 | Fix: Runtime error for function calling (#3300) | 2025-02-06 20:52:01 -08:00 |
| Chayenne | 76ca91dff2 | Docs/CI: Enable Fake Finish for Docs Only PR (#3350) | 2025-02-06 19:33:31 -08:00 |
| Chayenne | 2584f6d944 | Docs: Add Performance Demonstaration for DPA (#3005) | 2025-01-20 01:00:52 -08:00 |
| Shi Shuai | c4f9707e16 | Improve: Token-In Token-Out Usage for RLHF (#2843) | 2025-01-11 15:14:26 -08:00 |
| Chayenne | 5cc1170552 | Doc: add block-wise FP8 in dpsk model reference (#2830) | 2025-01-10 00:26:59 -08:00 |
| Xiaotong Jiang | 11fffbc95a | [Doc]: Deepseek reference docs (#2787) | 2025-01-09 13:43:12 -08:00 |