Pleaplusone
1a1f9a6d89
port deepseekv2 and mtp to main branch (#429)
### What this PR does / why we need it?
This PR ports all of the DeepSeek graph mode code and MTP code from v0.7.3
to the main branch.
---------
Signed-off-by: SidaoY <1024863041@qq.com>
Signed-off-by: linfeng-yuan <1102311262@qq.com>
Signed-off-by: Yizhou Liu <liuyizhou5@h-partners.com>
Signed-off-by: mengwei805 <mengwei25@huawei.com>
Signed-off-by: libaokui <libaokui@huawei.com>
Signed-off-by: q00832892 <qiaoyang19@huawei.com>
Signed-off-by: ganyi <pleaplusone.gy@gmail.com>
Co-authored-by: SidaoY <1024863041@qq.com>
Co-authored-by: linfeng-yuan <1102311262@qq.com>
Co-authored-by: Yizhou Liu <liuyizhou5@h-partners.com>
Co-authored-by: mengwei805 <mengwei25@huawei.com>
Co-authored-by: libaokui <libaokui@huawei.com>
2025-04-19 17:38:18 +08:00
hfadzxy
9935d45728
[CI] Add model basic accuracy test (Qwen2.5-0.5B-Instruct) (#460)
### What this PR does / why we need it?
Add a basic model accuracy test (Qwen2.5-0.5B-Instruct).
Signed-off-by: hfadzxy <starmoon_zhang@163.com>
2025-04-17 14:59:56 +08:00
Mengqing Cao
6061f33670
[Bugfix][Model] Fix api in DeepSeek model (#545)
### What this PR does / why we need it?
Fix the API usage in DeepSeekV2 to align with the latest code on the main
branch of vLLM.
### Does this PR introduce _any_ user-facing change?
N/A
### How was this patch tested?
Tested locally with deepseek-v2-lite; CI will be added by @Potabk.
Please update the model UT after this PR is merged, thanks! cc @Potabk
Signed-off-by: MengqingCao <cmq0113@163.com>
2025-04-17 11:56:05 +08:00
Mengqing Cao
f6cf92e7d5
[quant][bugfix] fix deepseek quant bug (#478)
see #465
Signed-off-by: MengqingCao <cmq0113@163.com>
Co-authored-by: zzzzwwjj <1183291235@qq.com>
2025-04-08 09:15:56 +08:00
Mengqing Cao
344228a5da
[deepseek][bugfix] support deepseek quant (#469)
- support deepseek quant
- add w8a8_dynamic quant
see #391
Signed-off-by: MengqingCao <cmq0113@163.com>
Co-authored-by: zzzzwwjj <1183291235@qq.com>
2025-04-07 10:56:12 +08:00