Files
xc-llm-ascend/docs/source/user_guide
SILONG ZENG adc57c5951 [release] Add GLM5 known issue for 2-node PD mixed deployment (#7436)
### What this PR does / why we need it?
Documented an issue in the 2-node PD mixed deployment scenario where
inference may hang when concurrency exceeds 8.(GLM5)

Noted that the issue has been fixed in PR:
- #7235 
- #7290.
---------
Signed-off-by: MrZ20 <2609716663@qq.com>
Signed-off-by: Mengqing Cao <cmq0113@163.com>
Co-authored-by: Mengqing Cao <cmq0113@163.com>
2026-03-18 10:03:18 +00:00
..