Files
xc-llm-ascend/tests
Nagisa125 600bf80c6d [CI]Fix the error caused by layer_sharding in dsv32 (#8719)
### What this PR does / why we need it?

This PR fixes the error in DSV32 mixed deployment caused by enabling
layer_sharding.

- Currently, mixed deployment no longer supports the enabling of
layer_sharding. Therefore, it has been removed from the service-oriented
configuration.
- The error "RPC call to sample_tokens timed out" occurred because the
dshm size limit was set too small. Therefore, it was increased to 512
Gi.

### Does this PR introduce _any_ user-facing change?

no.

### How was this patch tested?

The nightly test has passed.

Signed-off-by: wyh145 <1987244901@qq.com>
2026-04-30 10:35:48 +08:00
..