### What this PR does / why we need it?
This PR fixes the error in DSV32 mixed deployment caused by enabling
layer_sharding.
- Mixed deployment no longer supports layer_sharding, so the option has
been removed from the service-oriented configuration.
- The error "RPC call to sample_tokens timed out" occurred because the
dshm size limit was too small. The limit has been increased to 512Gi.
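
For reviewers unfamiliar with the dshm setting, the fragment below is a minimal sketch of what such a limit typically looks like, assuming the test runs in a Kubernetes pod and that dshm is a memory-backed `emptyDir` volume mounted at `/dev/shm`. Only the volume name `dshm` and the 512Gi value come from this PR; the container name and mount path are illustrative assumptions, not the actual test template.

```yaml
# Hypothetical pod spec fragment (not copied from the repository).
spec:
  containers:
    - name: worker              # assumed container name
      volumeMounts:
        - name: dshm
          mountPath: /dev/shm   # assumed mount point for shared memory
  volumes:
    - name: dshm
      emptyDir:
        medium: Memory          # back the volume with tmpfs (RAM)
        sizeLimit: 512Gi        # the raised limit described in this PR
```

If the shared-memory volume is too small, processes that exchange data through `/dev/shm` can stall or fail, which is consistent with the RPC timeout described above.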
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
The nightly test has passed.
Signed-off-by: wyh145 <1987244901@qq.com>