This PR fixes a mlapo accuracy problem related with weight processing. Furthermore, add back mlapo related e2e test with quantized deepseek model. - vLLM version: v0.11.0rc3 - vLLM main: 83f478bb19 Signed-off-by: whx-sjtu <2952154980@qq.com>
83f478bb19