From 068c3a0167515a3f77b5b080c43164db31171fb7 Mon Sep 17 00:00:00 2001 From: Shanshan Shen <467638484@qq.com> Date: Tue, 3 Jun 2025 11:44:45 +0800 Subject: [PATCH] [Bugfix] Add verification for `quant_action.choices` to avoid `TypeError` (#1046) ### What this PR does / why we need it? When I run vllm-ascend, I get this error message: ```bash Traceback (most recent call last): File "/home/sss/software/miniconda3/envs/vllm-v1/bin/vllm", line 8, in sys.exit(main()) File "/home/sss/github/vllm-project/vllm/vllm/entrypoints/cli/main.py", line 50, in main cmd.subparser_init(subparsers).set_defaults( File "/home/sss/github/vllm-project/vllm/vllm/entrypoints/cli/serve.py", line 101, in subparser_init serve_parser = make_arg_parser(serve_parser) File "/home/sss/github/vllm-project/vllm/vllm/entrypoints/openai/cli_args.py", line 254, in make_arg_parser parser = AsyncEngineArgs.add_cli_args(parser) File "/home/sss/github/vllm-project/vllm/vllm/engine/arg_utils.py", line 1582, in add_cli_args current_platform.pre_register_and_update(parser) File "/home/sss/github/vllm-project/vllm-ascend/vllm_ascend/platform.py", line 80, in pre_register_and_update if ASCEND_QUATIZATION_METHOD not in quant_action.choices: TypeError: argument of type 'NoneType' is not iterable [ERROR] 2025-06-03-02:53:42 (PID:6005, Device:-1, RankID:-1) ERR99999 UNKNOWN applicaiton exception ``` This is because the `choices` attribute in `quant_action` can be `None` and we don't check it. ```bash # quant_action _StoreAction(option_strings=['--quantization', '-q'], dest='quantization', nargs=None, const=None, default=None, type=, choices=None, required=False, help='Method used to quantize the weights. If `None`, we first check the\n`quantization_config` attribute in the model config file. If that is\n`None`, we assume the model weights are not quantized and use `dtype` to\ndetermine the data type of the weights.', metavar=None) ``` Thus, I have added a check for `choices` to handle the scenario where `choices=None`. 
### Does this PR introduce _any_ user-facing change? Yes, the vLLM server with Ascend quantization works now. ### How was this patch tested? Tested by running the `vllm serve --quantization ascend` command. Related: https://github.com/vllm-project/vllm/issues/19004 Signed-off-by: shen-shanshan <467638484@qq.com> Co-authored-by: wangxiyuan --- vllm_ascend/platform.py | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/vllm_ascend/platform.py b/vllm_ascend/platform.py index 8da897b..8606eb4 100644 --- a/vllm_ascend/platform.py +++ b/vllm_ascend/platform.py @@ -75,7 +75,8 @@ class NPUPlatform(Platform): # and the user can enable quantization using "vllm serve --quantization ascend". if parser is not None: quant_action = parser._option_string_actions.get('--quantization') - if quant_action and hasattr(quant_action, 'choices'): + if quant_action and hasattr(quant_action, + 'choices') and quant_action.choices: if ASCEND_QUATIZATION_METHOD not in quant_action.choices: quant_action.choices.append(ASCEND_QUATIZATION_METHOD)