[Feature] Support AWQ MoE W4A16 Quantization (#142)
Signed-off-by: tangshiwen <tangshiwen@baidu.com> Co-authored-by: Li Wei <liwei.109@outlook.com>
This commit is contained in:
@@ -31,7 +31,7 @@ Like vLLM, we now support quantization methods such as compressed-tensors, AWQ,
|
||||
<td style="padding: 10px; border: 1px solid #000;">✅</td>
|
||||
<td style="padding: 10px; border: 1px solid #000;">✅</td>
|
||||
<td style="padding: 10px; border: 1px solid #000;">✅</td>
|
||||
<td style="padding: 10px; border: 1px solid #000;">WIP</td>
|
||||
<td style="padding: 10px; border: 1px solid #000;">✅</td>
|
||||
<td style="padding: 10px; border: 1px solid #000;">✅</td>
|
||||
<td style="padding: 10px; border: 1px solid #000;">WIP</td>
|
||||
</tr>
|
||||
|
||||
Reference in New Issue
Block a user