Support --enable-llama4-multimodal (#5254)
This commit is contained in:
@@ -281,7 +281,6 @@ class ModelRunner:
|
||||
f"Automatically reduce --mem-fraction-static to {self.mem_fraction_static:.3f} "
|
||||
f"because this is a multimodal model."
|
||||
)
|
||||
|
||||
logger.info(
|
||||
"Automatically turn off --chunked-prefill-size for multimodal model."
|
||||
)
|
||||
|
||||
Reference in New Issue
Block a user