This website requires JavaScript.
Explore
Help
Register
Sign In
Chranos
/
enginex-mlu370-vllm
Watch
1
Star
0
Fork
0
You've already forked enginex-mlu370-vllm
forked from
EngineX-Cambricon/enginex-mlu370-vllm
Code
Pull Requests
Activity
Files
e1a2afd244e250c5fafb3f4dcb48071b8bf3a98d
enginex-mlu370-vllm
/
vllm-v0.6.2
/
.buildkite
/
lm-eval-harness
/
configs
History
…
..
DeepSeek-V2-Lite-Chat.yaml
…
Meta-Llama-3-8B-Instruct-Channelwise-compressed-tensors.yaml
…
Meta-Llama-3-8B-Instruct-FBGEMM-nonuniform.yaml
…
Meta-Llama-3-8B-Instruct-FP8-compressed-tensors.yaml
…
Meta-Llama-3-8B-Instruct-FP8.yaml
…
Meta-Llama-3-8B-Instruct-INT8-compressed-tensors-asym.yaml
…
Meta-Llama-3-8B-Instruct-INT8-compressed-tensors.yaml
…
Meta-Llama-3-8B-Instruct-nonuniform-compressed-tensors.yaml
…
Meta-Llama-3-8B-Instruct.yaml
…
Meta-Llama-3-8B-QQQ.yaml
…
Meta-Llama-3-70B-Instruct-FBGEMM-nonuniform.yaml
…
Meta-Llama-3-70B-Instruct.yaml
…
Meta-Llama-3.2-1B-Instruct-INT8-compressed-tensors.yaml
…
Minitron-4B-Base-FP8.yaml
…
Mixtral-8x7B-Instruct-v0.1-FP8.yaml
…
Mixtral-8x7B-Instruct-v0.1.yaml
…
Mixtral-8x22B-Instruct-v0.1-FP8-Dynamic.yaml
…
models-large.txt
…
models-small.txt
…
Qwen2-1.5B-Instruct-FP8W8.yaml
…
Qwen2-1.5B-Instruct-INT8-compressed-tensors.yaml
…
Qwen2-1.5B-Instruct-W8A16-compressed-tensors.yaml
…
Qwen2-57B-A14-Instruct.yaml
…