[2/n]decouple quantization implementation from vLLM dependency (#8112)
Co-authored-by: walker-ai <yiyun.wyt@antgroup.com> Co-authored-by: leoneo <1320612015@qq.com>
This commit is contained in:
1950
sgl-kernel/csrc/gemm/gptq/gptq_kernel.cu
Normal file
1950
sgl-kernel/csrc/gemm/gptq/gptq_kernel.cu
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user