ModelHub XC 9c343a9d7d 初始化项目,由ModelHub XC社区提供模型
Model: prithivMLmods/QVikhr-3-4B-it-F32-GGUF
Source: Original Platform
2026-06-19 12:45:13 +08:00

license, base_model, language, pipeline_tag, library_name, tags
license base_model language pipeline_tag library_name tags
apache-2.0
Vikhrmodels/QVikhr-3-4B-Instruction
en
ru
text-generation transformers
ru
text-generation-inference

QVikhr-3-4B-it-F32-GGUF

QVikhr-3-4B-Instruction, Instructive model based on Qwen/Qwen3-4B, trained on the Russian-language dataset GrandMaster2. Designed for high-efficiency text processing in Russian and English, delivering precise responses and fast task execution.

Model Files

File Name Size Type Description
QVikhr-3-4B-Instruction.Q2_K.gguf 1.67 GB Model Q2_K quantized model (smallest)
QVikhr-3-4B-Instruction.Q3_K_S.gguf 1.89 GB Model Q3_K_S quantized model
QVikhr-3-4B-Instruction.Q3_K_M.gguf 2.08 GB Model Q3_K_M quantized model
QVikhr-3-4B-Instruction.Q3_K_L.gguf 2.24 GB Model Q3_K_L quantized model
QVikhr-3-4B-Instruction.Q4_K_S.gguf 2.38 GB Model Q4_K_S quantized model
QVikhr-3-4B-Instruction.Q4_K_M.gguf 2.5 GB Model Q4_K_M quantized model
QVikhr-3-4B-Instruction.Q5_K_S.gguf 2.82 GB Model Q5_K_S quantized model
QVikhr-3-4B-Instruction.Q5_K_M.gguf 2.89 GB Model Q5_K_M quantized model
QVikhr-3-4B-Instruction.Q6_K.gguf 3.31 GB Model Q6_K quantized model
QVikhr-3-4B-Instruction.Q8_0.gguf 4.28 GB Model Q8_0 quantized model
QVikhr-3-4B-Instruction.BF16.gguf 8.05 GB Model BF16 precision model
QVikhr-3-4B-Instruction.F16.gguf 8.05 GB Model F16 precision model
QVikhr-3-4B-Instruction.F32.gguf 16.1 GB Model F32 full precision model (largest)
.gitattributes 2.58 kB Config Git LFS configuration
config.json 31 Bytes Config Model configuration
README.md 149 Bytes Documentation Repository documentation

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

image.png

Description
Model synced from source: prithivMLmods/QVikhr-3-4B-it-F32-GGUF
Readme 27 KiB