Yi-34B-Llama-GGUF/README.md at main

Files

ModelHub XC de19135537 初始化项目，由ModelHub XC社区提供模型

Model: simustar/Yi-34B-Llama-GGUF
Source: Original Platform

2026-05-15 15:30:36 +08:00

pipeline_tag, language, tags

pipeline_tag

language

The following tables list the available Yi-34B-Llamafied model files with their respective quantization methods and characteristics.

Key:

Q-Method	File Name	Size	Quality Loss	Recommended
Q2	Yi-34B-Llama_Q2_K	Smallest	Extreme (not recommended)
Q3	Yi-34B-Llama_Q3_K_S	Very Small	Very High
Q3	Yi-34B-Llama_Q3_K_M	Very Small	Very High
Q3	Yi-34B-Llama_Q3_K_L	Small	Substantial
Q4	Yi-34B-Llama_Q4_K_S	Small	Significant
Q4	Yi-34B-Llama_Q4_K_M	Medium	Balanced	Recommended
Q5	Yi-34B-Llama_Q5_K_S	Large	Low	Recommended
Q5	Yi-34B-Llama_Q5_K_M	Large	Very Low	Recommended
Q6	Yi-34B-Llama_Q6_K	Very Large	Extremely Low
Q8	Yi-34B-Llama_Q8_0	Very Large	Extremely Low (not recommended)

Please choose the model that best suits your needs based on the size and quality loss trade-offs.