base_model, inference, model_creator, model_name, pipeline_tag, quantized_by, tags
base_model inference model_creator model_name pipeline_tag quantized_by tags
mogaio/TinyLlama-con-brainstorming-v0.2 false mogaio TinyLlama-con-brainstorming-v0.2 text-generation afrideva
gguf
ggml
quantized
q2_k
q3_k_m
q4_k_m
q5_k_m
q6_k
q8_0

mogaio/TinyLlama-con-brainstorming-v0.2-GGUF

Quantized GGUF model files for TinyLlama-con-brainstorming-v0.2 from mogaio

Name Quant method Size
tinyllama-con-brainstorming-v0.2.fp16.gguf fp16 2.20 GB
tinyllama-con-brainstorming-v0.2.q2_k.gguf q2_k 483.12 MB
tinyllama-con-brainstorming-v0.2.q3_k_m.gguf q3_k_m 550.83 MB
tinyllama-con-brainstorming-v0.2.q4_k_m.gguf q4_k_m 668.80 MB
tinyllama-con-brainstorming-v0.2.q5_k_m.gguf q5_k_m 783.03 MB
tinyllama-con-brainstorming-v0.2.q6_k.gguf q6_k 904.40 MB
tinyllama-con-brainstorming-v0.2.q8_0.gguf q8_0 1.17 GB

Original Model Card:

Description
Model synced from source: afrideva/TinyLlama-con-brainstorming-v0.2-GGUF
Readme 25 KiB