--- datasets: - Open-Orca/OpenOrca library_name: transformers tags: - llama pipeline_tag: text-generation --- GGML/GGUF(v2) Quantizations of the model: https://huggingface.co/winglian/basilisk-4b This is winglian/llama-2-4b, a 4B parameter Llama-2 model, finetuned with open orca CoT data. I tried to run on latest llama.cpp commit, but I was getting an error(GGML_ASSERT: llama.cpp:8136: false), then I converted again the model to gguf using this llama.cpp commit https://github.com/ggerganov/llama.cpp/tree/019ba1dcd0c7775a5ac0f7442634a330eb0173cc it seems to be working now.