初始化项目,由ModelHub XC社区提供模型

Model: jamesburton/Phi-4-reasoning-vision-15B-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-04-13 14:28:07 +08:00
commit 8716d2260a
13 changed files with 181 additions and 0 deletions

46
.gitattributes vendored Normal file
View File

@@ -0,0 +1,46 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-f16.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q2_k.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q3_k_l.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q3_k_s.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q4_k_s.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q5_k_s.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
phi-4-reasoning-vision-q8_0.gguf filter=lfs diff=lfs merge=lfs -text

102
README.md Normal file
View File

@@ -0,0 +1,102 @@
---
license: mit
language:
- en
base_model: microsoft/Phi-4-reasoning-vision-15B
tags:
- phi4
- phi-4
- gguf
- quantized
- llama-cpp
- ollama
- text-generation
- reasoning
model_type: phi3
quantized_by: jamesburton
pipeline_tag: text-generation
---
# Phi-4-reasoning-vision-15B-GGUF
GGUF format conversions of [microsoft/Phi-4-reasoning-vision-15B](https://huggingface.co/microsoft/Phi-4-reasoning-vision-15B) for use with [llama.cpp](https://github.com/ggerganov/llama.cpp) and [Ollama](https://ollama.com).
> **Note:** This conversion includes the **text backbone only** (language model weights). Vision encoder and multimodal projector weights are excluded, as llama.cpp does not yet support the `phi4-siglip` vision architecture. The text model is architecturally identical to Phi-4-reasoning-plus (`Phi3ForCausalLM`).
## Available Files
| Filename | Quant Type | Size | Description |
|---|---|---|---|
| `phi-4-reasoning-vision-f16.gguf` | F16 | ~28 GB | Full precision (float16) |
| `phi-4-reasoning-vision-q8_0.gguf` | Q8_0 | ~15 GB | 8-bit quantization (near-lossless) |
| `phi-4-reasoning-vision-q6_k.gguf` | Q6_K | ~12 GB | 6-bit K-quant |
| `phi-4-reasoning-vision-q5_k_m.gguf` | Q5_K_M | ~9.9 GB | 5-bit K-quant medium |
| `phi-4-reasoning-vision-q5_k_s.gguf` | Q5_K_S | ~9.5 GB | 5-bit K-quant small |
| `phi-4-reasoning-vision-q4_K_M.gguf` | Q4_K_M | ~8.5 GB | 4-bit K-quant medium (recommended) |
| `phi-4-reasoning-vision-q4_k_s.gguf` | Q4_K_S | ~7.9 GB | 4-bit K-quant small |
| `phi-4-reasoning-vision-q3_k_l.gguf` | Q3_K_L | ~7.4 GB | 3-bit K-quant large |
| `phi-4-reasoning-vision-q3_k_m.gguf` | Q3_K_M | ~6.9 GB | 3-bit K-quant medium |
| `phi-4-reasoning-vision-q3_k_s.gguf` | Q3_K_S | ~6.1 GB | 3-bit K-quant small |
| `phi-4-reasoning-vision-q2_k.gguf` | Q2_K | ~5.2 GB | 2-bit K-quant (smallest, lowest quality) |
## How to Use
### With Ollama
```bash
# Download the Q4_K_M GGUF and create a Modelfile:
cat > Modelfile <<'EOF'
FROM ./phi-4-reasoning-vision-q4_K_M.gguf
TEMPLATE """<|system|>
{{ if .System }}{{ .System }}{{ else }}You are a helpful AI assistant with vision capabilities. You can analyze images and reason about them step by step.{{ end }}<|end|>
<|user|>
{{ .Prompt }}<|end|>
<|assistant|>
"""
PARAMETER stop "<|end|>"
PARAMETER stop "<|endoftext|>"
PARAMETER temperature 0.7
PARAMETER top_p 0.9
PARAMETER num_ctx 4096
EOF
ollama create phi4-vision -f Modelfile
ollama run phi4-vision
```
### With llama.cpp
```bash
./llama-cli -m phi-4-reasoning-vision-q4_K_M.gguf -p "Explain the theory of relativity in simple terms." -n 512
```
## Model Details
- **Original Model:** [microsoft/Phi-4-reasoning-vision-15B](https://huggingface.co/microsoft/Phi-4-reasoning-vision-15B)
- **Architecture:** Phi3ForCausalLM (text backbone of Phi-4-reasoning-vision)
- **Parameters:** ~15B (text model)
- **Hidden Size:** 5120
- **Layers:** 40
- **Attention Heads:** 40 (10 KV heads, GQA)
- **Vocab Size:** 100,352
- **Tokenizer:** GPT-2 (BPE)
- **Context Length:** Up to 131,072 tokens (with RoPE scaling)
- **License:** [MIT](https://huggingface.co/microsoft/Phi-4-reasoning-vision-15B/blob/main/LICENSE)
## Conversion Details
- Converted using [llama.cpp](https://github.com/ggerganov/llama.cpp) `convert_hf_to_gguf.py`
- Vision tower (`model.vision_tower.*`) and multimodal projector (`model.mm_projector.*`) weights were skipped during conversion
- The model config was remapped from `Phi4ForCausalLMV` (phi4-siglip) to `Phi3ForCausalLM` (phi3) since the text backbone is architecturally identical
- Quantization performed via `llama_model_quantize()` with CUDA acceleration
- 243 text tensors converted, 452 vision tensors excluded
## Original Model Card
For full details on training, capabilities, safety, and intended use, please refer to the [original model card](https://huggingface.co/microsoft/Phi-4-reasoning-vision-15B).
## Disclaimer
This is an unofficial GGUF conversion. The original model was created by Microsoft Research. All credit for the model architecture, training, and capabilities belongs to the Microsoft Phi team. Please refer to the [original model's license](https://huggingface.co/microsoft/Phi-4-reasoning-vision-15B/blob/main/LICENSE) for usage terms.

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:de9e9e275b2bb4b58f6c9e14dcad3aede3024ee439387029c45b59df82a1875c
size 29323398624

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a17be98d16263c18eeca3e8f405cd2b0800c931514c9e6da81df52ea755b1bf7
size 5547347424

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:507cb93372afd5c4bee59993a77e8048fa6b993b082860ce759519ee6738df02
size 7930154464

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b50c64d6aa51495a9e3a964280a5bae8f2da625757ced1524d04571e5c997ed1
size 7363268064

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f32bcd9d0040bb0ac16e4a10cae5ab3f1d7ca05f1fe0f0cb7d844f4f2acf7739
size 6504746464

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a722757f84ddacb4bbd94402a8cdd298f12c8aabff6c4c947e7dc190037e6e7a
size 9053113824

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ac6250fdd2ee0203b5d253669925361091e4a4b3985d1483a7f76e53e69d2c79
size 8440761824

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1c29cfe965e3f63d49fc39f8ed9d656339a8f1468f60d20a65a930bb30c51b3b
size 10604187104

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a98895c418bf79b0f860805b74bc2e2f58285c07050fe47a0541709b10d2782b
size 10151579104

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:10e68ff9e22db49dbda59a707c90346e428f4ec452359d474c8b9e18d86b15f9
size 12030250464

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:668156993a6e88d216db164d89ddc2bc56960948679f63a9e461ee4e9a3b0978
size 15580499424