Initialize project; model provided by the ModelHub XC community

Model: Mungert/GELab-Zero-4B-preview-GGUF
Source: Original Platform
ModelHub XC
2026-06-16 19:58:21 +08:00
commit c6dd44b90b
32 changed files with 355 additions and 0 deletions

65
.gitattributes vendored Normal file
@@ -0,0 +1,65 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-f16_q8_0.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q2_k_m.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q2_k_s.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q3_k_s.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q6_k_m.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q4_k_s.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q4_0.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q4_1.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q5_0.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q5_1.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq2_xs.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq2_xxs.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq2_s.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq2_m.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq3_xs.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq3_xxs.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq3_m.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq4_xs.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-iq4_nl.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-imatrix.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-bf16.mmproj filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-q8_0.mmproj filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-f16.mmproj filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-bf16.gguf filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-f32.mmproj filter=lfs diff=lfs merge=lfs -text
GELab-Zero-4B-preview-bf16_q8_0.gguf filter=lfs diff=lfs merge=lfs -text

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:05022287fff9772b4891cd9e28cbce96a973cab24dd883789de74ca5de1d5580
size 8051286880

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:756ff53567285b8fe64f2e5862ef432b5a983afd4dcddfb3991b4b384479b851
size 839326528

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:79da3d343a53289a57a007a4dae729366dc7cde6a131ed28c7882f57ef97d80e
size 5927920480

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f6112bf086dc810c7a2a137d11e2bbdceacc82f524686bb48fac4d207a3c86c9
size 836180800

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:01415e62a613abc572fd97f891953eb38cc4e54be5cd544ebae3816d9c7cc8cc
size 5927920480

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b82f00922840cf40733dbbe5bae8e000dbdfc19a3fc02a0221a47d8a1d050010
size 1661410112

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:15e25dbd61e8c75fad1bba1f7dec84bc49688236655c59008a8c50f118bbda9f
size 3872640

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e3ddfac709f390efdbddea3e98d7939fb3dac16c30b86b2bc5860cb9b59f3861
size 1555748000

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:16e9f3a3b375a127efc9243446458b75443ffc9237afe4efa79a7e1369fe746f
size 1491522720

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7aa64e2140c9b9ff6c5ed202d40b3971687512b1565ca7ee4cfa76520104715d
size 1446180000

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:40aba88d31a359110bfb9ec0450c84a04b888c7feb1791b5f7b2d78dc5467939
size 1341936800

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a56034a254882adccc4ac2a1360d9f017786a323e3b4a7acd8bfdcb1e5dd5c8f
size 2003038880

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:88d1d419d761492dfc53016f9f08d7df81c1ebc00024c84136a83cee202bfc85
size 1824207520

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2017438322a3cc754c0caa3bb76087f74090cb8ea0ec22201e615f6af8c80329
size 1773703840

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:08ff89da7a630b596244628b9d0680f28b8833c29bf1c73771d6a2b34adde27c
size 2281067680

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:489472284e2e3a8c957cf4c65a1c4c3a70e1fa341d9acd9f53c46859f3f820b4
size 2270753440

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:165ab16ce6899ebbb8732ce4997a35ed5d63ab06979dc878749ea3c14323bed2
size 1591547040

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b593d2cc2428e95f4f6ed80c1632000b4798cc8c126a4caab8e66a9a338bc6e2
size 1542927520

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7522c8b2c2c6a18c3d3e5c9acf30e20270884ea6ae27086d1df840a288a7b8f9
size 2059563680

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5529da6bee57aa0b990595e79046b30b127beccbfebb18c2579ce525258a19a7
size 2007905440

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4d87de4b595338fb7a4884d71cbbb14af7806257a0fd14430bb1a47981327b93
size 2463749280

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0dc9ee930a19e06f328289d51d233d8f9c9cd9617a32af3afe4ca5ecc1ad9b40
size 2544972960

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2f7b62003045b76630855d6c48d5292e6bf26c7c906006f62140eed0c9a64011
size 2572723360

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8e3157d3ab81da28084915b6e73f912f1ff0040d66304e2045fa689812968b57
size 2407559840

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:56a2d7c2ec377b9132c9db53eb3b62bcac6d16877c95589c787c5d44fd044058
size 2917913760

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fdc92e4f5c011fe0f9ae5eb6c0f31db5c4cc41496b06a1100a6c413e9936574f
size 3144996000

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d5c59eb720b0bdffb911bc0b61508a03148389812c13f86518506ba4c0fa7bc3
size 2994857120

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:476d14a39f6cb01c3f1d1ff6bbaa72dbacca0ea2ce64a438b667861d1554d081
size 3400463520

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4a0892ed9d6e90d51d331fe3eaa63522f7e04a5dd4011aa2a3a50315169790b8
size 4280406880

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7df023d5c8029117eadeb581a574ba16554ca2795a698df3c5c6b0a8a7620e02
size 453974848
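
Each three-line block above is a Git LFS pointer: the spec version, the SHA-256 of the real file, and its size in bytes. As a minimal sketch (the helper names `parse_lfs_pointer` and `matches_pointer` are illustrative, not part of any tool in this repository), a downloaded file could be checked against its pointer like this:

```python
import hashlib
from pathlib import Path
from typing import NamedTuple


class LfsPointer(NamedTuple):
    version: str
    oid: str   # hex SHA-256 digest, without the "sha256:" prefix
    size: int  # size of the real file in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the "key value" lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    return LfsPointer(fields["version"], digest, int(fields["size"]))


def matches_pointer(path: Path, pointer: LfsPointer, chunk: int = 1 << 20) -> bool:
    """Return True if the file has the size and SHA-256 recorded in the pointer."""
    if path.stat().st_size != pointer.size:
        return False  # cheap check first; avoids hashing multi-GB files needlessly
    sha = hashlib.sha256()
    with path.open("rb") as f:
        while block := f.read(chunk):
            sha.update(block)
    return sha.hexdigest() == pointer.oid


# One of the pointers from this commit, copied verbatim.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:15e25dbd61e8c75fad1bba1f7dec84bc49688236655c59008a8c50f118bbda9f\n"
    "size 3872640\n"
)
print(pointer.size)
```

Comparing the size before hashing keeps the check fast when a download was truncated.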

200
README.md Normal file
@@ -0,0 +1,200 @@
---
base_model: Qwen/Qwen3-VL-4B-Instruct
language:
- en
- zh
license: apache-2.0
tags:
- gui-agent
- phone-use agent
- computer-use agent
- pua
- android
- multimodal
- gelab-zero
pipeline_tag: image-text-to-text
library_name: transformers
---
# <span style="color: #7FFF7F;">GELab-Zero-4B-preview GGUF Models</span>
## <span style="color: #7F7FFF;">Model Generation Details</span>
This model was generated using [llama.cpp](https://github.com/ggerganov/llama.cpp) at commit [`e68c19b0f`](https://github.com/ggerganov/llama.cpp/commit/e68c19b0fdbb18d7a19217194c2795897e9c683c).
---
## <span style="color: #7FFF7F;">Quantization Beyond the IMatrix</span>
I've been experimenting with a new quantization approach that selectively elevates the precision of key layers beyond what the default IMatrix configuration provides.
In my testing, standard IMatrix quantization underperforms at lower bit depths, especially with Mixture of Experts (MoE) models. To address this, I'm using the `--tensor-type` option in `llama.cpp` to manually "bump" important layers to higher precision. You can see the implementation here:
👉 [Layer bumping with llama.cpp](https://github.com/Mungert69/GGUFModelBuilder/blob/main/model-converter/tensor_list_builder.py)
While this does increase model file size, it significantly improves precision for a given quantization level.
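
To make the idea concrete, here is a rough sketch of how a `llama-quantize` command line with `--tensor-type` overrides might be assembled. This is not the logic of `tensor_list_builder.py`: the function name, the tensor patterns (`attn_v`, `ffn_down`), and the target types are hypothetical placeholders. Passing the imatrix GGUF file to `--imatrix` is also an assumption.

```python
import shlex
from typing import Dict, List, Optional


def build_quantize_command(
    src: str,
    dst: str,
    base_type: str,
    bumped: Dict[str, str],
    imatrix: Optional[str] = None,
) -> List[str]:
    """Build an argument list for llama-quantize with per-tensor overrides.

    `bumped` maps a tensor-name pattern to the higher-precision type it
    should be quantized to instead of `base_type`.
    """
    cmd = ["llama-quantize"]
    if imatrix:
        cmd += ["--imatrix", imatrix]
    for pattern, ggml_type in bumped.items():
        # Each override is passed as --tensor-type PATTERN=TYPE.
        cmd += ["--tensor-type", f"{pattern}={ggml_type}"]
    # Positional arguments come last: input, output, base quantization type.
    cmd += [src, dst, base_type]
    return cmd


# Hypothetical example: quantize to Q4_K_M but keep two tensor groups at Q6_K.
cmd = build_quantize_command(
    src="GELab-Zero-4B-preview-bf16.gguf",
    dst="GELab-Zero-4B-preview-q4_k_m.gguf",
    base_type="Q4_K_M",
    bumped={"attn_v": "q6_k", "ffn_down": "q6_k"},
    imatrix="GELab-Zero-4B-preview-imatrix.gguf",
)
print(shlex.join(cmd))
```

The sketch only builds and prints the command; running it would require a local llama.cpp build and the source GGUF file.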
### **I'd love your feedback—have you tried this? How does it perform for you?**
---
<a href="https://readyforquantum.com/huggingface_gguf_selection_guide.html" style="color: #7FFF7F;">
Click here to get info on choosing the right GGUF model format
</a>
---
<!--Begin Original Model Card-->
# GELab-Zero-4B-preview
This model is part of the **GELab-Zero** project, which presents the **Step-GUI Technical Report** [[Paper](https://huggingface.co/papers/2512.15431)] [[Project Page](https://opengelab.github.io/)] [[Code](https://github.com/stepfun-ai/gelab-zero)].
## Model Details
This model is part of the [**GELab-Zero**](https://github.com/stepfun-ai/gelab-zero) project, which aims to accelerate the innovation and application deployment of GUI Agents by providing:
1. **A 4B GUI Agent model** capable of running on local computers.
2. **Plug-and-play inference infrastructure** that handles ADB connections, dependency installation, and task recording/replay (**available in the** [**GELab-Zero**](https://github.com/stepfun-ai/gelab-zero) **repository**).
### Key Capabilities
* **Local Deployment**: Optimized for consumer-grade hardware, balancing low latency with privacy.
* **GUI Navigation**: Proficient in detecting and interacting with UI elements (click, type, slide, wait, etc.) based on visual cues.
* **Complex Task Execution**: Handles multi-step long-horizon tasks across various apps (Food, Transportation, Shopping, Social, etc.).
* **Open-World Generalization**: Capable of zero-shot operation across diverse unseen applications and complex dynamic interfaces without requiring app-specific adaptation.
## Usage
### Quick Start with Ollama
The easiest way to run inference is using Ollama.
1. **Install Ollama**: Download from [ollama.com](https://ollama.com/).
2. **Download the Model**:
```bash
# Install huggingface-cli
pip install huggingface_hub
# Download model
huggingface-cli download --resume-download stepfun-ai/GELab-Zero-4B-preview --local-dir gelab-zero-4b-preview
```
3. **Create and Run in Ollama**:
```bash
cd gelab-zero-4b-preview
ollama create gelab-zero-4b-preview -f Modelfile
# Test the model
curl -X POST http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gelab-zero-4b-preview",
    "messages": [{"role": "user", "content": "Hello, GELab-Zero!"}]
  }'
```
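
The same test request can be sketched in Python using only the standard library. The endpoint and model name are taken from the `curl` example. The optional screenshot attachment uses the OpenAI-style `image_url` content part and is an assumption, since the model card does not state the prompt format GELab-Zero expects for images. The helper names `build_chat_request` and `send` are illustrative.

```python
import base64
import json
import urllib.request
from typing import Optional

OLLAMA_URL = "http://localhost:11434/v1/chat/completions"
MODEL_NAME = "gelab-zero-4b-preview"


def build_chat_request(prompt: str, image_png: Optional[bytes] = None) -> urllib.request.Request:
    """Build a POST request for Ollama's OpenAI-compatible chat endpoint."""
    if image_png is None:
        content = prompt  # plain string, exactly as in the curl example
    else:
        # Assumption: OpenAI-style content parts with a base64 data URL.
        b64 = base64.b64encode(image_png).decode("ascii")
        content = [
            {"type": "text", "text": prompt},
            {"type": "image_url", "image_url": {"url": f"data:image/png;base64,{b64}"}},
        ]
    payload = {"model": MODEL_NAME, "messages": [{"role": "user", "content": content}]}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request) -> str:
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(request) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]


# Building the request does not touch the network; send() does.
request = build_chat_request("Hello, GELab-Zero!")
print(request.full_url)
```

With Ollama running and the model created as above, `send(request)` would return the reply text.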
To use this model for actual Android device control (ADB connection, task execution), please use the [GELab-Zero](https://github.com/stepfun-ai/gelab-zero) repository.
## Citation
If you find GELab-Zero-4B-preview useful for your research, please consider citing our work :)
```bibtex
@misc{yan2025stepguitechnicalreport,
title={Step-GUI Technical Report},
author={Haolong Yan and Jia Wang and Xin Huang and Yeqing Shen and Ziyang Meng and Zhimin Fan and Kaijun Tan and Jin Gao and Lieyu Shi and Mi Yang and Shiliang Yang and Zhirui Wang and Brian Li and Kang An and Chenyang Li and Lei Lei and Mengmeng Duan and Danxun Liang and Guodong Liu and Hang Cheng and Hao Wu and Jie Dong and Junhao Huang and Mei Chen and Renjie Yu and Shunshan Li and Xu Zhou and Yiting Dai and Yineng Deng and Yingdan Liang and Zelin Chen and Wen Sun and Chengxu Yan and Chunqin Xu and Dong Li and Fengqiong Xiao and Guanghao Fan and Guopeng Li and Guozhen Peng and Hongbing Li and Hang Li and Hongming Chen and Jingjing Xie and Jianyong Li and Jingyang Zhang and Jiaju Ren and Jiayu Yuan and Jianpeng Yin and Kai Cao and Liang Zhao and Liguo Tan and Liying Shi and Mengqiang Ren and Min Xu and Manjiao Liu and Mao Luo and Mingxin Wan and Na Wang and Nan Wu and Ning Wang and Peiyao Ma and Qingzhou Zhang and Qiao Wang and Qinlin Zeng and Qiong Gao and Qiongyao Li and Shangwu Zhong and Shuli Gao and Shaofan Liu and Shisi Gao and Shuang Luo and Xingbin Liu and Xiaojia Liu and Xiaojie Hou and Xin Liu and Xuanti Feng and Xuedan Cai and Xuan Wen and Xianwei Zhu and Xin Liang and Xin Liu and Xin Zhou and Yingxiu Zhao and Yukang Shi and Yunfang Xu and Yuqing Zeng and Yixun Zhang and Zejia Weng and Zhonghao Yan and Zhiguo Huang and Zhuoyu Wang and Zheng Ge and Jing Li and Yibo Zhu and Binxing Jiao and Xiangyu Zhang and Daxin Jiang},
year={2025},
eprint={2512.15431},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.15431},
}
@software{gelab_zero_2025,
title={GELab-Zero: An Advanced Mobile Agent Inference System},
author={GELab Team},
year={2025},
url={https://github.com/stepfun-ai/gelab-zero}
}
@misc{gelab_engine,
title={GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning},
author={Haolong Yan and Yeqing Shen and Xin Huang and Jia Wang and Kaijun Tan and Zhixuan Liang and Hongxin Li and Zheng Ge and Osamu Yoshie and Si Li and Xiangyu Zhang and Daxin Jiang},
year={2025},
eprint={2512.02423},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.02423},
}
```
<!--End Original Model Card-->
---
# <span id="testllm" style="color: #7F7FFF;">🚀 If you find these models useful</span>
Help me test my **AI-Powered Quantum Network Monitor Assistant** with **quantum-ready security checks**:
👉 [Quantum Network Monitor](https://readyforquantum.com/?assistant=open&utm_source=huggingface&utm_medium=referral&utm_campaign=huggingface_repo_readme)
The full open-source code for the Quantum Network Monitor service is available in my GitHub repos (those with NetworkMonitor in the name): [Source Code Quantum Network Monitor](https://github.com/Mungert69). You will also find the code I use to quantize the models, if you want to do it yourself: [GGUFModelBuilder](https://github.com/Mungert69/GGUFModelBuilder).
💬 **How to test**:
Choose an **AI assistant type**:
- `TurboLLM` (GPT-4.1-mini)
- `HugLLM` (Hugging Face open-source models)
- `TestLLM` (Experimental CPU-only)
### **What I'm Testing**
I'm pushing the limits of **small open-source models for AI network monitoring**, specifically:
- **Function calling** against live network services
- **How small can a model go** while still handling:
  - Automated **Nmap security scans**
  - **Quantum-readiness checks**
  - **Network Monitoring tasks**
🟡 **TestLLM**: current experimental model (llama.cpp on 2 CPU threads in a Hugging Face Docker Space):
- **Zero-configuration setup**
- ⏳ 30s load time (slow inference but **no API costs**). No token limit, as the cost is low.
- 🔧 **Help wanted!** If you're into **edge-device AI**, let's collaborate!
### **Other Assistants**
🟢 **TurboLLM** uses **gpt-4.1-mini**:
- **It performs very well, but unfortunately OpenAI charges per token, so token usage is limited.**
- **Create custom cmd processors to run .net code on Quantum Network Monitor Agents**
- **Real-time network diagnostics and monitoring**
- **Security Audits**
- **Penetration testing** (Nmap/Metasploit)
🔵 **HugLLM**: latest open-source models:
- 🌐 Runs on the Hugging Face Inference API. Performs pretty well using the latest models hosted on Novita.
### 💡 **Example commands you could test**:
1. `"Give me info on my website's SSL certificate"`
2. `"Check if my server is using quantum safe encryption for communication"`
3. `"Run a comprehensive security audit on my server"`
4. `"Create a cmd processor to .. (whatever you want)"` Note: you need to install a [Quantum Network Monitor Agent](https://readyforquantum.com/Download/?utm_source=huggingface&utm_medium=referral&utm_campaign=huggingface_repo_readme) to run the .NET code on. This is a very flexible and powerful feature. Use with caution!
### Final Word
I fund the servers used to create these model files, run the Quantum Network Monitor service, and pay for inference from Novita and OpenAI—all out of my own pocket. All the code behind the model creation and the Quantum Network Monitor project is [open source](https://github.com/Mungert69). Feel free to use whatever you find helpful.
If you appreciate the work, please consider [buying me a coffee](https://www.buymeacoffee.com/mahadeva) ☕. Your support helps cover service costs and allows me to raise token limits for everyone.
I'm also open to job opportunities or sponsorship.
Thank you! 😊