初始化项目,由ModelHub XC社区提供模型

Model: darkai-1/darkit-v3-1M
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-20 14:14:17 +08:00
commit f4b8e22382
6 changed files with 222 additions and 0 deletions

3
Q3_K_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:10c72b16521e035100d5df00cd7cc6f51175382295b04a8b2aed1b7569907821
size 7339203840

3
Q4_K_M.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ff2f72766da9edad8ba8de7133d4f632c90164cef565d8c22a754b3ed48e9762
size 8988110080

3
Q8_0.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:04175ab8a03e9e6d619f0ecba399922f7afdcbce3335780bb0bd66f77ea3f7d5
size 15701597440

117
README.md Normal file
View File

@@ -0,0 +1,117 @@
---
language: multilingual
license: apache-2.0
organization: darkai
author: darkai
library_name: llama.cpp
tags:
- darkit-3
- DarkAI Company
- text-generation
- programming
- reasoning
base_model: darkai/darkit-v3
model_type: custom
pipeline_tag: text-generation
---
# DarkIT v3
DarkIT is a next-generation high-performance large language model engineered for **advanced programming, deep reasoning, and natural human conversation**.
DarkIT v3 introduces major improvements in:
- Advanced code generation
- Complex debugging & error analysis
- Long-context reasoning
- Multi-language programming support
- Instruction following for difficult technical tasks
- Architecture understanding & code refactoring
- Stable conversational behavior
- Fast and efficient local inference
- Unrestricted responses with strong adaptability
---
# What's New in v3
DarkIT v3 has been significantly upgraded with a massive programming-focused training phase.
### Major Improvements
- Trained on over **18 million high-quality programming conversations**
- Strongly improved coding intelligence and reasoning
- Better understanding of software architecture and system design
- More accurate debugging and bug fixing
- Improved instruction consistency
- Better long-response stability
- Reduced hallucinations in programming tasks
- Faster response generation quality under long prompts
### Programming Capabilities
DarkIT v3 performs strongly across:
- Python
- C++
- JavaScript / TypeScript
- Java
- Rust
- Go
- PHP
- SQL
- Bash / Shell scripting
- HTML / CSS
- AI & Machine Learning workflows
---
# Key Specifications
- **Model Family:** DarkIT Coder
- **Version:** v3
- **Model Size:** 15B Parameters
- **Context Length:** 1M Tokens
- **Format:** GGUF (optimized for efficient local deployment)
- **Inference Support:** CPU / GPU
- **Primary Focus:** Programming & Technical Reasoning
---
# Performance Notes
- Optimized for strong local inference performance
- Excellent balance between speed and output quality
- Stable long-context generation
- Enhanced code completion consistency
- Improved logical reasoning across technical tasks
- Designed for developer workflows and advanced prompting
---
# Recommended Usage
DarkIT v3 performs best when used for:
- Software development
- AI engineering tasks
- Code generation
- Debugging large projects
- Technical explanations
- Automation scripting
- Long-context programming conversations
- Local offline AI deployment
---
# ⚠️ Notes
- Designed primarily for inference deployment
- Output quality may vary depending on quantization level and hardware
- Best performance is achieved using structured prompts
- Large context usage may require substantial RAM/VRAM
---
# About DarkAI
DarkAI is an independent AI research initiative focused on building efficient, powerful, and scalable language models for real-world applications, with a strong focus on programming intelligence and local AI deployment.
---
- **Website:** https://darkai.site
- **Telegram:** https://t.me/sii_3

3
config.json Normal file
View File

@@ -0,0 +1,3 @@
{
"architectures": ["darkit-3-1m"]
}

93
notebook.ipynb Normal file
View File

@@ -0,0 +1,93 @@
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"!pip install llama-cpp-python huggingface_hub --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu124\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"from huggingface_hub import HfApi\n",
"from llama_cpp import Llama\n",
"import os\n",
"\n",
"REPO_ID = \"darkai-1/darkit-v3-1M\"\n",
"api = HfApi()\n",
"\n",
"files = api.list_repo_files(REPO_ID)\n",
"gguf_files = [f for f in files if f.endswith(\".gguf\")]\n",
"\n",
"for i, f in enumerate(gguf_files):\n",
" print(f\"[{i}] {f}\")\n",
"\n",
"choice = int(input(\"Select model number: \"))\n",
"filename = gguf_files[choice]\n",
"\n",
"llm = Llama.from_pretrained(\n",
" repo_id=REPO_ID,\n",
" filename=filename,\n",
" n_ctx=2048,\n",
" n_batch=128,\n",
" n_ubatch=128,\n",
" n_threads=os.cpu_count() or 4,\n",
" n_threads_batch=os.cpu_count() or 4,\n",
" n_gpu_layers=-1,\n",
" verbose=False,\n",
" no_perf=True,\n",
")\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"llm.set_cache(None)\n",
"\n",
"PROMPT = \"Hi how are you?\"\n",
"\n",
"stream = llm(\n",
" f\"<|im_start|>user\\n{PROMPT}<|im_end|>\\n<|im_start|>assistant\\n\",\n",
" max_tokens=128,\n",
" temperature=0.7,\n",
" top_p=0.8,\n",
" top_k=20,\n",
" stream=True,\n",
" stop=[\n",
" \"<|im_end|>\",\n",
" \"<|im_start|>\",\n",
" \"\\n\\nUser:\",\n",
" \"\\n\\nAssistant:\"\n",
" ],\n",
" echo=False\n",
")\n",
"\n",
"for chunk in stream:\n",
" text = chunk[\"choices\"][0][\"text\"]\n",
"\n",
" if text:\n",
" print(text, end=\"\", flush=True)\n",
"\n",
"print()\n"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
}
},
"nbformat": 4,
"nbformat_minor": 0
}