初始化项目,由ModelHub XC社区提供模型
Model: darkai-1/darkit-v3-1M Source: Original Platform
This commit is contained in:
3
Q3_K_M.gguf
Normal file
3
Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:10c72b16521e035100d5df00cd7cc6f51175382295b04a8b2aed1b7569907821
|
||||
size 7339203840
|
||||
3
Q4_K_M.gguf
Normal file
3
Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ff2f72766da9edad8ba8de7133d4f632c90164cef565d8c22a754b3ed48e9762
|
||||
size 8988110080
|
||||
3
Q8_0.gguf
Normal file
3
Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:04175ab8a03e9e6d619f0ecba399922f7afdcbce3335780bb0bd66f77ea3f7d5
|
||||
size 15701597440
|
||||
117
README.md
Normal file
117
README.md
Normal file
@@ -0,0 +1,117 @@
|
||||
---
|
||||
language: multilingual
|
||||
license: apache-2.0
|
||||
organization: darkai
|
||||
author: darkai
|
||||
library_name: llama.cpp
|
||||
tags:
|
||||
- darkit-3
|
||||
- DarkAI Company
|
||||
- text-generation
|
||||
- programming
|
||||
- reasoning
|
||||
base_model: darkai/darkit-v3
|
||||
model_type: custom
|
||||
pipeline_tag: text-generation
|
||||
---
|
||||
# DarkIT v3
|
||||
|
||||
DarkIT is a next-generation high-performance large language model engineered for **advanced programming, deep reasoning, and natural human conversation**.
|
||||
|
||||
DarkIT v3 introduces major improvements in:
|
||||
- Advanced code generation
|
||||
- Complex debugging & error analysis
|
||||
- Long-context reasoning
|
||||
- Multi-language programming support
|
||||
- Instruction following for difficult technical tasks
|
||||
- Architecture understanding & code refactoring
|
||||
- Stable conversational behavior
|
||||
- Fast and efficient local inference
|
||||
- Unrestricted responses with strong adaptability
|
||||
|
||||
---
|
||||
|
||||
# What's New in v3
|
||||
|
||||
DarkIT v3 has been significantly upgraded with a massive programming-focused training phase.
|
||||
|
||||
### Major Improvements
|
||||
- Trained on over **18 million high-quality programming conversations**
|
||||
- Strongly improved coding intelligence and reasoning
|
||||
- Better understanding of software architecture and system design
|
||||
- More accurate debugging and bug fixing
|
||||
- Improved instruction consistency
|
||||
- Better long-response stability
|
||||
- Reduced hallucinations in programming tasks
|
||||
- Faster response generation quality under long prompts
|
||||
|
||||
### Programming Capabilities
|
||||
DarkIT v3 performs strongly across:
|
||||
- Python
|
||||
- C++
|
||||
- JavaScript / TypeScript
|
||||
- Java
|
||||
- Rust
|
||||
- Go
|
||||
- PHP
|
||||
- SQL
|
||||
- Bash / Shell scripting
|
||||
- HTML / CSS
|
||||
- AI & Machine Learning workflows
|
||||
|
||||
---
|
||||
|
||||
# Key Specifications
|
||||
|
||||
- **Model Family:** DarkIT Coder
|
||||
- **Version:** v3
|
||||
- **Model Size:** 15B Parameters
|
||||
- **Context Length:** 1M Tokens
|
||||
- **Format:** GGUF (optimized for efficient local deployment)
|
||||
- **Inference Support:** CPU / GPU
|
||||
- **Primary Focus:** Programming & Technical Reasoning
|
||||
|
||||
---
|
||||
|
||||
# Performance Notes
|
||||
|
||||
- Optimized for strong local inference performance
|
||||
- Excellent balance between speed and output quality
|
||||
- Stable long-context generation
|
||||
- Enhanced code completion consistency
|
||||
- Improved logical reasoning across technical tasks
|
||||
- Designed for developer workflows and advanced prompting
|
||||
|
||||
---
|
||||
|
||||
# Recommended Usage
|
||||
|
||||
DarkIT v3 performs best when used for:
|
||||
- Software development
|
||||
- AI engineering tasks
|
||||
- Code generation
|
||||
- Debugging large projects
|
||||
- Technical explanations
|
||||
- Automation scripting
|
||||
- Long-context programming conversations
|
||||
- Local offline AI deployment
|
||||
|
||||
---
|
||||
|
||||
# ⚠️ Notes
|
||||
|
||||
- Designed primarily for inference deployment
|
||||
- Output quality may vary depending on quantization level and hardware
|
||||
- Best performance is achieved using structured prompts
|
||||
- Large context usage may require substantial RAM/VRAM
|
||||
|
||||
---
|
||||
|
||||
# About DarkAI
|
||||
|
||||
DarkAI is an independent AI research initiative focused on building efficient, powerful, and scalable language models for real-world applications, with a strong focus on programming intelligence and local AI deployment.
|
||||
|
||||
---
|
||||
|
||||
- **Website:** https://darkai.site
|
||||
- **Telegram:** https://t.me/sii_3
|
||||
3
config.json
Normal file
3
config.json
Normal file
@@ -0,0 +1,3 @@
|
||||
{
|
||||
"architectures": ["darkit-3-1m"]
|
||||
}
|
||||
93
notebook.ipynb
Normal file
93
notebook.ipynb
Normal file
@@ -0,0 +1,93 @@
|
||||
{
|
||||
"cells": [
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"!pip install llama-cpp-python huggingface_hub --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu124\n"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"from huggingface_hub import HfApi\n",
|
||||
"from llama_cpp import Llama\n",
|
||||
"import os\n",
|
||||
"\n",
|
||||
"REPO_ID = \"darkai-1/darkit-v3-1M\"\n",
|
||||
"api = HfApi()\n",
|
||||
"\n",
|
||||
"files = api.list_repo_files(REPO_ID)\n",
|
||||
"gguf_files = [f for f in files if f.endswith(\".gguf\")]\n",
|
||||
"\n",
|
||||
"for i, f in enumerate(gguf_files):\n",
|
||||
" print(f\"[{i}] {f}\")\n",
|
||||
"\n",
|
||||
"choice = int(input(\"Select model number: \"))\n",
|
||||
"filename = gguf_files[choice]\n",
|
||||
"\n",
|
||||
"llm = Llama.from_pretrained(\n",
|
||||
" repo_id=REPO_ID,\n",
|
||||
" filename=filename,\n",
|
||||
" n_ctx=2048,\n",
|
||||
" n_batch=128,\n",
|
||||
" n_ubatch=128,\n",
|
||||
" n_threads=os.cpu_count() or 4,\n",
|
||||
" n_threads_batch=os.cpu_count() or 4,\n",
|
||||
" n_gpu_layers=-1,\n",
|
||||
" verbose=False,\n",
|
||||
" no_perf=True,\n",
|
||||
")\n"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "code",
|
||||
"execution_count": null,
|
||||
"metadata": {},
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"llm.set_cache(None)\n",
|
||||
"\n",
|
||||
"PROMPT = \"Hi how are you?\"\n",
|
||||
"\n",
|
||||
"stream = llm(\n",
|
||||
" f\"<|im_start|>user\\n{PROMPT}<|im_end|>\\n<|im_start|>assistant\\n\",\n",
|
||||
" max_tokens=128,\n",
|
||||
" temperature=0.7,\n",
|
||||
" top_p=0.8,\n",
|
||||
" top_k=20,\n",
|
||||
" stream=True,\n",
|
||||
" stop=[\n",
|
||||
" \"<|im_end|>\",\n",
|
||||
" \"<|im_start|>\",\n",
|
||||
" \"\\n\\nUser:\",\n",
|
||||
" \"\\n\\nAssistant:\"\n",
|
||||
" ],\n",
|
||||
" echo=False\n",
|
||||
")\n",
|
||||
"\n",
|
||||
"for chunk in stream:\n",
|
||||
" text = chunk[\"choices\"][0][\"text\"]\n",
|
||||
"\n",
|
||||
" if text:\n",
|
||||
" print(text, end=\"\", flush=True)\n",
|
||||
"\n",
|
||||
"print()\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"metadata": {
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
"nbformat_minor": 0
|
||||
}
|
||||
Reference in New Issue
Block a user