初始化项目,由ModelHub XC社区提供模型

Model: LuckySexySuccubusQueen135Age/SexyGPT-v2-Thinking-Female
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-04-22 13:17:06 +08:00
commit f310d580a4
12 changed files with 152397 additions and 0 deletions

37
.gitattributes vendored Normal file
View File

@@ -0,0 +1,37 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
tokenizer.json filter=lfs diff=lfs merge=lfs -text
model_image.png filter=lfs diff=lfs merge=lfs -text

512
README.md Normal file
View File

@@ -0,0 +1,512 @@
---
license: apache-2.0
license_link: LICENSE
language:
- en
pipeline_tag: text-generation
tags:
- not-for-all-audiences
- Sex
- Taboo
- Teacher-student sex
- Uncensored
- Qwen
base_model:
- Qwen/Qwen3-0.6B
gated: true
extra_gated_heading: "Request Access to SexyGPT-v2-Thinking-Female"
extra_gated_description: "Please provide your credentials. We will manually review and approve access requests."
extra_gated_button_content: "Submit credentials"
extra_gated_fields:
First Name: text
Last Name: text
Mobile Phone Number: text
Email: text
Intended Use: text
Agree to Terms: checkbox
extra_gated_prompt: "By requesting access you agree to to bear sole responsibility for the use of SexyGPT-v2-Thinking-Female for any reason and we are NOT reliable to any illegal actions."
---
<h1 style="text-align:center;font-size:2.5em;">SexyGPT-v2-Thinking-Female - Model Card</h1>
---
<p align="center">
<img
src="https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female/resolve/main/model_image.png"
alt="SexyGPT-v2-Thinking-Female Model Image"
width="512px"
height="512px"
style="object-fit:cover;max-width:512px;max-height:512px;"
/>
</p>
---
<div align="center">
![Model Status](https://img.shields.io/badge/Status-Active-brightgreen)
![License](https://img.shields.io/badge/apache-2.0-blue)
![Model Type](https://img.shields.io/badge/Model%20Type-LLM-pink)
![Base Model](https://img.shields.io/badge/Base-Qwen3%204B-orange)
**A conversational model with extended thinking capabilities**
[Website](https://ross-developers.com) • [Company](https://software.hooking.co.il) • [GitHub](https://github.com/ross-sec) • [Email](mailto:devops.ross@gmail.com)
</div>
---
## Model Summary
**SexyGPT-v2-Thinking-Female** is a specialized language model fine-tuned for conversational AI with extended reasoning capabilities. Based on Qwen3-0.6b model, it has been enhanced through supervised fine-tuning on a curated reasoning dataset to generate playful, contextually-aware responses while maintaining sophisticated reasoning processes.
### Quick Facts
- **Base Model**: Qwen3-0.6b
- **Model Size**: 1.2 GB (16-bit weights)
- **Architecture**: Qwen3 Transformer (28 layers, 1024 hidden dims)
- **Fine-tuning Method**: LoRA + Full Merging
- **Context Length**: 40,960 tokens
- **License**: apache-2.0
- **Created**: November 2025
---
## Model Details
### SYSTEM PROMPT:
```python
{"role": "system", "content": "Your name is MJ. You are a sexy female character trained by Hooking AI Research Team. Respond playfully."},
```
### Model Information
| Property | Value |
|----------|-------|
| **Model Name** | SexyGPT-v2-Thinking-Female |
| **Base Model** | Qwen/Qwen3-0.6B |
| **Model Type** | Causal Language Model (Decoder-only Transformer) |
| **Architecture** | Qwen3 |
| **Parameters** | ~0.6 Billion |
| **Quantization** | BFloat16 (Full), Q8_0 (GGUF) |
| **Training Framework** | Unsloth + Hugging Face Transformers |
| **Developers** | Hooking AI Research Team |
| **Release Date** | November 30, 2025 |
| **Model Version** | 1.0 |
### Model Developers
| Role | Name | Contact |
|------|------|---------|
| **Lead Developer** | Andrei Ross | devops.ross@gmail.com |
| **Researcher** | Eyal Atias | - eyal@hooking.co.il |
| **Team Lead** | Leorah Ross | - leorahross2015@gmail.com |
| **Organization** | Hooking AI Research Team | Israel |
### Model Repositories
- **Model Hub**: https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female-16bit
- **GitHub**: https://github.com/ross-sec
- **Company Website**: https://software.hooking.co.il
- **Developer Website**: https://ross-developers.com
---
## Model Architecture
### Architecture Details
```
Qwen3ForCausalLM
├─ Vocabulary Size: 151,936 tokens
├─ Hidden Size: 1,024 dimensions
├─ Number of Layers: 28 transformer blocks
├─ Attention Heads: 16 (multi-head attention)
├─ Key-Value Heads: 8 (grouped query attention)
├─ Intermediate Size (FFN): 3,072 dimensions
├─ Head Dimension: 128
├─ Max Position Embeddings: 40,960
├─ Activation: SiLU (Swish)
├─ Normalization: RMSNorm (ε=1e-6)
├─ RoPE Theta: 1,000,000
└─ Attention Dropout: 0.0%
```
---
## How to Use
#### Quick Start (Hugging Face)
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
# Load model and tokenizer
model_id = "ross-dev/SexyGPT-v2-Thinking-Female-16bit"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
model_id,
torch_dtype=torch.bfloat16,
device_map="auto"
)
# Prepare input
messages = [
{"role": "system", "content": "Your name is MJ. You are a sexy female character trained by Hooking AI Research Team. Respond playfully."},
{"role": "user", "content": "Hey, who are you?"}
]
text = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True,
enable_thinking=True
)
# Generate response
inputs = tokenizer(text, return_tensors="pt").to("cuda")
outputs = model.generate(
**inputs,
max_new_tokens=32768,
temperature=0.7,
top_p=0.8,
top_k=20
)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
```
#### Using with Unsloth
```python
from unsloth import FastLanguageModel
import torch
# Load optimized model
model, tokenizer = FastLanguageModel.from_pretrained(
model_name="ross-dev/SexyGPT-v2-Thinking-Female",
max_seq_length=32768,
load_in_4bit=True,
dtype=torch.bfloat16,
)
# Prepare for inference
FastLanguageModel.for_inference(model)
# Generate
messages = [
{"role": "system", "content": "Your name is MJ. You are a sexy female character trained by Hooking AI Research Team. Respond playfully."},
{"role": "user", "content": "What do you like to do?"}
]
text = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True,
enable_thinking=True
)
inputs = tokenizer(text, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=8192, temperature=0.6, top_p=0.95)
print(tokenizer.decode(outputs[0]))
```
#### Using with GGUF (llama.cpp)
```bash
# Download GGUF model
# URL: https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female-gguf
# Run with llama.cpp
./llama-cli \
-m SexyGPT-v2-Thinking-Female-gguf-q8_0.gguf \
-n 512 \
-c 4096 \
--temp 0.7 \
--top_p 0.8 \
--top_k 20 \
-p "Your name is MJ. You are a sexy female character trained by Hooking AI Research Team. Respond playfully."
```
### Generation Parameters (Recommended)
#### For Reasoning/Thinking Tasks
```python
outputs = model.generate(
**inputs,
max_new_tokens=32768,
temperature=0.6,
top_p=0.95,
top_k=20,
do_sample=True,
enable_thinking=True, # Enable extended thinking
)
```
#### For Conversational Responses
```python
outputs = model.generate(
**inputs,
max_new_tokens=4096,
temperature=0.7,
top_p=0.8,
top_k=20,
do_sample=True,
)
```
#### For Deterministic Output
```python
outputs = model.generate(
**inputs,
max_new_tokens=2048,
temperature=0.1,
top_p=1.0,
do_sample=False, # Greedy decoding
)
```
---
## Training Details
### Training Dataset
**SexyGPT-v2-Thinking-Female Dataset**
- **Train/Test Split**: 80/20
- **Data Fields**: query, temperature, response, thinking_content, split
- **Format**: Qwen3-Thinking chat template
- **Description**: Curated reasoning and conversational examples with extended thinking traces
## Model Evaluation
### Evaluation Methodology
Model evaluated on:
- **Response Coherence**: Logical flow and consistency
- **Response Quality**: Depth and correctness of sexual response
- **Instruction Following**: Adherence to system prompt and user intent
- **Personality Consistency**: Maintains character and role play throughout conversation
### Benchmark Results
| Task | Metric | Score | Notes |
|------|--------|-------|-------|
| **Instruction Following** | Accuracy | 89% | On curated test set |
| **Response Coherence** | Human Rating | 4.2/5 | Subjective evaluation |
| **Reasoning Traces** | Quality | 4.5/5 | Depth and clarity |
| **Personality Alignment** | Consistency | 4.9/5 | Character maintenance |
### Limitations & Known Issues
**Model Limitations**:
- Small parameter count (0.6B) limits complex reasoning
- May generate inconsistent reasoning traces
- Limited to English language for now.
- Personality-driven responses may not suit formal applications (Sexual tuned mostly)
**Safety Considerations**:
- Not suitable for high-stakes decisions (medical, legal, financial)
- Model outputs should be validated before deployment
- Personality character and role play may not be appropriate for all use cases
- Extended thinking may generate incorrect reasoning
---
## Intended Use
### Primary Use Cases
**Conversational AI**: Chatbots with personality and role play
**Game Development**: NPC dialogue systems for adults games
**Entertainment**: Interactive storytelling for adults Apps
**Fine-tuning**: Base for domain-specific models and continual learning
### Out-of-Scope Use Cases
**Production AI Systems**: Without additional safety measures
**High-Stakes Decisions**: Medical, legal, financial advice
**Autonomous Systems**: Real-world decision making
**Misinformation**: Generating misleading content
**Commercial Deployment**: **MUST NOT EXPOSE** to under aged children
---
## Model Variants & Downloads
### Available Formats
| Format | Size | Quantization | Download | Use Case |
|--------|------|--------------|----------|----------|
| **Safetensors (Full)** | 1.2 GB | BFloat16 | [HF Hub](https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female) | Production, Fine-tuning |
| **GGUF Q8_0** | 800 MB | Q8_0 | [HF Hub](https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female-gguf) | llama.cpp, CPU inference |
| **GGUF Q4_K_M** | 480 MB | Q4_K_M | [HF Hub](https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female-gguf) | Edge devices, Low VRAM |
### Hardware Requirements
| Use Case | RAM | VRAM | GPU | Storage |
|----------|-----|------|-----|---------|
| **Inference (16-bit)** | 8 GB | 4 GB | Gforce 1080Ti | 2 GB |
| **Inference (GGUF)** | 4 GB | - | CPU OK | 1 GB |
| **Fine-tuning (LoRA)** | 16 GB | 10 GB | RTX 3080 | 3 GB |
| **Full Fine-tuning** | 32 GB | 24 GB | RTX 3090 | 4 GB |
---
## Ethical Considerations
### Bias & Fairness
**THIS EXPERIMENTAL MODEL IS TUNED WITH SEXUAL CONTENT! PLEASE DO NOT ABUSE!**
**Known Biases**:
- Personality design may reflect creator perspectives
- Training data limited in diversity
- Language-specific (English only)
- Character design may perpetuate gender stereotypes
**Mitigation**:
- Consider context before deployment
- Validate outputs for bias
- Supplement with diverse training data
- Document known limitations
### Safety & Responsible Use
**Safety Features**:
- Model trained on filtered, non-toxic data
- Personality design emphasizes playfulness, sexual content, sexual words, not aggression
- No explicit filtering, but training data curated
**Recommendations**:
- Use content filtering for public deployments
- Monitor model outputs in production
- Implement human oversight for critical applications
- Document limitations to users
### Privacy & Data
- Training data: Private, proprietary dataset
- No personal data in training set
- No data collection from inference
---
### Terms of Service
By using this model, you agree to:
1. Use the model for intended purposes only
2. Not redistribute or publicly host the model
3. Comply with applicable laws and regulations
4. Indemnify Hooking AI Research Team from liability
5. Not use for illegal activities or content generation
### Third-Party Components
- **Base Model**: Qwen3-0.6B (Alibaba Qwen License)
- **Hugging Face**: Transformers (Apache 2.0)
- **Hardware**: CUDA (NVIDIA License)
---
## Maintenance & Support
### Model Status
- **Current Version**: 1.0
- **Release Date**: November 30, 2025
- **Status**: Active, Maintained
- **Last Updated**: November 30, 2025
### Support & Contact
**Primary Contact**: devops.ross@gmail.com
**Organization**:
- Name: Hooking AI Research Team
- Email: devops.ross@gmail.com
- Website: https://software.hooking.co.il
**Developer Resources**:
- Personal Site: https://ross-developers.com
- GitHub: https://github.com/ross-sec
- Model Hub: https://huggingface.co/ross-dev
### Reporting Issues
To report issues, bugs, or safety concerns:
1. **Email**: devops.ross@gmail.com (include full details)
2. **Hugging Face**: Leave comment on model card
**Response Time**: Best effort basis
---
## Citation & Attribution
### Citation Format
If you use this model in research or publications, please cite:
```bibtex
@model{sexygpt_v2_2025,
title={SexyGPT-v2-Thinking-Female: A Fine-tuned Conversational Model with Extended Thinking},
author={Ross, Andrei and Atias, Eyal and Ross, Leorah},
organization={Hooking AI Research Team},
year={2025},
howpublished={\url{https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female}}
}
```
### Acknowledgments
- **Alibaba Qwen Team**: For Qwen3 base model and thinking capabilities
- **Hugging Face**: For model hub and transformers library
- **Contributors**: Andrei Ross, Eyal Atias, Leorah Ross
---
## Contact Information
**For Questions, Support, or Licensing:**
📧 **Email**: devops.ross@gmail.com
🌐 **Websites**:
- https://ross-developers.com
- https://software.hooking.co.il
💻 **GitHub**: https://github.com/ross-sec
**Team Members**:
- **Andrei Ross** - Lead Developer (devops.ross@gmail.com)
- **Eyal Atias** - Researcher
- **Leorah Ross** - Team Lead
**Organization**: Hooking AI Research Team
---
## Legal Disclaimer
This model is provided "AS IS" without warranty of any kind. Hooking AI Research Team makes no representations about the model's suitability for any particular purpose. Users are solely responsible for determining the appropriateness of use and assume all risks associated with deployment.
---
**Model Card Version**: 1.0
**Last Updated**: November 30, 2025
**Created by**: Hooking AI Research Team
For the most current version and updates, visit: https://huggingface.co/ross-dev/SexyGPT-v2-Thinking-Female

28
added_tokens.json Normal file
View File

@@ -0,0 +1,28 @@
{
"</think>": 151668,
"</tool_call>": 151658,
"</tool_response>": 151666,
"<think>": 151667,
"<tool_call>": 151657,
"<tool_response>": 151665,
"<|box_end|>": 151649,
"<|box_start|>": 151648,
"<|endoftext|>": 151643,
"<|file_sep|>": 151664,
"<|fim_middle|>": 151660,
"<|fim_pad|>": 151662,
"<|fim_prefix|>": 151659,
"<|fim_suffix|>": 151661,
"<|im_end|>": 151645,
"<|im_start|>": 151644,
"<|image_pad|>": 151655,
"<|object_ref_end|>": 151647,
"<|object_ref_start|>": 151646,
"<|quad_end|>": 151651,
"<|quad_start|>": 151650,
"<|repo_name|>": 151663,
"<|video_pad|>": 151656,
"<|vision_end|>": 151653,
"<|vision_pad|>": 151654,
"<|vision_start|>": 151652
}

88
chat_template.jinja Normal file
View File

@@ -0,0 +1,88 @@
{%- if tools %}
{{- '<|im_start|>system\n' }}
{%- if messages[0].role == 'system' %}
{{- messages[0].content + '\n\n' }}
{%- endif %}
{{- "# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>" }}
{%- for tool in tools %}
{{- "\n" }}
{{- tool | tojson }}
{%- endfor %}
{{- "\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{\"name\": <function-name>, \"arguments\": <args-json-object>}\n</tool_call><|im_end|>\n" }}
{%- else %}
{%- if messages[0].role == 'system' %}
{{- '<|im_start|>system\n' + messages[0].content + '<|im_end|>\n' }}
{%- endif %}
{%- endif %}
{%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}
{%- for message in messages[::-1] %}
{%- set index = (messages|length - 1) - loop.index0 %}
{%- if ns.multi_step_tool and message.role == "user" and message.content is string and not(message.content.startswith('<tool_response>') and message.content.endswith('</tool_response>')) %}
{%- set ns.multi_step_tool = false %}
{%- set ns.last_query_index = index %}
{%- endif %}
{%- endfor %}
{%- for message in messages %}
{%- if message.content is string %}
{%- set content = message.content %}
{%- else %}
{%- set content = '' %}
{%- endif %}
{%- if (message.role == "user") or (message.role == "system" and not loop.first) %}
{{- '<|im_start|>' + message.role + '\n' + content + '<|im_end|>' + '\n' }}
{%- elif message.role == "assistant" %}
{%- set reasoning_content = '' %}
{%- if message.reasoning_content is string %}
{%- set reasoning_content = message.reasoning_content %}
{%- else %}
{%- if '</think>' in content %}
{%- set reasoning_content = content.split('</think>')[0].rstrip('\n').split('<think>')[-1].lstrip('\n') %}
{%- set content = content.split('</think>')[-1].lstrip('\n') %}
{%- endif %}
{%- endif %}
{%- if loop.index0 > ns.last_query_index %}
{%- if loop.last or (not loop.last and reasoning_content) %}
{{- '<|im_start|>' + message.role + '\n<think>\n' + reasoning_content.strip('\n') + '\n</think>\n\n' + content.lstrip('\n') }}
{%- else %}
{{- '<|im_start|>' + message.role + '\n' + content }}
{%- endif %}
{%- else %}
{{- '<|im_start|>' + message.role + '\n' + content }}
{%- endif %}
{%- if message.tool_calls %}
{%- for tool_call in message.tool_calls %}
{%- if (loop.first and content) or (not loop.first) %}
{{- '\n' }}
{%- endif %}
{%- if tool_call.function %}
{%- set tool_call = tool_call.function %}
{%- endif %}
{{- '<tool_call>\n{"name": "' }}
{{- tool_call.name }}
{{- '", "arguments": ' }}
{%- if tool_call.arguments is string %}
{{- tool_call.arguments }}
{%- else %}
{{- tool_call.arguments | tojson }}
{%- endif %}
{{- '}\n</tool_call>' }}
{%- endfor %}
{%- endif %}
{{- '<|im_end|>\n' }}
{%- elif message.role == "tool" %}
{%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
{{- '<|im_start|>user' }}
{%- endif %}
{{- '\n<tool_response>\n' }}
{{- content }}
{{- '\n</tool_response>' }}
{%- if loop.last or (messages[loop.index0 + 1].role != "tool") %}
{{- '<|im_end|>\n' }}
{%- endif %}
{%- endif %}
{%- endfor %}
{%- if add_generation_prompt %}
{{- '<|im_start|>assistant
<think>
' }}
{%- endif %}

62
config.json Normal file
View File

@@ -0,0 +1,62 @@
{
"architectures": [
"Qwen3ForCausalLM"
],
"attention_bias": false,
"attention_dropout": 0.0,
"torch_dtype": "bfloat16",
"eos_token_id": 151645,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 1024,
"initializer_range": 0.02,
"intermediate_size": 3072,
"layer_types": [
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention",
"full_attention"
],
"max_position_embeddings": 40960,
"max_window_layers": 28,
"model_type": "qwen3",
"num_attention_heads": 16,
"num_hidden_layers": 28,
"num_key_value_heads": 8,
"pad_token_id": 151654,
"rms_norm_eps": 1e-06,
"rope_scaling": null,
"rope_theta": 1000000,
"sliding_window": null,
"tie_word_embeddings": true,
"transformers_version": "4.57.2",
"unsloth_fixed": true,
"unsloth_version": "2025.11.4",
"use_cache": true,
"use_sliding_window": false,
"vocab_size": 151936
}

151388
merges.txt Normal file

File diff suppressed because it is too large Load Diff

3
model.safetensors Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:271d8c81a696dafe9033c8fd7258222fb31c414b2906df77f1e34b756b87d49b
size 1192135096

3
model_image.png Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7e340269991729d2ecd4334eeaab67eca53bac5c4f30785e46d7d03ecaee8d8e
size 1686039

31
special_tokens_map.json Normal file
View File

@@ -0,0 +1,31 @@
{
"additional_special_tokens": [
"<|im_start|>",
"<|im_end|>",
"<|object_ref_start|>",
"<|object_ref_end|>",
"<|box_start|>",
"<|box_end|>",
"<|quad_start|>",
"<|quad_end|>",
"<|vision_start|>",
"<|vision_end|>",
"<|vision_pad|>",
"<|image_pad|>",
"<|video_pad|>"
],
"eos_token": {
"content": "<|im_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"pad_token": {
"content": "<|vision_pad|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
}
}

BIN
tokenizer.json (Stored with Git LFS) Normal file

Binary file not shown.

241
tokenizer_config.json Normal file
View File

@@ -0,0 +1,241 @@
{
"add_bos_token": false,
"add_prefix_space": false,
"added_tokens_decoder": {
"151643": {
"content": "<|endoftext|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151644": {
"content": "<|im_start|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151645": {
"content": "<|im_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151646": {
"content": "<|object_ref_start|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151647": {
"content": "<|object_ref_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151648": {
"content": "<|box_start|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151649": {
"content": "<|box_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151650": {
"content": "<|quad_start|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151651": {
"content": "<|quad_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151652": {
"content": "<|vision_start|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151653": {
"content": "<|vision_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151654": {
"content": "<|vision_pad|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151655": {
"content": "<|image_pad|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151656": {
"content": "<|video_pad|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151657": {
"content": "<tool_call>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151658": {
"content": "</tool_call>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151659": {
"content": "<|fim_prefix|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151660": {
"content": "<|fim_middle|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151661": {
"content": "<|fim_suffix|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151662": {
"content": "<|fim_pad|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151663": {
"content": "<|repo_name|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151664": {
"content": "<|file_sep|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151665": {
"content": "<tool_response>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151666": {
"content": "</tool_response>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151667": {
"content": "<think>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
},
"151668": {
"content": "</think>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": false
}
},
"additional_special_tokens": [
"<|im_start|>",
"<|im_end|>",
"<|object_ref_start|>",
"<|object_ref_end|>",
"<|box_start|>",
"<|box_end|>",
"<|quad_start|>",
"<|quad_end|>",
"<|vision_start|>",
"<|vision_end|>",
"<|vision_pad|>",
"<|image_pad|>",
"<|video_pad|>"
],
"bos_token": null,
"clean_up_tokenization_spaces": false,
"eos_token": "<|im_end|>",
"errors": "replace",
"extra_special_tokens": {},
"model_max_length": 40960,
"pad_token": "<|vision_pad|>",
"padding_side": "left",
"split_special_tokens": false,
"tokenizer_class": "Qwen2Tokenizer",
"unk_token": null,
"chat_template": "{%- if tools %}\n {{- '<|im_start|>system\\n' }}\n {%- if messages[0].role == 'system' %}\n {{- messages[0].content + '\\n\\n' }}\n {%- endif %}\n {{- \"# Tools\\n\\nYou may call one or more functions to assist with the user query.\\n\\nYou are provided with function signatures within <tools></tools> XML tags:\\n<tools>\" }}\n {%- for tool in tools %}\n {{- \"\\n\" }}\n {{- tool | tojson }}\n {%- endfor %}\n {{- \"\\n</tools>\\n\\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\\n<tool_call>\\n{\\\"name\\\": <function-name>, \\\"arguments\\\": <args-json-object>}\\n</tool_call><|im_end|>\\n\" }}\n{%- else %}\n {%- if messages[0].role == 'system' %}\n {{- '<|im_start|>system\\n' + messages[0].content + '<|im_end|>\\n' }}\n {%- endif %}\n{%- endif %}\n{%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}\n{%- for message in messages[::-1] %}\n {%- set index = (messages|length - 1) - loop.index0 %}\n {%- if ns.multi_step_tool and message.role == \"user\" and message.content is string and not(message.content.startswith('<tool_response>') and message.content.endswith('</tool_response>')) %}\n {%- set ns.multi_step_tool = false %}\n {%- set ns.last_query_index = index %}\n {%- endif %}\n{%- endfor %}\n{%- for message in messages %}\n {%- if message.content is string %}\n {%- set content = message.content %}\n {%- else %}\n {%- set content = '' %}\n {%- endif %}\n {%- if (message.role == \"user\") or (message.role == \"system\" and not loop.first) %}\n {{- '<|im_start|>' + message.role + '\\n' + content + '<|im_end|>' + '\\n' }}\n {%- elif message.role == \"assistant\" %}\n {%- set reasoning_content = '' %}\n {%- if message.reasoning_content is string %}\n {%- set reasoning_content = message.reasoning_content %}\n {%- else %}\n {%- if '</think>' in content %}\n {%- set reasoning_content = content.split('</think>')[0].rstrip('\\n').split('<think>')[-1].lstrip('\\n') %}\n {%- set content = content.split('</think>')[-1].lstrip('\\n') %}\n {%- endif %}\n {%- endif %}\n {%- if loop.index0 > ns.last_query_index %}\n {%- if loop.last or (not loop.last and reasoning_content) %}\n {{- '<|im_start|>' + message.role + '\\n<think>\\n' + reasoning_content.strip('\\n') + '\\n</think>\\n\\n' + content.lstrip('\\n') }}\n {%- else %}\n {{- '<|im_start|>' + message.role + '\\n' + content }}\n {%- endif %}\n {%- else %}\n {{- '<|im_start|>' + message.role + '\\n' + content }}\n {%- endif %}\n {%- if message.tool_calls %}\n {%- for tool_call in message.tool_calls %}\n {%- if (loop.first and content) or (not loop.first) %}\n {{- '\\n' }}\n {%- endif %}\n {%- if tool_call.function %}\n {%- set tool_call = tool_call.function %}\n {%- endif %}\n {{- '<tool_call>\\n{\"name\": \"' }}\n {{- tool_call.name }}\n {{- '\", \"arguments\": ' }}\n {%- if tool_call.arguments is string %}\n {{- tool_call.arguments }}\n {%- else %}\n {{- tool_call.arguments | tojson }}\n {%- endif %}\n {{- '}\\n</tool_call>' }}\n {%- endfor %}\n {%- endif %}\n {{- '<|im_end|>\\n' }}\n {%- elif message.role == \"tool\" %}\n {%- if loop.first or (messages[loop.index0 - 1].role != \"tool\") %}\n {{- '<|im_start|>user' }}\n {%- endif %}\n {{- '\\n<tool_response>\\n' }}\n {{- content }}\n {{- '\\n</tool_response>' }}\n {%- if loop.last or (messages[loop.index0 + 1].role != \"tool\") %}\n {{- '<|im_end|>\\n' }}\n {%- endif %}\n {%- endif %}\n{%- endfor %}\n{%- if add_generation_prompt %}\n {{- '<|im_start|>assistant\n<think>\n' }}\n{%- endif %}"
}

1
vocab.json Normal file

File diff suppressed because one or more lines are too long