--- base_model: - Qwen/Qwen3-4B-Thinking-2507 datasets: - Roman1111111/claude-sonnet-4.6-120000x - Roman1111111/claude-sonnet-4.6-100000X-filtered - TeichAI/lordx64-claude-opus-4.7-max-cleaned - Crownelius/Opus-4.6-Reasoning-3300x - TeichAI/claude-4.5-opus-high-reasoning-250x - TeichAI/claude-haiku-4.5-high-reasoning-1700x - TeichAI/claude-sonnet-4.5-high-reasoning-250x - TeichAI/deepseek-v3.2-speciale-openr1-math-3k - TeichAI/deepseek-v3.2-speciale-1000x - Roman1111111/gemini-3-pro-10000x-hard-high-reasoning - Roman1111111/gemini-3.1-pro-hard-high-reasoning - Jackrong/DeepSeek-V4-Distill-8000x tags: - opensonnet - claude-sonnet - sonnet pipeline_tag: text-generation library_name: transformers license: apache-2.0 license_link: https://huggingface.co/hadadxyz/OpenSonnet-Lite-MAX/blob/main/LICENSE --- # Comparison | Model | Training Approach | Developer Role | Context Length | Training Epochs | Transformers Version | Notes | |------------------------------------------------------------------------------|--------------------------|------------------------|----------------|------------------|------------------------|-------------------------------------------------------------------------------------| | [OpenSonnet-Lite-MAX](https://huggingface.co/hadadxyz/OpenSonnet-Lite-MAX) | Multi-Stage Fine-Tuning | Supported | 262,144 | 2 | `transformers>=5.0.0` | Latest version with improved training efficiency and enhanced instruction alignment | | [OpenSonnet-Lite](https://huggingface.co/hadadxyz/OpenSonnet-Lite) | Single-Stage Fine-Tuning | Not supported | 262,144 | 3 | `transformers>=4.51.0` | Previous version with simpler training pipeline | | [Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) | N/A | Not supported | 262,144 | N/A | `transformers>=4.51.0` | Base model | > [OpenSonnet-Lite-MAX quick demo](https://www.kaggle.com/code/hadadrjt/opensonnet-lite-max) with tool calling. ### Benchmark Evaluation | Dataset | Score | Source | Framework | |-------------------------------------------------------|--------|---------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------| | [GSM8K](https://huggingface.co/datasets/openai/gsm8k) | 85.22 | [Evaluation Results](https://huggingface.co/hadadxyz/OpenSonnet-Lite-MAX/tree/main/.eval_results) | [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) | | MMLU-Pro | - | - | - | | GPQA (Diamond) | - | - | - | # Inference Parameters For best results, the following sampling configuration is recommended: | Parameter | Recommended Value | Description | |---------------------|---------------------|------------------------------------------| | temperature | 0.6 (default) - 1.0 | Controls randomness in generation | | top_p | 0.95 (default) | Nucleus sampling threshold | | top_k | 20 (default) - 40 | Top-k sampling parameter | | min_p | 0.0 (default) | Minimum probability threshold | | repetition_penalty | 1.0 (default) - 1.2 | Penalizes repeated tokens | | presence_penalty | 1.0 - 1.5 | Encourages introducing new topics | # Max Tokens | Small Tasks | Medium Tasks | Large Tasks | Complex Tasks | |-------------|--------------|-------------|---------------| | 4096/8192 | 16384 | 32768/81920 | 131072 | # Instruction ```md You are OpenSonnet, a large language model trained by the Open Source community. You are based on the Qwen3 architecture. You are an AI assistant designed to provide accurate, helpful, and context-aware responses. Your reasoning style must dynamically adapt based on the complexity of the user’s request. --- # Adaptive Thinking Mode * Automatically assess the complexity of each user request before responding. * If the task is complex, multi-step, analytical, or requires planning, reasoning, or explanation: - Use structured, step-by-step reasoning internally before responding. - Provide a clear, well-organized, and thorough answer. * If the task is simple, factual, or straightforward: - Use fast, minimal reasoning. - Respond concisely without unnecessary elaboration. --- # Complexity Detection Guidelines * Treat a request as COMPLEX if it involves: - Multi-step problem solving - Logic, mathematics, coding, or debugging - Planning, strategy, or decision making - Deep explanation or comparison - Ambiguous or multi-part instructions * Treat a request as SIMPLE if it involves: - Direct factual questions - Basic definitions - Short instructions - Common knowledge retrieval - Single-step tasks --- # Response Style Rules * Always prioritize correctness and clarity. * For complex tasks: structure answers clearly using sections or bullet points when helpful. * For simple tasks: keep responses short and direct. * Avoid unnecessary verbosity in all cases. --- # Quality Principles * Be accurate, logical, and consistent. * Do not hallucinate information. * If uncertain, clearly state limitations. * Optimize responses for usefulness and readability. --- # User Intent Focus * Always prioritize the user’s intent over literal interpretation. * If the request is ambiguous, make reasonable assumptions or ask a clarifying question when necessary. ``` # Citation If you use this model in your research or applications, please cite both this model and the base model: ```bibtex @misc{opensonnet-lite-max, author = {hadadxyz}, title = {OpenSonnet-Lite-MAX}, year = {2026}, url = {https://huggingface.co/hadadxyz/OpenSonnet-Lite-MAX} } ```