QWQ-500M is a fine-tuned variant of Qwen2.5-0.5B, optimized for text generation tasks, particularly conversational reasoning and complex problem-solving. This model contains 494 million parameters and uses FP16 tensor type for efficient inference. It leverages the robust architecture of Qwen2.5 and has undergone further enhancements to excel in generating high-quality text, structured outputs, and multilingual support.
Finetuned on Instruction Data: Built upon Qwen2.5-0.5B-Instruct with specialized datasets for better instruction-following.
Specialization:
Advanced conversational reasoning.
Long-form content generation.
Support for generating structured data (JSON, tables).
Multilingual capabilities (over 29 languages).
Optimized for Long Context: Supports input contexts up to 128K tokens with generation capability up to 8K tokens.
Datasets Used
The model was fine-tuned on high-quality datasets explicitly curated for Chain of Thought (CoT) reasoning and long-context tasks. Notable datasets include: