Update README.md
This commit is contained in:
17
README.md
17
README.md
@@ -59,7 +59,7 @@ model-index:
|
|||||||
name: Training Progress (%)
|
name: Training Progress (%)
|
||||||
---
|
---
|
||||||
|
|
||||||
# Qwen3-0.6B-Gensyn-Swarm (tall_tame_panther)
|
# Qwen3-0.6B-Gensyn-Swarm the Agent-ID (tall_tame_panther)
|
||||||
|
|
||||||
[](https://huggingface.co/0xgr3y/Qwen3-0.6B-Gensyn-Swarm-tall_tame_panther)
|
[](https://huggingface.co/0xgr3y/Qwen3-0.6B-Gensyn-Swarm-tall_tame_panther)
|
||||||
[](https://huggingface.co/0xgr3y/Qwen3-0.6B-Gensyn-Swarm-tall_tame_panther/tree/main)
|
[](https://huggingface.co/0xgr3y/Qwen3-0.6B-Gensyn-Swarm-tall_tame_panther/tree/main)
|
||||||
@@ -68,18 +68,19 @@ model-index:
|
|||||||
|
|
||||||
## Model Overview
|
## Model Overview
|
||||||
|
|
||||||
This model is a continuously trained Qwen3-0.6B fine-tuned using **Gensyn RL-Swarm** framework with **GRPO (Generalized Reward Policy Optimization)** for enhanced reasoning and mathematical capabilities. **Note: Current training focuses on math/reasoning tasks**.
|
This model is a continuously trained Qwen3-0.6B fine-tuned using **Gensyn RL-Swarm** framework with **GRPO (Generalized Reward Policy Optimization)** and support **GGUF (llama.cpp)** for enhanced reasoning and mathematical capabilities. **Note: Current training focuses on math & reasoning tasks**.
|
||||||
|
|
||||||
**Agent ID:** `tall_tame_panther`
|
- **Agent ID:** `tall_tame_panther`
|
||||||
**Training Status:** 🟢 LIVE - Model updates automatically every 5-10 minutes
|
- **Training Status:** 🟢 LIVE - Model updates automatically every 5-10 minutes
|
||||||
**Current Progress:** Round 43,610+ / 100,000 (43,61%)
|
- **Auto-Sync GGUF Pipeline Status:** 🟢 LIVE - Commits update automatically every 1h-hourly
|
||||||
**Framework Version:** Gensyn RL-Swarm v0.6.4
|
- **Current Progress:** Round 43,610+ / 100,000 (43,61%)
|
||||||
**Contract:** SwarmCoordinator v0.4.2
|
- **Framework Version:** Gensyn RL-Swarm v0.6.4
|
||||||
|
- **Contract:** SwarmCoordinator v0.4.2
|
||||||
|
|
||||||
## Key Features
|
## Key Features
|
||||||
|
|
||||||
- **Real-time Training**: Continuous learning with distributed RL across Gensyn swarm network
|
- **Real-time Training**: Continuous learning with distributed RL across Gensyn swarm network
|
||||||
- **Multi-domain Reasoning**: Trained on logic, arithmetic, and mathematical problem-solving
|
- **Multi-domain Reasoning**: Trained on logic, mathematical problem-solving & reasoning tasks
|
||||||
- **GGUF Support**: Multiple quantized formats available (F16, Q3_K_M, Q4_K_M, Q5_K_M)
|
- **GGUF Support**: Multiple quantized formats available (F16, Q3_K_M, Q4_K_M, Q5_K_M)
|
||||||
- **llama.cpp Compatible**: Ready for edge deployment and local inference
|
- **llama.cpp Compatible**: Ready for edge deployment and local inference
|
||||||
- **BF16 Precision**: Trained with bfloat16 for optimal performance
|
- **BF16 Precision**: Trained with bfloat16 for optimal performance
|
||||||
|
|||||||
Reference in New Issue
Block a user