Files
Stack-3.0-Omni-Nexus/benchmarks/hellaswag.json
ModelHub XC 5d9af16e3d 初始化项目,由ModelHub XC社区提供模型
Model: my-ai-stack/Stack-3.0-Omni-Nexus
Source: Original Platform
2026-05-07 16:44:07 +08:00

8 lines
191 B
JSON

{
"benchmark": "hellaswag",
"model": "omni-nexus-alpha-q8",
"method": "chat-api (single generate, A/B/C/D pick)",
"accuracy": 0.5960963951404102,
"correct": 5986,
"total": 10042
}