ModelHub XC 1faaf17714 Initialize project; model provided by the ModelHub XC community
Model: QwenCollection/Hercules-Mini-1.8B
Source: Original Platform
2026-05-10 14:15:42 +08:00

  • library_name: transformers
  • license: other
  • datasets: Locutusque/hercules-v4.0
  • language: en
  • inference parameters: do_sample true, temperature 1, top_p 0.7, top_k 4, max_new_tokens 250, repetition_penalty 1.1

Hercules-Mini-1.8B

We fine-tuned Qwen1.5-1.8B on Locutusque's Hercules-v4.

Model Details

Model Description

This model can handle math, coding, function calling, roleplay, and other tasks. We fine-tuned it on 700,000 examples from Hercules-v4.

  • Developed by: M4-ai
  • Language(s) (NLP): English, and possibly Chinese
  • License: tongyi-qianwen license
  • Finetuned from model: Qwen1.5-1.8B

Uses

General-purpose assistant, question answering, chain-of-thought reasoning, etc.

Bias, Risks, and Limitations

The EOS token was not set up properly, so to prevent endless generation you need to implement a stopping criterion that halts generation when the model emits the <|im_end|> token.
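
One way to apply this workaround, sketched under assumptions: instead of a custom StoppingCriteria class, the code below passes the id of <|im_end|> as eos_token_id to generate(), which also halts generation at that token. The helper names (stop_token_kwargs, trim_reply, generate_reply) are hypothetical, the repository id is the one shown on this page and may differ on other hosts, and the sampling values are copied from the card's inference metadata. The prompt is assumed to be already formatted for the model.

```python
IM_END = "<|im_end|>"

# Sampling settings copied from the card's inference metadata.
SAMPLING = {
    "do_sample": True,
    "temperature": 1.0,
    "top_p": 0.7,
    "top_k": 4,
    "max_new_tokens": 250,
    "repetition_penalty": 1.1,
}


def stop_token_kwargs(tokenizer):
    """Build generate() arguments that treat <|im_end|> as end-of-sequence."""
    stop_id = tokenizer.convert_tokens_to_ids(IM_END)
    # Reusing the stop id for padding is an assumption, so that padding is
    # defined even if the checkpoint leaves it unset.
    return {"eos_token_id": stop_id, "pad_token_id": stop_id}


def trim_reply(text):
    """Drop the stop marker, and anything after it, from decoded text."""
    return text.split(IM_END, 1)[0].strip()


def generate_reply(prompt, model_id="QwenCollection/Hercules-Mini-1.8B"):
    """Generate one reply; this downloads the model, so it never runs on import."""
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output = model.generate(**inputs, **SAMPLING, **stop_token_kwargs(tokenizer))
    # Decode only the newly generated tokens, not the prompt.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    return trim_reply(tokenizer.decode(new_tokens))
```

Calling generate_reply("...") with a prompt of your own returns the text up to, but not including, the stop marker. trim_reply is kept as a second safeguard in case the marker still appears in the decoded text.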

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

Evaluation

Coming soon

Training Details

Training Data

https://huggingface.co/datasets/Locutusque/hercules-v4.0

Training Hyperparameters

  • Training regime: bf16 non-mixed precision

Technical Specifications

Hardware

We used 8 Kaggle TPUs and trained with a global batch size of 256 and a sequence length of 1536.
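
To put these numbers in perspective, here is a small arithmetic sketch. It assumes the global batch is split evenly across the 8 devices, that each sequence holds one example padded or truncated to 1536 tokens (no packing), and that no gradient accumulation is involved; the card states none of these. The number of epochs is also not given, so only the steps for a single pass over the 700,000 examples are computed.

```python
import math

NUM_DEVICES = 8          # "8 Kaggle TPUs" from the card
GLOBAL_BATCH = 256       # sequences per optimizer step
SEQ_LEN = 1536           # maximum tokens per sequence
NUM_EXAMPLES = 700_000   # fine-tuning examples used

per_device_batch = GLOBAL_BATCH // NUM_DEVICES          # assumes an even split
max_tokens_per_step = GLOBAL_BATCH * SEQ_LEN            # upper bound, full-length sequences
steps_per_pass = math.ceil(NUM_EXAMPLES / GLOBAL_BATCH)  # one pass over the data

print(per_device_batch, max_tokens_per_step, steps_per_pass)
# prints: 32 393216 2735
```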

Contributions

Thanks to @Tonic, @aloobun, @fhai50032, and @Locutusque for their contributions to this model.
