ModelHub XC 1faaf17714 Initialize project; model provided by the ModelHub XC community

Model: QwenCollection/Hercules-Mini-1.8B
Source: Original Platform

2026-05-10 14:15:42 +08:00

Files (all added in the initial commit, "Initialize project; model provided by the ModelHub XC community", 2026-05-10 14:15:42 +08:00):

.gitattributes
added_tokens.json
config.json
configuration.json
generation_config.json
merges.txt
model-00001-of-00002.safetensors
model-00002-of-00002.safetensors
model.safetensors.index.json
README.md
special_tokens_map.json
tokenizer_config.json
tokenizer.json
vocab.json

README.md

library_name: transformers
license: other
datasets: Locutusque/hercules-v4.0
language:
inference parameters:

do_sample	temperature	top_p	top_k	max_new_tokens	repetition_penalty
true	1	0.7	4	250	1.1

Hercules-Mini-1.8B

We fine-tuned Qwen1.5-1.8B on Locutusque's Hercules-v4.

Model Details

Model Description

This model is capable of math, coding, function calling, roleplay, and more. We fine-tuned it on 700,000 examples from Hercules-v4.

Developed by: M4-ai
Language(s) (NLP): English, and possibly Chinese
License: tongyi-qianwen license
Finetuned from model: Qwen1.5-1.8B

Uses

General-purpose assistant, question answering, chain-of-thought reasoning, etc.

Bias, Risks, and Limitations

The EOS token was not set up properly. To prevent unbounded generation, you need to implement a stopping criterion that halts generation when the model emits the <|im_end|> token.
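
One possible way to handle this is sketched below. Instead of a custom stopping-criteria class, it takes the simpler route of passing the id of <|im_end|> as `eos_token_id` to `generate`, then trims the output at that token. The sampling values are copied from the inference parameters listed on this card. The names `truncate_at_stop`, `generate_reply`, and `SAMPLING` are illustrative, and the prompt format and repository id are left to the caller because the card does not state them.

```python
STOP_TOKEN = "<|im_end|>"

# Sampling settings copied from the inference parameters on this card.
SAMPLING = dict(
    do_sample=True,
    temperature=1.0,
    top_p=0.7,
    top_k=4,
    max_new_tokens=250,
    repetition_penalty=1.1,
)


def truncate_at_stop(token_ids, stop_id):
    """Return token_ids up to, but not including, the first stop_id."""
    if stop_id in token_ids:
        return token_ids[: token_ids.index(stop_id)]
    return token_ids


def generate_reply(prompt, model_id):
    """Generate one reply and halt at <|im_end|> (hypothetical helper)."""
    # Imported lazily so truncate_at_stop works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    stop_id = tokenizer.convert_tokens_to_ids(STOP_TOKEN)

    inputs = tokenizer(prompt, return_tensors="pt")
    # Treat <|im_end|> as the end-of-sequence token for this call only.
    output = model.generate(**inputs, eos_token_id=stop_id, **SAMPLING)

    # Keep only the newly generated tokens, then cut at the stop token.
    prompt_length = inputs["input_ids"].shape[1]
    new_ids = output[0, prompt_length:].tolist()
    return tokenizer.decode(truncate_at_stop(new_ids, stop_id))
```

A call would look like `generate_reply(prompt, "QwenCollection/Hercules-Mini-1.8B")`, using the repository path shown on this page; whether that id resolves on a given hub is an assumption.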

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

Evaluation

Coming soon

Training Details

Training Data

https://huggingface.co/datasets/Locutusque/hercules-v4.0

Training Hyperparameters

Training regime: bf16 non-mixed precision

Technical Specifications

Hardware

We used 8 Kaggle TPUs and trained with a global batch size of 256 and a sequence length of 1536.
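
As a rough illustration of what these numbers imply, the arithmetic below derives a per-device batch size, an upper bound on tokens per optimizer step, and the number of steps for one pass over 700,000 examples. It assumes plain data parallelism with the global batch split evenly across the 8 devices, one example per sequence with no packing, and a single epoch; none of these are stated on the card.

```python
import math

# Figures taken from this card.
NUM_DEVICES = 8
GLOBAL_BATCH_SIZE = 256
SEQUENCE_LENGTH = 1536
NUM_EXAMPLES = 700_000

# Assumption: the global batch is split evenly across devices.
per_device_batch = GLOBAL_BATCH_SIZE // NUM_DEVICES

# Upper bound: every sequence filled to the full sequence length.
max_tokens_per_step = GLOBAL_BATCH_SIZE * SEQUENCE_LENGTH

# Assumption: one example per sequence, one pass over the data.
steps_per_epoch = math.ceil(NUM_EXAMPLES / GLOBAL_BATCH_SIZE)

print(per_device_batch)     # 32
print(max_tokens_per_step)  # 393216
print(steps_per_epoch)      # 2735
```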

Contributions

Thanks to @Tonic, @aloobun, @fhai50032, and @Locutusque for their contributions to this model.