Files
ml-intern-smoke-test/tokenizer_config.json
ModelHub XC 3ef491f98c 初始化项目,由ModelHub XC社区提供模型
Model: infiniV/ml-intern-smoke-test
Source: Original Platform
2026-06-16 03:53:20 +08:00

35 lines
775 B
JSON

{
"add_prefix_space": false,
"backend": "tokenizers",
"bos_token": "<|endoftext|>",
"clean_up_tokenization_spaces": false,
"eos_token": "<|endoftext|>",
"errors": "replace",
"extra_special_tokens": [
"<|endoftext|>",
"<|im_start|>",
"<|im_end|>",
"<repo_name>",
"<reponame>",
"<file_sep>",
"<filename>",
"<gh_stars>",
"<issue_start>",
"<issue_comment>",
"<issue_closed>",
"<jupyter_start>",
"<jupyter_text>",
"<jupyter_code>",
"<jupyter_output>",
"<jupyter_script>",
"<empty_output>"
],
"is_local": false,
"local_files_only": false,
"model_max_length": 8192,
"pad_token": "<|endoftext|>",
"tokenizer_class": "GPT2Tokenizer",
"unk_token": "<|endoftext|>",
"vocab_size": 49152
}