pythia-160m-data-seed3/tokenizer_config.json

{
  "add_prefix_space": false,
  "bos_token": "<|endoftext|>",
  "clean_up_tokenization_spaces": true,
  "eos_token": "<|endoftext|>",
  "tokenizer_class": "GPTNeoXTokenizer",
  "unk_token": "<|endoftext|>"
}