language, tags, license, datasets
language tags license datasets
en
pytorch
causal-lm
pythia
apache-2.0
Anthropic/hh-rlhf

Pythia-6.9b supervised finetuned with Anthropic-hh-rlhf dataset for 1 epoch.

wandb log

Benchmark evaluations included in repo done using lm-evaluation-harness.

See Pythia-6.9b for model details (paper).

Description
Model synced from source: lomahony/eleuther-pythia6.9b-hh-sft
Readme 713 KiB
Languages
Text 100%