Update README.md

This commit is contained in:
wangpeiyi
2024-01-03 05:17:35 +00:00
committed by huggingface-web
parent 58489d7d08
commit b861154b67

View File

@@ -1,4 +1,4 @@
Process reward models (mistral-7b) used in [Math-Shepherd](https://arxiv.org/pdf/2312.08935.pdf).
Process reward model (mistral-7b) used in [Math-Shepherd](https://arxiv.org/pdf/2312.08935.pdf).
`Input`: question + step-by-step solutions with a special step tag `ки`, e.g.,
```