diff --git a/README.md b/README.md
index 8845fdf..7b8634d 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,4 @@
-Process reward models (mistral-7b) used in [Math-Shepherd](https://arxiv.org/pdf/2312.08935.pdf).
+Process reward model (mistral-7b) used in [Math-Shepherd](https://arxiv.org/pdf/2312.08935.pdf).
 
 `Input`: question + step-by-step solutions with a special step tag `ки`, e.g.,
 ```