From b861154b67cec7ec43ad05a9bac0b84f9f2a86ec Mon Sep 17 00:00:00 2001 From: wangpeiyi Date: Wed, 3 Jan 2024 05:17:35 +0000 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 8845fdf..7b8634d 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,4 @@ -Process reward models (mistral-7b) used in [Math-Shepherd](https://arxiv.org/pdf/2312.08935.pdf). +Process reward model (mistral-7b) used in [Math-Shepherd](https://arxiv.org/pdf/2312.08935.pdf). `Input`: question + step-by-step solutions with a special step tag `ΠΊΠΈ`, e.g., ```