Adding Evaluation Results (#2)
- Adding Evaluation Results (b9faac07e9ded01d2968f0ce5724837c57da60e3) Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>
This commit is contained in:
committed by
system
parent
cdd8a6cde7
commit
07ed34d94e
15
README.md
15
README.md
@@ -72,4 +72,17 @@ For additional information or inquiries about FinOPT-Washington, please contact
|
|||||||
FinOPT-Washington is an AI language model trained by Maya Philippines. It is provided "as is" without warranty of any kind, express or implied. The model developers and Maya Philippines shall not be liable for any direct or indirect damages arising from the use of this model.
|
FinOPT-Washington is an AI language model trained by Maya Philippines. It is provided "as is" without warranty of any kind, express or implied. The model developers and Maya Philippines shall not be liable for any direct or indirect damages arising from the use of this model.
|
||||||
|
|
||||||
## Acknowledgments
|
## Acknowledgments
|
||||||
The development of FinOPT-Washington was made possible by Maya Philippines and the curation and creation of the financial question-answering dataset.
|
The development of FinOPT-Washington was made possible by Maya Philippines and the curation and creation of the financial question-answering dataset.
|
||||||
|
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
||||||
|
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MayaPH__FinOPT-Washington)
|
||||||
|
|
||||||
|
| Metric | Value |
|
||||||
|
|-----------------------|---------------------------|
|
||||||
|
| Avg. | 24.87 |
|
||||||
|
| ARC (25-shot) | 25.17 |
|
||||||
|
| HellaSwag (10-shot) | 26.25 |
|
||||||
|
| MMLU (5-shot) | 24.83 |
|
||||||
|
| TruthfulQA (0-shot) | 45.8 |
|
||||||
|
| Winogrande (5-shot) | 51.07 |
|
||||||
|
| GSM8K (5-shot) | 0.0 |
|
||||||
|
| DROP (3-shot) | 1.0 |
|
||||||
|
|||||||
Reference in New Issue
Block a user