Adding Evaluation Results (#2)
- Adding Evaluation Results (b9faac07e9ded01d2968f0ce5724837c57da60e3) Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>
This commit is contained in:
committed by
system
parent
cdd8a6cde7
commit
07ed34d94e
13
README.md
13
README.md
@@ -73,3 +73,16 @@ FinOPT-Washington is an AI language model trained by Maya Philippines. It is pro
|
||||
|
||||
## Acknowledgments
|
||||
The development of FinOPT-Washington was made possible by Maya Philippines and the curation and creation of the financial question-answering dataset.
|
||||
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
||||
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MayaPH__FinOPT-Washington)
|
||||
|
||||
| Metric | Value |
|
||||
|-----------------------|---------------------------|
|
||||
| Avg. | 24.87 |
|
||||
| ARC (25-shot) | 25.17 |
|
||||
| HellaSwag (10-shot) | 26.25 |
|
||||
| MMLU (5-shot) | 24.83 |
|
||||
| TruthfulQA (0-shot) | 45.8 |
|
||||
| Winogrande (5-shot) | 51.07 |
|
||||
| GSM8K (5-shot) | 0.0 |
|
||||
| DROP (3-shot) | 1.0 |
|
||||
|
||||
Reference in New Issue
Block a user