ModelHub XC 78a6661ff1 Initialize project; model provided by the ModelHub XC community
Model: bigscience/bloomz-7b1-p3
Source: Original Platform
2026-06-15 07:40:14 +08:00


dataset,prompt,metric,value
xcopa_id,C1 or C2? premise_idmt,accuracy,0.51
xcopa_id,best_option_idmt,accuracy,0.53
xcopa_id,cause_effect_idmt,accuracy,0.69
xcopa_id,i_am_hesitating_idmt,accuracy,0.64
xcopa_id,plausible_alternatives_idmt,accuracy,0.7
xcopa_id,median,accuracy,0.64
xcopa_sw,C1 or C2? premise_swmt,accuracy,0.6
xcopa_sw,best_option_swmt,accuracy,0.62
xcopa_sw,cause_effect_swmt,accuracy,0.49
xcopa_sw,i_am_hesitating_swmt,accuracy,0.56
xcopa_sw,plausible_alternatives_swmt,accuracy,0.54
xcopa_sw,median,accuracy,0.56
xcopa_ta,C1 or C2? premise_tamt,accuracy,0.52
xcopa_ta,best_option_tamt,accuracy,0.55
xcopa_ta,cause_effect_tamt,accuracy,0.63
xcopa_ta,i_am_hesitating_tamt,accuracy,0.63
xcopa_ta,plausible_alternatives_tamt,accuracy,0.66
xcopa_ta,median,accuracy,0.63
xcopa_vi,C1 or C2? premise_vimt,accuracy,0.55
xcopa_vi,best_option_vimt,accuracy,0.61
xcopa_vi,cause_effect_vimt,accuracy,0.64
xcopa_vi,i_am_hesitating_vimt,accuracy,0.6
xcopa_vi,plausible_alternatives_vimt,accuracy,0.64
xcopa_vi,median,accuracy,0.61
xcopa_zh,C1 or C2? premise_zhmt,accuracy,0.52
xcopa_zh,best_option_zhmt,accuracy,0.61
xcopa_zh,cause_effect_zhmt,accuracy,0.75
xcopa_zh,i_am_hesitating_zhmt,accuracy,0.72
xcopa_zh,plausible_alternatives_zhmt,accuracy,0.76
xcopa_zh,median,accuracy,0.72
xstory_cloze_ar,Answer Given options_armt,accuracy,0.7061548643282595
xstory_cloze_ar,Choose Story Ending_armt,accuracy,0.786896095301125
xstory_cloze_ar,Generate Ending_armt,accuracy,0.600926538716082
xstory_cloze_ar,Novel Correct Ending_armt,accuracy,0.7511581733951026
xstory_cloze_ar,Story Continuation and Options_armt,accuracy,0.757114493712773
xstory_cloze_ar,median,accuracy,0.7511581733951026
xstory_cloze_es,Answer Given options_esmt,accuracy,0.7902051621442753
xstory_cloze_es,Choose Story Ending_esmt,accuracy,0.8160158835208471
xstory_cloze_es,Generate Ending_esmt,accuracy,0.657180675049636
xstory_cloze_es,Novel Correct Ending_esmt,accuracy,0.784910655195235
xstory_cloze_es,Story Continuation and Options_esmt,accuracy,0.7696889477167439
xstory_cloze_es,median,accuracy,0.784910655195235
xstory_cloze_eu,Answer Given options_eumt,accuracy,0.6227663798808736
xstory_cloze_eu,Choose Story Ending_eumt,accuracy,0.6763732627399074
xstory_cloze_eu,Generate Ending_eumt,accuracy,0.5737921906022502
xstory_cloze_eu,Novel Correct Ending_eumt,accuracy,0.686300463269358
xstory_cloze_eu,Story Continuation and Options_eumt,accuracy,0.6637988087359364
xstory_cloze_eu,median,accuracy,0.6637988087359364
xstory_cloze_hi,Answer Given options_himt,accuracy,0.6697551290536069
xstory_cloze_hi,Choose Story Ending_himt,accuracy,0.7160820648577101
xstory_cloze_hi,Generate Ending_himt,accuracy,0.5923229649238915
xstory_cloze_hi,Novel Correct Ending_himt,accuracy,0.6882859033752482
xstory_cloze_hi,Story Continuation and Options_himt,accuracy,0.7048312375909993
xstory_cloze_hi,median,accuracy,0.6882859033752482
xstory_cloze_id,Answer Given options_idmt,accuracy,0.7346128391793514
xstory_cloze_id,Choose Story Ending_idmt,accuracy,0.7511581733951026
xstory_cloze_id,Generate Ending_idmt,accuracy,0.6201191264063534
xstory_cloze_id,Novel Correct Ending_idmt,accuracy,0.728656518861681
xstory_cloze_id,Story Continuation and Options_idmt,accuracy,0.7412309728656519
xstory_cloze_id,median,accuracy,0.7346128391793514
xstory_cloze_zh,Answer Given options_zhmt,accuracy,0.7425545996029119
xstory_cloze_zh,Choose Story Ending_zhmt,accuracy,0.7941760423560555
xstory_cloze_zh,Generate Ending_zhmt,accuracy,0.6247518199867638
xstory_cloze_zh,Novel Correct Ending_zhmt,accuracy,0.7842488418266049
xstory_cloze_zh,Story Continuation and Options_zhmt,accuracy,0.8034414295168762
xstory_cloze_zh,median,accuracy,0.7842488418266049
xwinograd_fr,Replace_frmt,accuracy,0.5180722891566265
xwinograd_fr,True or False_frmt,accuracy,0.46987951807228917
xwinograd_fr,does underscore refer to_frmt,accuracy,0.5421686746987951
xwinograd_fr,stand for_frmt,accuracy,0.5060240963855421
xwinograd_fr,underscore refer to_frmt,accuracy,0.5421686746987951
xwinograd_fr,median,accuracy,0.5180722891566265
xwinograd_pt,Replace_ptmt,accuracy,0.5057034220532319
xwinograd_pt,True or False_ptmt,accuracy,0.5133079847908745
xwinograd_pt,does underscore refer to_ptmt,accuracy,0.5209125475285171
xwinograd_pt,stand for_ptmt,accuracy,0.5209125475285171
xwinograd_pt,underscore refer to_ptmt,accuracy,0.49049429657794674
xwinograd_pt,median,accuracy,0.5133079847908745
xwinograd_zh,Replace_zhmt,accuracy,0.5238095238095238
xwinograd_zh,True or False_zhmt,accuracy,0.5138888888888888
xwinograd_zh,does underscore refer to_zhmt,accuracy,0.49404761904761907
xwinograd_zh,stand for_zhmt,accuracy,0.49603174603174605
xwinograd_zh,underscore refer to_zhmt,accuracy,0.503968253968254
xwinograd_zh,median,accuracy,0.503968253968254
multiple,average,multiple,0.6501688392588024
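
The summary rows above can be rechecked from the per-prompt scores. Each `median` row is consistent with the median of that dataset's five prompt accuracies, and the final `multiple,average` value matches the mean of the 14 per-dataset medians. The Python sketch below is one way to recompute both, assuming the file is comma-separated with the columns `dataset`, `prompt`, `metric`, `value`. The function name `summarize` and the `SAMPLE` excerpt (two datasets copied from the rows above) are illustrative only and not part of the original repository.

```python
import csv
import io
from collections import defaultdict
from statistics import mean, median


def summarize(csv_text):
    """Return (per-dataset medians, mean of those medians) for a results CSV.

    Assumes columns named dataset, prompt, metric, value (hypothetical helper).
    """
    scores = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Skip the precomputed summary rows so they are not counted as prompts.
        if row["prompt"] in ("median", "average"):
            continue
        scores[row["dataset"]].append(float(row["value"]))
    medians = {name: median(values) for name, values in scores.items()}
    return medians, mean(list(medians.values()))


# Excerpt of two datasets, copied from the results above.
SAMPLE = """dataset,prompt,metric,value
xcopa_id,C1 or C2? premise_idmt,accuracy,0.51
xcopa_id,best_option_idmt,accuracy,0.53
xcopa_id,cause_effect_idmt,accuracy,0.69
xcopa_id,i_am_hesitating_idmt,accuracy,0.64
xcopa_id,plausible_alternatives_idmt,accuracy,0.7
xcopa_id,median,accuracy,0.64
xcopa_sw,C1 or C2? premise_swmt,accuracy,0.6
xcopa_sw,best_option_swmt,accuracy,0.62
xcopa_sw,cause_effect_swmt,accuracy,0.49
xcopa_sw,i_am_hesitating_swmt,accuracy,0.56
xcopa_sw,plausible_alternatives_swmt,accuracy,0.54
xcopa_sw,median,accuracy,0.56
"""

medians, overall = summarize(SAMPLE)
for name, value in medians.items():
    print(f"{name}: median accuracy {value}")
print(f"mean of medians: {overall:.4f}")
```

Passing the full file contents to `summarize` instead of `SAMPLE` should reproduce every `median` row and the final average, provided the file parses as plain CSV.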