Files
ModelHub XC a60cddb2f9 初始化项目,由ModelHub XC社区提供模型
Model: Muennighoff/SGPT-5.8B-weightedmean-msmarco-specb-bitfit
Source: Original Platform
2026-05-13 16:46:01 +08:00

64 KiB

pipeline_tag, tags, model-index
pipeline_tag tags model-index
sentence-similarity
sentence-transformers
feature-extraction
sentence-similarity
mteb
name results
SGPT-5.8B-weightedmean-msmarco-specb-bitfit
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_counterfactual MTEB AmazonCounterfactualClassification (en) en test 2d8a100785abf0ae21420d2a55b0c56e3e1ea996
type value
accuracy 69.22388059701493
type value
ap 32.04724673950256
type value
f1 63.25719825770428
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_polarity MTEB AmazonPolarityClassification default test 80714f8dcf8cefc218ef4f8c5a966dd83f75a0e1
type value
accuracy 71.26109999999998
type value
ap 66.16336378255403
type value
f1 70.89719145825303
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_reviews_multi MTEB AmazonReviewsClassification (en) en test c379a6705fec24a2493fa68e011692605f44e119
type value
accuracy 39.19199999999999
type value
f1 38.580766731113826
task dataset metrics
type
Retrieval
type name config split revision
arguana MTEB ArguAna default test 5b3e3697907184a9b77a3c99ee9ea1a9cbb1e4e3
type value
map_at_1 27.311999999999998
type value
map_at_10 42.620000000000005
type value
map_at_100 43.707
type value
map_at_1000 43.714999999999996
type value
map_at_3 37.624
type value
map_at_5 40.498
type value
mrr_at_1 27.667
type value
mrr_at_10 42.737
type value
mrr_at_100 43.823
type value
mrr_at_1000 43.830999999999996
type value
mrr_at_3 37.743
type value
mrr_at_5 40.616
type value
ndcg_at_1 27.311999999999998
type value
ndcg_at_10 51.37500000000001
type value
ndcg_at_100 55.778000000000006
type value
ndcg_at_1000 55.96600000000001
type value
ndcg_at_3 41.087
type value
ndcg_at_5 46.269
type value
precision_at_1 27.311999999999998
type value
precision_at_10 7.945
type value
precision_at_100 0.9820000000000001
type value
precision_at_1000 0.1
type value
precision_at_3 17.046
type value
precision_at_5 12.745000000000001
type value
recall_at_1 27.311999999999998
type value
recall_at_10 79.445
type value
recall_at_100 98.151
type value
recall_at_1000 99.57300000000001
type value
recall_at_3 51.13799999999999
type value
recall_at_5 63.727000000000004
task dataset metrics
type
Clustering
type name config split revision
mteb/arxiv-clustering-p2p MTEB ArxivClusteringP2P default test 0bbdb47bcbe3a90093699aefeed338a0f28a7ee8
type value
v_measure 45.59037428592033
task dataset metrics
type
Clustering
type name config split revision
mteb/arxiv-clustering-s2s MTEB ArxivClusteringS2S default test b73bd54100e5abfa6e3a23dcafb46fe4d2438dc3
type value
v_measure 38.86371701986363
task dataset metrics
type
Reranking
type name config split revision
mteb/askubuntudupquestions-reranking MTEB AskUbuntuDupQuestions default test 4d853f94cd57d85ec13805aeeac3ae3e5eb4c49c
type value
map 61.625568691427766
type value
mrr 75.83256386580486
task dataset metrics
type
STS
type name config split revision
mteb/biosses-sts MTEB BIOSSES default test 9ee918f184421b6bd48b78f6c714d86546106103
type value
cos_sim_pearson 89.96074355094802
type value
cos_sim_spearman 86.2501580394454
type value
euclidean_pearson 82.18427440380462
type value
euclidean_spearman 80.14760935017947
type value
manhattan_pearson 82.24621578156392
type value
manhattan_spearman 80.00363016590163
task dataset metrics
type
Classification
type name config split revision
mteb/banking77 MTEB Banking77Classification default test 44fa15921b4c889113cc5df03dd4901b49161ab7
type value
accuracy 84.49350649350649
type value
f1 84.4249343233736
task dataset metrics
type
Clustering
type name config split revision
mteb/biorxiv-clustering-p2p MTEB BiorxivClusteringP2P default test 11d0121201d1f1f280e8cc8f3d98fb9c4d9f9c55
type value
v_measure 36.551459722989385
task dataset metrics
type
Clustering
type name config split revision
mteb/biorxiv-clustering-s2s MTEB BiorxivClusteringS2S default test c0fab014e1bcb8d3a5e31b2088972a1e01547dc1
type value
v_measure 33.69901851846774
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackAndroidRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 30.499
type value
map_at_10 41.208
type value
map_at_100 42.638
type value
map_at_1000 42.754
type value
map_at_3 37.506
type value
map_at_5 39.422000000000004
type value
mrr_at_1 37.339
type value
mrr_at_10 47.051
type value
mrr_at_100 47.745
type value
mrr_at_1000 47.786
type value
mrr_at_3 44.086999999999996
type value
mrr_at_5 45.711
type value
ndcg_at_1 37.339
type value
ndcg_at_10 47.666
type value
ndcg_at_100 52.994
type value
ndcg_at_1000 54.928999999999995
type value
ndcg_at_3 41.982
type value
ndcg_at_5 44.42
type value
precision_at_1 37.339
type value
precision_at_10 9.127
type value
precision_at_100 1.4749999999999999
type value
precision_at_1000 0.194
type value
precision_at_3 20.076
type value
precision_at_5 14.449000000000002
type value
recall_at_1 30.499
type value
recall_at_10 60.328
type value
recall_at_100 82.57900000000001
type value
recall_at_1000 95.074
type value
recall_at_3 44.17
type value
recall_at_5 50.94
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackEnglishRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 30.613
type value
map_at_10 40.781
type value
map_at_100 42.018
type value
map_at_1000 42.132999999999996
type value
map_at_3 37.816
type value
map_at_5 39.389
type value
mrr_at_1 38.408
type value
mrr_at_10 46.631
type value
mrr_at_100 47.332
type value
mrr_at_1000 47.368
type value
mrr_at_3 44.384
type value
mrr_at_5 45.661
type value
ndcg_at_1 38.408
type value
ndcg_at_10 46.379999999999995
type value
ndcg_at_100 50.81
type value
ndcg_at_1000 52.663000000000004
type value
ndcg_at_3 42.18
type value
ndcg_at_5 43.974000000000004
type value
precision_at_1 38.408
type value
precision_at_10 8.656
type value
precision_at_100 1.3860000000000001
type value
precision_at_1000 0.184
type value
precision_at_3 20.276
type value
precision_at_5 14.241999999999999
type value
recall_at_1 30.613
type value
recall_at_10 56.44
type value
recall_at_100 75.044
type value
recall_at_1000 86.426
type value
recall_at_3 43.766
type value
recall_at_5 48.998000000000005
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackGamingRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 37.370999999999995
type value
map_at_10 49.718
type value
map_at_100 50.737
type value
map_at_1000 50.79
type value
map_at_3 46.231
type value
map_at_5 48.329
type value
mrr_at_1 42.884
type value
mrr_at_10 53.176
type value
mrr_at_100 53.81700000000001
type value
mrr_at_1000 53.845
type value
mrr_at_3 50.199000000000005
type value
mrr_at_5 52.129999999999995
type value
ndcg_at_1 42.884
type value
ndcg_at_10 55.826
type value
ndcg_at_100 59.93000000000001
type value
ndcg_at_1000 61.013
type value
ndcg_at_3 49.764
type value
ndcg_at_5 53.025999999999996
type value
precision_at_1 42.884
type value
precision_at_10 9.046999999999999
type value
precision_at_100 1.212
type value
precision_at_1000 0.135
type value
precision_at_3 22.131999999999998
type value
precision_at_5 15.524
type value
recall_at_1 37.370999999999995
type value
recall_at_10 70.482
type value
recall_at_100 88.425
type value
recall_at_1000 96.03399999999999
type value
recall_at_3 54.43
type value
recall_at_5 62.327999999999996
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackGisRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 22.875999999999998
type value
map_at_10 31.715
type value
map_at_100 32.847
type value
map_at_1000 32.922000000000004
type value
map_at_3 29.049999999999997
type value
map_at_5 30.396
type value
mrr_at_1 24.52
type value
mrr_at_10 33.497
type value
mrr_at_100 34.455000000000005
type value
mrr_at_1000 34.510000000000005
type value
mrr_at_3 30.791
type value
mrr_at_5 32.175
type value
ndcg_at_1 24.52
type value
ndcg_at_10 36.95
type value
ndcg_at_100 42.238
type value
ndcg_at_1000 44.147999999999996
type value
ndcg_at_3 31.435000000000002
type value
ndcg_at_5 33.839000000000006
type value
precision_at_1 24.52
type value
precision_at_10 5.9319999999999995
type value
precision_at_100 0.901
type value
precision_at_1000 0.11
type value
precision_at_3 13.446
type value
precision_at_5 9.469
type value
recall_at_1 22.875999999999998
type value
recall_at_10 51.38
type value
recall_at_100 75.31099999999999
type value
recall_at_1000 89.718
type value
recall_at_3 36.26
type value
recall_at_5 42.248999999999995
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackMathematicaRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 14.984
type value
map_at_10 23.457
type value
map_at_100 24.723
type value
map_at_1000 24.846
type value
map_at_3 20.873
type value
map_at_5 22.357
type value
mrr_at_1 18.159
type value
mrr_at_10 27.431
type value
mrr_at_100 28.449
type value
mrr_at_1000 28.52
type value
mrr_at_3 24.979000000000003
type value
mrr_at_5 26.447
type value
ndcg_at_1 18.159
type value
ndcg_at_10 28.627999999999997
type value
ndcg_at_100 34.741
type value
ndcg_at_1000 37.516
type value
ndcg_at_3 23.902
type value
ndcg_at_5 26.294
type value
precision_at_1 18.159
type value
precision_at_10 5.485
type value
precision_at_100 0.985
type value
precision_at_1000 0.136
type value
precision_at_3 11.774
type value
precision_at_5 8.731
type value
recall_at_1 14.984
type value
recall_at_10 40.198
type value
recall_at_100 67.11500000000001
type value
recall_at_1000 86.497
type value
recall_at_3 27.639000000000003
type value
recall_at_5 33.595000000000006
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackPhysicsRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 29.067
type value
map_at_10 39.457
type value
map_at_100 40.83
type value
map_at_1000 40.94
type value
map_at_3 35.995
type value
map_at_5 38.159
type value
mrr_at_1 34.937000000000005
type value
mrr_at_10 44.755
type value
mrr_at_100 45.549
type value
mrr_at_1000 45.589
type value
mrr_at_3 41.947
type value
mrr_at_5 43.733
type value
ndcg_at_1 34.937000000000005
type value
ndcg_at_10 45.573
type value
ndcg_at_100 51.266999999999996
type value
ndcg_at_1000 53.184
type value
ndcg_at_3 39.961999999999996
type value
ndcg_at_5 43.02
type value
precision_at_1 34.937000000000005
type value
precision_at_10 8.296000000000001
type value
precision_at_100 1.32
type value
precision_at_1000 0.167
type value
precision_at_3 18.8
type value
precision_at_5 13.763
type value
recall_at_1 29.067
type value
recall_at_10 58.298
type value
recall_at_100 82.25099999999999
type value
recall_at_1000 94.476
type value
recall_at_3 42.984
type value
recall_at_5 50.658
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackProgrammersRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 25.985999999999997
type value
map_at_10 35.746
type value
map_at_100 37.067
type value
map_at_1000 37.191
type value
map_at_3 32.599000000000004
type value
map_at_5 34.239000000000004
type value
mrr_at_1 31.735000000000003
type value
mrr_at_10 40.515
type value
mrr_at_100 41.459
type value
mrr_at_1000 41.516
type value
mrr_at_3 37.938
type value
mrr_at_5 39.25
type value
ndcg_at_1 31.735000000000003
type value
ndcg_at_10 41.484
type value
ndcg_at_100 47.047
type value
ndcg_at_1000 49.427
type value
ndcg_at_3 36.254999999999995
type value
ndcg_at_5 38.375
type value
precision_at_1 31.735000000000003
type value
precision_at_10 7.66
type value
precision_at_100 1.234
type value
precision_at_1000 0.16
type value
precision_at_3 17.427999999999997
type value
precision_at_5 12.328999999999999
type value
recall_at_1 25.985999999999997
type value
recall_at_10 53.761
type value
recall_at_100 77.149
type value
recall_at_1000 93.342
type value
recall_at_3 39.068000000000005
type value
recall_at_5 44.693
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 24.949749999999998
type value
map_at_10 34.04991666666667
type value
map_at_100 35.26825
type value
map_at_1000 35.38316666666667
type value
map_at_3 31.181333333333335
type value
map_at_5 32.77391666666667
type value
mrr_at_1 29.402833333333334
type value
mrr_at_10 38.01633333333333
type value
mrr_at_100 38.88033333333334
type value
mrr_at_1000 38.938500000000005
type value
mrr_at_3 35.5175
type value
mrr_at_5 36.93808333333333
type value
ndcg_at_1 29.402833333333334
type value
ndcg_at_10 39.403166666666664
type value
ndcg_at_100 44.66408333333333
type value
ndcg_at_1000 46.96283333333333
type value
ndcg_at_3 34.46633333333334
type value
ndcg_at_5 36.78441666666667
type value
precision_at_1 29.402833333333334
type value
precision_at_10 6.965833333333333
type value
precision_at_100 1.1330833333333334
type value
precision_at_1000 0.15158333333333335
type value
precision_at_3 15.886666666666665
type value
precision_at_5 11.360416666666667
type value
recall_at_1 24.949749999999998
type value
recall_at_10 51.29325
type value
recall_at_100 74.3695
type value
recall_at_1000 90.31299999999999
type value
recall_at_3 37.580083333333334
type value
recall_at_5 43.529666666666664
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackStatsRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 22.081999999999997
type value
map_at_10 29.215999999999998
type value
map_at_100 30.163
type value
map_at_1000 30.269000000000002
type value
map_at_3 26.942
type value
map_at_5 28.236
type value
mrr_at_1 24.847
type value
mrr_at_10 31.918999999999997
type value
mrr_at_100 32.817
type value
mrr_at_1000 32.897
type value
mrr_at_3 29.831000000000003
type value
mrr_at_5 31.019999999999996
type value
ndcg_at_1 24.847
type value
ndcg_at_10 33.4
type value
ndcg_at_100 38.354
type value
ndcg_at_1000 41.045
type value
ndcg_at_3 29.236
type value
ndcg_at_5 31.258000000000003
type value
precision_at_1 24.847
type value
precision_at_10 5.353
type value
precision_at_100 0.853
type value
precision_at_1000 0.116
type value
precision_at_3 12.679000000000002
type value
precision_at_5 8.988
type value
recall_at_1 22.081999999999997
type value
recall_at_10 43.505
type value
recall_at_100 66.45400000000001
type value
recall_at_1000 86.378
type value
recall_at_3 32.163000000000004
type value
recall_at_5 37.059999999999995
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackTexRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 15.540000000000001
type value
map_at_10 22.362000000000002
type value
map_at_100 23.435
type value
map_at_1000 23.564
type value
map_at_3 20.143
type value
map_at_5 21.324
type value
mrr_at_1 18.892
type value
mrr_at_10 25.942999999999998
type value
mrr_at_100 26.883000000000003
type value
mrr_at_1000 26.968999999999998
type value
mrr_at_3 23.727
type value
mrr_at_5 24.923000000000002
type value
ndcg_at_1 18.892
type value
ndcg_at_10 26.811
type value
ndcg_at_100 32.066
type value
ndcg_at_1000 35.166
type value
ndcg_at_3 22.706
type value
ndcg_at_5 24.508
type value
precision_at_1 18.892
type value
precision_at_10 4.942
type value
precision_at_100 0.878
type value
precision_at_1000 0.131
type value
precision_at_3 10.748000000000001
type value
precision_at_5 7.784000000000001
type value
recall_at_1 15.540000000000001
type value
recall_at_10 36.742999999999995
type value
recall_at_100 60.525
type value
recall_at_1000 82.57600000000001
type value
recall_at_3 25.252000000000002
type value
recall_at_5 29.872
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackUnixRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 24.453
type value
map_at_10 33.363
type value
map_at_100 34.579
type value
map_at_1000 34.686
type value
map_at_3 30.583
type value
map_at_5 32.118
type value
mrr_at_1 28.918
type value
mrr_at_10 37.675
type value
mrr_at_100 38.567
type value
mrr_at_1000 38.632
type value
mrr_at_3 35.260999999999996
type value
mrr_at_5 36.576
type value
ndcg_at_1 28.918
type value
ndcg_at_10 38.736
type value
ndcg_at_100 44.261
type value
ndcg_at_1000 46.72
type value
ndcg_at_3 33.81
type value
ndcg_at_5 36.009
type value
precision_at_1 28.918
type value
precision_at_10 6.586
type value
precision_at_100 1.047
type value
precision_at_1000 0.13699999999999998
type value
precision_at_3 15.360999999999999
type value
precision_at_5 10.857999999999999
type value
recall_at_1 24.453
type value
recall_at_10 50.885999999999996
type value
recall_at_100 75.03
type value
recall_at_1000 92.123
type value
recall_at_3 37.138
type value
recall_at_5 42.864999999999995
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackWebmastersRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 24.57
type value
map_at_10 33.672000000000004
type value
map_at_100 35.244
type value
map_at_1000 35.467
type value
map_at_3 30.712
type value
map_at_5 32.383
type value
mrr_at_1 29.644
type value
mrr_at_10 38.344
type value
mrr_at_100 39.219
type value
mrr_at_1000 39.282000000000004
type value
mrr_at_3 35.771
type value
mrr_at_5 37.273
type value
ndcg_at_1 29.644
type value
ndcg_at_10 39.567
type value
ndcg_at_100 45.097
type value
ndcg_at_1000 47.923
type value
ndcg_at_3 34.768
type value
ndcg_at_5 37.122
type value
precision_at_1 29.644
type value
precision_at_10 7.5889999999999995
type value
precision_at_100 1.478
type value
precision_at_1000 0.23500000000000001
type value
precision_at_3 16.337
type value
precision_at_5 12.055
type value
recall_at_1 24.57
type value
recall_at_10 51.00900000000001
type value
recall_at_100 75.423
type value
recall_at_1000 93.671
type value
recall_at_3 36.925999999999995
type value
recall_at_5 43.245
task dataset metrics
type
Retrieval
type name config split revision
BeIR/cqadupstack MTEB CQADupstackWordpressRetrieval default test 2b9f5791698b5be7bc5e10535c8690f20043c3db
type value
map_at_1 21.356
type value
map_at_10 27.904
type value
map_at_100 28.938000000000002
type value
map_at_1000 29.036
type value
map_at_3 25.726
type value
map_at_5 26.935
type value
mrr_at_1 22.551
type value
mrr_at_10 29.259
type value
mrr_at_100 30.272
type value
mrr_at_1000 30.348000000000003
type value
mrr_at_3 27.295
type value
mrr_at_5 28.358
type value
ndcg_at_1 22.551
type value
ndcg_at_10 31.817
type value
ndcg_at_100 37.164
type value
ndcg_at_1000 39.82
type value
ndcg_at_3 27.595999999999997
type value
ndcg_at_5 29.568
type value
precision_at_1 22.551
type value
precision_at_10 4.917
type value
precision_at_100 0.828
type value
precision_at_1000 0.11399999999999999
type value
precision_at_3 11.583
type value
precision_at_5 8.133
type value
recall_at_1 21.356
type value
recall_at_10 42.489
type value
recall_at_100 67.128
type value
recall_at_1000 87.441
type value
recall_at_3 31.165
type value
recall_at_5 35.853
task dataset metrics
type
Retrieval
type name config split revision
climate-fever MTEB ClimateFEVER default test 392b78eb68c07badcd7c2cd8f39af108375dfcce
type value
map_at_1 12.306000000000001
type value
map_at_10 21.523
type value
map_at_100 23.358
type value
map_at_1000 23.541
type value
map_at_3 17.809
type value
map_at_5 19.631
type value
mrr_at_1 27.948
type value
mrr_at_10 40.355000000000004
type value
mrr_at_100 41.166000000000004
type value
mrr_at_1000 41.203
type value
mrr_at_3 36.819
type value
mrr_at_5 38.958999999999996
type value
ndcg_at_1 27.948
type value
ndcg_at_10 30.462
type value
ndcg_at_100 37.473
type value
ndcg_at_1000 40.717999999999996
type value
ndcg_at_3 24.646
type value
ndcg_at_5 26.642
type value
precision_at_1 27.948
type value
precision_at_10 9.648
type value
precision_at_100 1.7239999999999998
type value
precision_at_1000 0.232
type value
precision_at_3 18.48
type value
precision_at_5 14.293
type value
recall_at_1 12.306000000000001
type value
recall_at_10 37.181
type value
recall_at_100 61.148
type value
recall_at_1000 79.401
type value
recall_at_3 22.883
type value
recall_at_5 28.59
task dataset metrics
type
Retrieval
type name config split revision
dbpedia-entity MTEB DBPedia default test f097057d03ed98220bc7309ddb10b71a54d667d6
type value
map_at_1 9.357
type value
map_at_10 18.849
type value
map_at_100 25.369000000000003
type value
map_at_1000 26.950000000000003
type value
map_at_3 13.625000000000002
type value
map_at_5 15.956999999999999
type value
mrr_at_1 67.75
type value
mrr_at_10 74.734
type value
mrr_at_100 75.1
type value
mrr_at_1000 75.10900000000001
type value
mrr_at_3 73.542
type value
mrr_at_5 74.167
type value
ndcg_at_1 55.375
type value
ndcg_at_10 39.873999999999995
type value
ndcg_at_100 43.098
type value
ndcg_at_1000 50.69200000000001
type value
ndcg_at_3 44.856
type value
ndcg_at_5 42.138999999999996
type value
precision_at_1 67.75
type value
precision_at_10 31.1
type value
precision_at_100 9.303
type value
precision_at_1000 2.0060000000000002
type value
precision_at_3 48.25
type value
precision_at_5 40.949999999999996
type value
recall_at_1 9.357
type value
recall_at_10 23.832
type value
recall_at_100 47.906
type value
recall_at_1000 71.309
type value
recall_at_3 14.512
type value
recall_at_5 18.3
task dataset metrics
type
Classification
type name config split revision
mteb/emotion MTEB EmotionClassification default test 829147f8f75a25f005913200eb5ed41fae320aa1
type value
accuracy 49.655
type value
f1 45.51976190938951
task dataset metrics
type
Retrieval
type name config split revision
fever MTEB FEVER default test 1429cf27e393599b8b359b9b72c666f96b2525f9
type value
map_at_1 62.739999999999995
type value
map_at_10 73.07000000000001
type value
map_at_100 73.398
type value
map_at_1000 73.41
type value
map_at_3 71.33800000000001
type value
map_at_5 72.423
type value
mrr_at_1 67.777
type value
mrr_at_10 77.873
type value
mrr_at_100 78.091
type value
mrr_at_1000 78.094
type value
mrr_at_3 76.375
type value
mrr_at_5 77.316
type value
ndcg_at_1 67.777
type value
ndcg_at_10 78.24
type value
ndcg_at_100 79.557
type value
ndcg_at_1000 79.814
type value
ndcg_at_3 75.125
type value
ndcg_at_5 76.834
type value
precision_at_1 67.777
type value
precision_at_10 9.832
type value
precision_at_100 1.061
type value
precision_at_1000 0.11
type value
precision_at_3 29.433
type value
precision_at_5 18.665000000000003
type value
recall_at_1 62.739999999999995
type value
recall_at_10 89.505
type value
recall_at_100 95.102
type value
recall_at_1000 96.825
type value
recall_at_3 81.028
type value
recall_at_5 85.28099999999999
task dataset metrics
type
Retrieval
type name config split revision
fiqa MTEB FiQA2018 default test 41b686a7f28c59bcaaa5791efd47c67c8ebe28be
type value
map_at_1 18.467
type value
map_at_10 30.020999999999997
type value
map_at_100 31.739
type value
map_at_1000 31.934
type value
map_at_3 26.003
type value
map_at_5 28.338
type value
mrr_at_1 35.339999999999996
type value
mrr_at_10 44.108999999999995
type value
mrr_at_100 44.993
type value
mrr_at_1000 45.042
type value
mrr_at_3 41.667
type value
mrr_at_5 43.14
type value
ndcg_at_1 35.339999999999996
type value
ndcg_at_10 37.202
type value
ndcg_at_100 43.852999999999994
type value
ndcg_at_1000 47.235
type value
ndcg_at_3 33.5
type value
ndcg_at_5 34.985
type value
precision_at_1 35.339999999999996
type value
precision_at_10 10.247
type value
precision_at_100 1.7149999999999999
type value
precision_at_1000 0.232
type value
precision_at_3 22.222
type value
precision_at_5 16.573999999999998
type value
recall_at_1 18.467
type value
recall_at_10 44.080999999999996
type value
recall_at_100 68.72200000000001
type value
recall_at_1000 89.087
type value
recall_at_3 30.567
type value
recall_at_5 36.982
task dataset metrics
type
Retrieval
type name config split revision
hotpotqa MTEB HotpotQA default test 766870b35a1b9ca65e67a0d1913899973551fc6c
type value
map_at_1 35.726
type value
map_at_10 50.207
type value
map_at_100 51.05499999999999
type value
map_at_1000 51.12799999999999
type value
map_at_3 47.576
type value
map_at_5 49.172
type value
mrr_at_1 71.452
type value
mrr_at_10 77.41900000000001
type value
mrr_at_100 77.711
type value
mrr_at_1000 77.723
type value
mrr_at_3 76.39399999999999
type value
mrr_at_5 77.00099999999999
type value
ndcg_at_1 71.452
type value
ndcg_at_10 59.260999999999996
type value
ndcg_at_100 62.424
type value
ndcg_at_1000 63.951
type value
ndcg_at_3 55.327000000000005
type value
ndcg_at_5 57.416999999999994
type value
precision_at_1 71.452
type value
precision_at_10 12.061
type value
precision_at_100 1.455
type value
precision_at_1000 0.166
type value
precision_at_3 34.36
type value
precision_at_5 22.266
type value
recall_at_1 35.726
type value
recall_at_10 60.304
type value
recall_at_100 72.75500000000001
type value
recall_at_1000 82.978
type value
recall_at_3 51.54
type value
recall_at_5 55.665
task dataset metrics
type
Classification
type name config split revision
mteb/imdb MTEB ImdbClassification default test 8d743909f834c38949e8323a8a6ce8721ea6c7f4
type value
accuracy 66.63759999999999
type value
ap 61.48938261286748
type value
f1 66.35089269264965
task dataset metrics
type
Retrieval
type name config split revision
msmarco MTEB MSMARCO default validation e6838a846e2408f22cf5cc337ebc83e0bcf77849
type value
map_at_1 20.842
type value
map_at_10 32.992
type value
map_at_100 34.236
type value
map_at_1000 34.286
type value
map_at_3 29.049000000000003
type value
map_at_5 31.391999999999996
type value
mrr_at_1 21.375
type value
mrr_at_10 33.581
type value
mrr_at_100 34.760000000000005
type value
mrr_at_1000 34.803
type value
mrr_at_3 29.704000000000004
type value
mrr_at_5 32.015
type value
ndcg_at_1 21.375
type value
ndcg_at_10 39.905
type value
ndcg_at_100 45.843
type value
ndcg_at_1000 47.083999999999996
type value
ndcg_at_3 31.918999999999997
type value
ndcg_at_5 36.107
type value
precision_at_1 21.375
type value
precision_at_10 6.393
type value
precision_at_100 0.935
type value
precision_at_1000 0.104
type value
precision_at_3 13.663
type value
precision_at_5 10.324
type value
recall_at_1 20.842
type value
recall_at_10 61.17
type value
recall_at_100 88.518
type value
recall_at_1000 97.993
type value
recall_at_3 39.571
type value
recall_at_5 49.653999999999996
task dataset metrics
type
Classification
type name config split revision
mteb/mtop_domain MTEB MTOPDomainClassification (en) en test a7e2a951126a26fc8c6a69f835f33a346ba259e3
type value
accuracy 93.46557227542178
type value
f1 92.87345917772146
task dataset metrics
type
Classification
type name config split revision
mteb/mtop_intent MTEB MTOPIntentClassification (en) en test 6299947a7777084cc2d4b64235bf7190381ce755
type value
accuracy 72.42134062927497
type value
f1 55.03624810959269
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_massive_intent MTEB MassiveIntentClassification (en) en test 072a486a144adf7f4479a4a0dddb2152e161e1ea
type value
accuracy 70.3866845998655
type value
f1 68.9674519872921
task dataset metrics
type
Classification
type name config split revision
mteb/amazon_massive_scenario MTEB MassiveScenarioClassification (en) en test 7d571f92784cd94a019292a1f45445077d0ef634
type value
accuracy 76.27774041694687
type value
f1 76.72936190462792
task dataset metrics
type
Clustering
type name config split revision
mteb/medrxiv-clustering-p2p MTEB MedrxivClusteringP2P default test dcefc037ef84348e49b0d29109e891c01067226b
type value
v_measure 31.511745925773337
task dataset metrics
type
Clustering
type name config split revision
mteb/medrxiv-clustering-s2s MTEB MedrxivClusteringS2S default test 3cd0e71dfbe09d4de0f9e5ecba43e7ce280959dc
type value
v_measure 28.764235987575365
task dataset metrics
type
Reranking
type name config split revision
mteb/mind_small MTEB MindSmallReranking default test 3bdac13927fdc888b903db93b2ffdbd90b295a69
type value
map 32.29353136386601
type value
mrr 33.536774455851685
task dataset metrics
type
Retrieval
type name config split revision
nfcorpus MTEB NFCorpus default test 7eb63cc0c1eb59324d709ebed25fcab851fa7610
type value
map_at_1 5.702
type value
map_at_10 13.642000000000001
type value
map_at_100 17.503
type value
map_at_1000 19.126
type value
map_at_3 9.748
type value
map_at_5 11.642
type value
mrr_at_1 45.82
type value
mrr_at_10 54.821
type value
mrr_at_100 55.422000000000004
type value
mrr_at_1000 55.452999999999996
type value
mrr_at_3 52.373999999999995
type value
mrr_at_5 53.937000000000005
type value
ndcg_at_1 44.272
type value
ndcg_at_10 36.213
type value
ndcg_at_100 33.829
type value
ndcg_at_1000 42.557
type value
ndcg_at_3 40.814
type value
ndcg_at_5 39.562000000000005
type value
precision_at_1 45.511
type value
precision_at_10 27.214
type value
precision_at_100 8.941
type value
precision_at_1000 2.1870000000000003
type value
precision_at_3 37.874
type value
precision_at_5 34.489
type value
recall_at_1 5.702
type value
recall_at_10 17.638
type value
recall_at_100 34.419
type value
recall_at_1000 66.41
type value
recall_at_3 10.914
type value
recall_at_5 14.032
task dataset metrics
type
Retrieval
type name config split revision
nq MTEB NQ default test 6062aefc120bfe8ece5897809fb2e53bfe0d128c
type value
map_at_1 30.567
type value
map_at_10 45.01
type value
map_at_100 46.091
type value
map_at_1000 46.126
type value
map_at_3 40.897
type value
map_at_5 43.301
type value
mrr_at_1 34.56
type value
mrr_at_10 47.725
type value
mrr_at_100 48.548
type value
mrr_at_1000 48.571999999999996
type value
mrr_at_3 44.361
type value
mrr_at_5 46.351
type value
ndcg_at_1 34.531
type value
ndcg_at_10 52.410000000000004
type value
ndcg_at_100 56.999
type value
ndcg_at_1000 57.830999999999996
type value
ndcg_at_3 44.734
type value
ndcg_at_5 48.701
type value
precision_at_1 34.531
type value
precision_at_10 8.612
type value
precision_at_100 1.118
type value
precision_at_1000 0.12
type value
precision_at_3 20.307
type value
precision_at_5 14.519000000000002
type value
recall_at_1 30.567
type value
recall_at_10 72.238
type value
recall_at_100 92.154
type value
recall_at_1000 98.375
type value
recall_at_3 52.437999999999995
type value
recall_at_5 61.516999999999996
task dataset metrics
type
Retrieval
type name config split revision
quora MTEB QuoraRetrieval default test 6205996560df11e3a3da9ab4f926788fc30a7db4
type value
map_at_1 65.98
type value
map_at_10 80.05600000000001
type value
map_at_100 80.76299999999999
type value
map_at_1000 80.786
type value
map_at_3 76.848
type value
map_at_5 78.854
type value
mrr_at_1 75.86
type value
mrr_at_10 83.397
type value
mrr_at_100 83.555
type value
mrr_at_1000 83.557
type value
mrr_at_3 82.033
type value
mrr_at_5 82.97
type value
ndcg_at_1 75.88000000000001
type value
ndcg_at_10 84.58099999999999
type value
ndcg_at_100 86.151
type value
ndcg_at_1000 86.315
type value
ndcg_at_3 80.902
type value
ndcg_at_5 82.953
type value
precision_at_1 75.88000000000001
type value
precision_at_10 12.986
type value
precision_at_100 1.5110000000000001
type value
precision_at_1000 0.156
type value
precision_at_3 35.382999999999996
type value
precision_at_5 23.555999999999997
type value
recall_at_1 65.98
type value
recall_at_10 93.716
type value
recall_at_100 99.21799999999999
type value
recall_at_1000 99.97
type value
recall_at_3 83.551
type value
recall_at_5 88.998
task dataset metrics
type
Clustering
type name config split revision
mteb/reddit-clustering MTEB RedditClustering default test b2805658ae38990172679479369a78b86de8c390
type value
v_measure 40.45148482612238
task dataset metrics
type
Clustering
type name config split revision
mteb/reddit-clustering-p2p MTEB RedditClusteringP2P default test 385e3cb46b4cfa89021f56c4380204149d0efe33
type value
v_measure 55.749490673039126
task dataset metrics
type
Retrieval
type name config split revision
scidocs MTEB SCIDOCS default test 5c59ef3e437a0a9651c8fe6fde943e7dce59fba5
type value
map_at_1 4.903
type value
map_at_10 11.926
type value
map_at_100 13.916999999999998
type value
map_at_1000 14.215
type value
map_at_3 8.799999999999999
type value
map_at_5 10.360999999999999
type value
mrr_at_1 24.099999999999998
type value
mrr_at_10 34.482
type value
mrr_at_100 35.565999999999995
type value
mrr_at_1000 35.619
type value
mrr_at_3 31.433
type value
mrr_at_5 33.243
type value
ndcg_at_1 24.099999999999998
type value
ndcg_at_10 19.872999999999998
type value
ndcg_at_100 27.606
type value
ndcg_at_1000 32.811
type value
ndcg_at_3 19.497999999999998
type value
ndcg_at_5 16.813
type value
precision_at_1 24.099999999999998
type value
precision_at_10 10.08
type value
precision_at_100 2.122
type value
precision_at_1000 0.337
type value
precision_at_3 18.2
type value
precision_at_5 14.62
type value
recall_at_1 4.903
type value
recall_at_10 20.438000000000002
type value
recall_at_100 43.043
type value
recall_at_1000 68.41000000000001
type value
recall_at_3 11.068
type value
recall_at_5 14.818000000000001
task dataset metrics
type
STS
type name config split revision
mteb/sickr-sts MTEB SICK-R default test 20a6d6f312dd54037fe07a32d58e5e168867909d
type value
cos_sim_pearson 78.58086597995997
type value
cos_sim_spearman 69.63214182814991
type value
euclidean_pearson 72.76175489042691
type value
euclidean_spearman 67.84965161872971
type value
manhattan_pearson 72.73812689782592
type value
manhattan_spearman 67.83610439531277
task dataset metrics
type
STS
type name config split revision
mteb/sts12-sts MTEB STS12 default test fdf84275bb8ce4b49c971d02e84dd1abc677a50f
type value
cos_sim_pearson 75.13970861325006
type value
cos_sim_spearman 67.5020551515597
type value
euclidean_pearson 66.33415412418276
type value
euclidean_spearman 66.82145056673268
type value
manhattan_pearson 66.55489484006415
type value
manhattan_spearman 66.95147433279057
task dataset metrics
type
STS
type name config split revision
mteb/sts13-sts MTEB STS13 default test 1591bfcbe8c69d4bf7fe2a16e2451017832cafb9
type value
cos_sim_pearson 78.85850536483447
type value
cos_sim_spearman 79.1633350177206
type value
euclidean_pearson 72.74090561408477
type value
euclidean_spearman 73.57374448302961
type value
manhattan_pearson 72.92980654233226
type value
manhattan_spearman 73.72777155112588
task dataset metrics
type
STS
type name config split revision
mteb/sts14-sts MTEB STS14 default test e2125984e7df8b7871f6ae9949cf6b6795e7c54b
type value
cos_sim_pearson 79.51125593897028
type value
cos_sim_spearman 74.46048326701329
type value
euclidean_pearson 70.87726087052985
type value
euclidean_spearman 67.7721470654411
type value
manhattan_pearson 71.05892792135637
type value
manhattan_spearman 67.93472619779037
task dataset metrics
type
STS
type name config split revision
mteb/sts15-sts MTEB STS15 default test 1cd7298cac12a96a373b6a2f18738bb3e739a9b6
type value
cos_sim_pearson 83.8299348880489
type value
cos_sim_spearman 84.47194637929275
type value
euclidean_pearson 78.68768462480418
type value
euclidean_spearman 79.80526323901917
type value
manhattan_pearson 78.6810718151946
type value
manhattan_spearman 79.7820584821254
task dataset metrics
type
STS
type name config split revision
mteb/sts16-sts MTEB STS16 default test 360a0b2dff98700d09e634a01e1cc1624d3e42cd
type value
cos_sim_pearson 79.99206664843005
type value
cos_sim_spearman 80.96089203722137
type value
euclidean_pearson 71.31216213716365
type value
euclidean_spearman 71.45258140049407
type value
manhattan_pearson 71.26140340402836
type value
manhattan_spearman 71.3896894666943
task dataset metrics
type
STS
type name config split revision
mteb/sts17-crosslingual-sts MTEB STS17 (en-en) en-en test 9fc37e8c632af1c87a3d23e685d49552a02582a0
type value
cos_sim_pearson 87.35697089594868
type value
cos_sim_spearman 87.78202647220289
type value
euclidean_pearson 84.20969668786667
type value
euclidean_spearman 83.91876425459982
type value
manhattan_pearson 84.24429755612542
type value
manhattan_spearman 83.98826315103398
task dataset metrics
type
STS
type name config split revision
mteb/sts22-crosslingual-sts MTEB STS22 (en) en test 2de6ce8c1921b71a755b262c6b57fef195dd7906
type value
cos_sim_pearson 69.06962775868384
type value
cos_sim_spearman 69.34889515492327
type value
euclidean_pearson 69.28108180412313
type value
euclidean_spearman 69.6437114853659
type value
manhattan_pearson 69.39974983734993
type value
manhattan_spearman 69.69057284482079
task dataset metrics
type
STS
type name config split revision
mteb/stsbenchmark-sts MTEB STSBenchmark default test 8913289635987208e6e7c72789e4be2fe94b6abd
type value
cos_sim_pearson 82.42553734213958
type value
cos_sim_spearman 81.38977341532744
type value
euclidean_pearson 76.47494587945522
type value
euclidean_spearman 75.92794860531089
type value
manhattan_pearson 76.4768777169467
type value
manhattan_spearman 75.9252673228599
task dataset metrics
type
Reranking
type name config split revision
mteb/scidocs-reranking MTEB SciDocsRR default test 56a6d0140cf6356659e2a7c1413286a774468d44
type value
map 80.78825425914722
type value
mrr 94.60017197762296
task dataset metrics
type
Retrieval
type name config split revision
scifact MTEB SciFact default test a75ae049398addde9b70f6b268875f5cbce99089
type value
map_at_1 60.633
type value
map_at_10 70.197
type value
map_at_100 70.758
type value
map_at_1000 70.765
type value
map_at_3 67.082
type value
map_at_5 69.209
type value
mrr_at_1 63.333
type value
mrr_at_10 71.17
type value
mrr_at_100 71.626
type value
mrr_at_1000 71.633
type value
mrr_at_3 68.833
type value
mrr_at_5 70.6
type value
ndcg_at_1 63.333
type value
ndcg_at_10 74.697
type value
ndcg_at_100 76.986
type value
ndcg_at_1000 77.225
type value
ndcg_at_3 69.527
type value
ndcg_at_5 72.816
type value
precision_at_1 63.333
type value
precision_at_10 9.9
type value
precision_at_100 1.103
type value
precision_at_1000 0.11199999999999999
type value
precision_at_3 26.889000000000003
type value
precision_at_5 18.2
type value
recall_at_1 60.633
type value
recall_at_10 87.36699999999999
type value
recall_at_100 97.333
type value
recall_at_1000 99.333
type value
recall_at_3 73.656
type value
recall_at_5 82.083
task dataset metrics
type
PairClassification
type name config split revision
mteb/sprintduplicatequestions-pairclassification MTEB SprintDuplicateQuestions default test 5a8256d0dff9c4bd3be3ba3e67e4e70173f802ea
type value
cos_sim_accuracy 99.76633663366337
type value
cos_sim_ap 93.84024096781063
type value
cos_sim_f1 88.08080808080808
type value
cos_sim_precision 88.9795918367347
type value
cos_sim_recall 87.2
type value
dot_accuracy 99.46336633663367
type value
dot_ap 75.78127156965245
type value
dot_f1 71.41403865717193
type value
dot_precision 72.67080745341616
type value
dot_recall 70.19999999999999
type value
euclidean_accuracy 99.67524752475248
type value
euclidean_ap 88.61274955249769
type value
euclidean_f1 82.30852211434735
type value
euclidean_precision 89.34426229508196
type value
euclidean_recall 76.3
type value
manhattan_accuracy 99.67722772277227
type value
manhattan_ap 88.77516158012779
type value
manhattan_f1 82.36536430834212
type value
manhattan_precision 87.24832214765101
type value
manhattan_recall 78.0
type value
max_accuracy 99.76633663366337
type value
max_ap 93.84024096781063
type value
max_f1 88.08080808080808
task dataset metrics
type
Clustering
type name config split revision
mteb/stackexchange-clustering MTEB StackExchangeClustering default test 70a89468f6dccacc6aa2b12a6eac54e74328f235
type value
v_measure 59.20812266121527
task dataset metrics
type
Clustering
type name config split revision
mteb/stackexchange-clustering-p2p MTEB StackExchangeClusteringP2P default test d88009ab563dd0b16cfaf4436abaf97fa3550cf0
type value
v_measure 33.954248554638056
task dataset metrics
type
Reranking
type name config split revision
mteb/stackoverflowdupquestions-reranking MTEB StackOverflowDupQuestions default test ef807ea29a75ec4f91b50fd4191cb4ee4589a9f9
type value
map 51.52800990025549
type value
mrr 52.360394915541974
task dataset metrics
type
Summarization
type name config split revision
mteb/summeval MTEB SummEval default test 8753c2788d36c01fc6f05d03fe3f7268d63f9122
type value
cos_sim_pearson 30.737881131277356
type value
cos_sim_spearman 31.45979323917254
type value
dot_pearson 26.24686017962023
type value
dot_spearman 25.006732878791743
task dataset metrics
type
Retrieval
type name config split revision
trec-covid MTEB TRECCOVID default test 2c8041b2c07a79b6f7ba8fe6acc72e5d9f92d217
type value
map_at_1 0.253
type value
map_at_10 2.1399999999999997
type value
map_at_100 12.873000000000001
type value
map_at_1000 31.002000000000002
type value
map_at_3 0.711
type value
map_at_5 1.125
type value
mrr_at_1 96.0
type value
mrr_at_10 98.0
type value
mrr_at_100 98.0
type value
mrr_at_1000 98.0
type value
mrr_at_3 98.0
type value
mrr_at_5 98.0
type value
ndcg_at_1 94.0
type value
ndcg_at_10 84.881
type value
ndcg_at_100 64.694
type value
ndcg_at_1000 56.85
type value
ndcg_at_3 90.061
type value
ndcg_at_5 87.155
type value
precision_at_1 96.0
type value
precision_at_10 88.8
type value
precision_at_100 65.7
type value
precision_at_1000 25.080000000000002
type value
precision_at_3 92.667
type value
precision_at_5 90.0
type value
recall_at_1 0.253
type value
recall_at_10 2.292
type value
recall_at_100 15.78
type value
recall_at_1000 53.015
type value
recall_at_3 0.7270000000000001
type value
recall_at_5 1.162
task dataset metrics
type
Retrieval
type name config split revision
webis-touche2020 MTEB Touche2020 default test 527b7d77e16e343303e68cb6af11d6e18b9f7b3b
type value
map_at_1 2.116
type value
map_at_10 9.625
type value
map_at_100 15.641
type value
map_at_1000 17.127
type value
map_at_3 4.316
type value
map_at_5 6.208
type value
mrr_at_1 32.653
type value
mrr_at_10 48.083999999999996
type value
mrr_at_100 48.631
type value
mrr_at_1000 48.649
type value
mrr_at_3 42.857
type value
mrr_at_5 46.224
type value
ndcg_at_1 29.592000000000002
type value
ndcg_at_10 25.430999999999997
type value
ndcg_at_100 36.344
type value
ndcg_at_1000 47.676
type value
ndcg_at_3 26.144000000000002
type value
ndcg_at_5 26.304
type value
precision_at_1 32.653
type value
precision_at_10 24.082
type value
precision_at_100 7.714
type value
precision_at_1000 1.5310000000000001
type value
precision_at_3 26.531
type value
precision_at_5 26.939
type value
recall_at_1 2.116
type value
recall_at_10 16.794
type value
recall_at_100 47.452
type value
recall_at_1000 82.312
type value
recall_at_3 5.306
type value
recall_at_5 9.306000000000001
task dataset metrics
type
Classification
type name config split revision
mteb/toxic_conversations_50k MTEB ToxicConversationsClassification default test edfaf9da55d3dd50d43143d90c1ac476895ae6de
type value
accuracy 67.709
type value
ap 13.541535578501716
type value
f1 52.569619919446794
task dataset metrics
type
Classification
type name config split revision
mteb/tweet_sentiment_extraction MTEB TweetSentimentExtractionClassification default test 62146448f05be9e52a36b8ee9936447ea787eede
type value
accuracy 56.850594227504246
type value
f1 57.233377364910574
task dataset metrics
type
Clustering
type name config split revision
mteb/twentynewsgroups-clustering MTEB TwentyNewsgroupsClustering default test 091a54f9a36281ce7d6590ec8c75dd485e7e01d4
type value
v_measure 39.463722986090474
task dataset metrics
type
PairClassification
type name config split revision
mteb/twittersemeval2015-pairclassification MTEB TwitterSemEval2015 default test 70970daeab8776df92f5ea462b6173c0b46fd2d1
type value
cos_sim_accuracy 84.09131549144662
type value
cos_sim_ap 66.86677647503386
type value
cos_sim_f1 62.94631710362049
type value
cos_sim_precision 59.73933649289099
type value
cos_sim_recall 66.51715039577837
type value
dot_accuracy 80.27656911247541
type value
dot_ap 54.291720398612085
type value
dot_f1 54.77150537634409
type value
dot_precision 47.58660957571039
type value
dot_recall 64.5118733509235
type value
euclidean_accuracy 82.76211480002385
type value
euclidean_ap 62.430397690753296
type value
euclidean_f1 59.191590539356774
type value
euclidean_precision 56.296119971435374
type value
euclidean_recall 62.401055408970976
type value
manhattan_accuracy 82.7561542588067
type value
manhattan_ap 62.41882051995577
type value
manhattan_f1 59.32101002778785
type value
manhattan_precision 54.71361711611321
type value
manhattan_recall 64.77572559366754
type value
max_accuracy 84.09131549144662
type value
max_ap 66.86677647503386
type value
max_f1 62.94631710362049
task dataset metrics
type
PairClassification
type name config split revision
mteb/twitterurlcorpus-pairclassification MTEB TwitterURLCorpus default test 8b6510b0b1fa4e4c4f879467980e9be563ec1cdf
type value
cos_sim_accuracy 88.79574649745798
type value
cos_sim_ap 85.28960532524223
type value
cos_sim_f1 77.98460043358001
type value
cos_sim_precision 75.78090948714224
type value
cos_sim_recall 80.32029565753002
type value
dot_accuracy 85.5939767920208
type value
dot_ap 76.14131706694056
type value
dot_f1 72.70246298696868
type value
dot_precision 65.27012127894156
type value
dot_recall 82.04496458269172
type value
euclidean_accuracy 86.72332828812046
type value
euclidean_ap 80.84854809178995
type value
euclidean_f1 72.47657499809551
type value
euclidean_precision 71.71717171717171
type value
euclidean_recall 73.25223283030489
type value
manhattan_accuracy 86.7563162184189
type value
manhattan_ap 80.87598895575626
type value
manhattan_f1 72.54617892068092
type value
manhattan_precision 68.49268225960881
type value
manhattan_recall 77.10963966738528
type value
max_accuracy 88.79574649745798
type value
max_ap 85.28960532524223
type value
max_f1 77.98460043358001

SGPT-5.8B-weightedmean-msmarco-specb-bitfit

Usage

For usage instructions, refer to our codebase: https://github.com/Muennighoff/sgpt

Evaluation Results

For eval results, refer to our paper: https://arxiv.org/abs/2202.08904

Training

The model was trained with the parameters:

DataLoader:

torch.utils.data.dataloader.DataLoader of length 249592 with parameters:

{'batch_size': 2, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}

Loss:

sentence_transformers.losses.MultipleNegativesRankingLoss.MultipleNegativesRankingLoss with parameters:

{'scale': 20.0, 'similarity_fct': 'cos_sim'}

Parameters of the fit()-Method:

{
    "epochs": 10,
    "evaluation_steps": 0,
    "evaluator": "NoneType",
    "max_grad_norm": 1,
    "optimizer_class": "<class 'transformers.optimization.AdamW'>",
    "optimizer_params": {
        "lr": 5e-05
    },
    "scheduler": "WarmupLinear",
    "steps_per_epoch": null,
    "warmup_steps": 1000,
    "weight_decay": 0.01
}

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 300, 'do_lower_case': False}) with Transformer model: GPTJModel 
  (1): Pooling({'word_embedding_dimension': 4096, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': True, 'pooling_mode_lasttoken': False})
)

Citing & Authors

@article{muennighoff2022sgpt,
  title={SGPT: GPT Sentence Embeddings for Semantic Search},
  author={Muennighoff, Niklas},
  journal={arXiv preprint arXiv:2202.08904},
  year={2022}
}