Files
FRIDA/README.md
ModelHub XC 5646ba790d 初始化项目,由ModelHub XC社区提供模型
Model: ai-forever/FRIDA
Source: Original Platform
2026-05-14 14:15:17 +08:00

54 KiB
Raw Permalink Blame History

model-index, license, language, tags, base_model, pipeline_tag, datasets
model-index license language tags base_model pipeline_tag datasets
name results
FRIDA
dataset metrics task
config name revision split type
default MTEB CEDRClassification (default) c0ba03d058e3e1b2f3fd20518875a4563dd12db4 test ai-forever/cedr-classification
type value
accuracy 64.60148777895856
type value
f1 70.36630348039266
type value
lrap 92.47290116896953
type value
main_score 64.60148777895856
type
MultilabelClassification
dataset metrics task
config name revision split type
default MTEB GeoreviewClassification (default) 3765c0d1de6b7d264bc459433c45e5a75513839c test ai-forever/georeview-classification
type value
accuracy 57.70996093750001
type value
f1 53.18542982057098
type value
f1_weighted 53.17663229582108
type value
main_score 57.70996093750001
type
Classification
dataset metrics task
config name revision split type
default MTEB GeoreviewClusteringP2P (default) 97a313c8fc85b47f13f33e7e9a95c1ad888c7fec test ai-forever/georeview-clustering-p2p
type value
main_score 78.25468393043356
type value
v_measure 78.25468393043356
type value
v_measure_std 0.5094366871364238
type
Clustering
dataset metrics task
config name revision split type
default MTEB HeadlineClassification (default) 2fe05ee6b5832cda29f2ef7aaad7b7fe6a3609eb test ai-forever/headline-classification
type value
accuracy 89.0185546875
type value
f1 88.993933120612
type value
f1_weighted 88.99276764225768
type value
main_score 89.0185546875
type
Classification
dataset metrics task
config name revision split type
default MTEB InappropriatenessClassification (default) 601651fdc45ef243751676e62dd7a19f491c0285 test ai-forever/inappropriateness-classification
type value
accuracy 78.330078125
type value
ap 73.17856750532495
type value
ap_weighted 73.17856750532495
type value
f1 78.20169867599041
type value
f1_weighted 78.20169867599041
type value
main_score 78.330078125
type
Classification
dataset metrics task
config name revision split type
default MTEB KinopoiskClassification (default) 5911f26666ac11af46cb9c6849d0dc80a378af24 test ai-forever/kinopoisk-sentiment-classification
type value
accuracy 70.46666666666665
type value
f1 65.83951766538878
type value
f1_weighted 65.83951766538878
type value
main_score 70.46666666666665
type
Classification
dataset metrics task
config name revision split type
ru MTEB MIRACLReranking (ru) 6d1962c527217f8927fca80f890f14f36b2802af dev miracl/mmteb-miracl-reranking
type value
MAP@1(MIRACL) 39.023
type value
MAP@10(MIRACL) 60.208
type value
MAP@100(MIRACL) 61.672000000000004
type value
MAP@1000(MIRACL) 61.672000000000004
type value
MAP@20(MIRACL) 61.30799999999999
type value
MAP@3(MIRACL) 53.33
type value
MAP@5(MIRACL) 57.289
type value
NDCG@1(MIRACL) 63.352
type value
NDCG@10(MIRACL) 66.042
type value
NDCG@100(MIRACL) 68.702
type value
NDCG@1000(MIRACL) 68.702
type value
NDCG@20(MIRACL) 67.768
type value
NDCG@3(MIRACL) 61.925
type value
NDCG@5(MIRACL) 63.327
type value
P@1(MIRACL) 63.352
type value
P@10(MIRACL) 16.512
type value
P@100(MIRACL) 1.9529999999999998
type value
P@1000(MIRACL) 0.19499999999999998
type value
P@20(MIRACL) 9.13
type value
P@3(MIRACL) 37.878
type value
P@5(MIRACL) 27.586
type value
Recall@1(MIRACL) 39.023
type value
Recall@10(MIRACL) 72.35000000000001
type value
Recall@100(MIRACL) 79.952
type value
Recall@1000(MIRACL) 79.952
type value
Recall@20(MIRACL) 76.828
type value
Recall@3(MIRACL) 57.769999999999996
type value
Recall@5(MIRACL) 64.91900000000001
type value
main_score 66.042
type value
nAUC_MAP@1000_diff1(MIRACL) 27.150388833033052
type value
nAUC_MAP@1000_max(MIRACL) 55.15672274267081
type value
nAUC_MAP@1000_std(MIRACL) 30.088939934575553
type value
nAUC_MAP@100_diff1(MIRACL) 27.150388833033052
type value
nAUC_MAP@100_max(MIRACL) 55.15672274267081
type value
nAUC_MAP@100_std(MIRACL) 30.088939934575553
type value
nAUC_MAP@10_diff1(MIRACL) 27.853691773641742
type value
nAUC_MAP@10_max(MIRACL) 52.89390350055654
type value
nAUC_MAP@10_std(MIRACL) 28.08732516551691
type value
nAUC_MAP@1_diff1(MIRACL) 43.23179150244192
type value
nAUC_MAP@1_max(MIRACL) 29.923943954188864
type value
nAUC_MAP@1_std(MIRACL) 7.447084370195121
type value
nAUC_MAP@20_diff1(MIRACL) 27.328384072311675
type value
nAUC_MAP@20_max(MIRACL) 54.60286379835721
type value
nAUC_MAP@20_std(MIRACL) 29.8084128980043
type value
nAUC_MAP@3_diff1(MIRACL) 31.244971536944554
type value
nAUC_MAP@3_max(MIRACL) 43.63984692803854
type value
nAUC_MAP@3_std(MIRACL) 18.609234683765887
type value
nAUC_MAP@5_diff1(MIRACL) 29.088760492638286
type value
nAUC_MAP@5_max(MIRACL) 48.30474364461509
type value
nAUC_MAP@5_std(MIRACL) 23.817514353844224
type value
nAUC_NDCG@1000_diff1(MIRACL) 23.12754356408408
type value
nAUC_NDCG@1000_max(MIRACL) 64.24894553363303
type value
nAUC_NDCG@1000_std(MIRACL) 38.19318050598967
type value
nAUC_NDCG@100_diff1(MIRACL) 23.12754356408408
type value
nAUC_NDCG@100_max(MIRACL) 64.24894553363303
type value
nAUC_NDCG@100_std(MIRACL) 38.19318050598967
type value
nAUC_NDCG@10_diff1(MIRACL) 24.779856373697275
type value
nAUC_NDCG@10_max(MIRACL) 60.4054459738118
type value
nAUC_NDCG@10_std(MIRACL) 35.148950441182784
type value
nAUC_NDCG@1_diff1(MIRACL) 35.605865569438556
type value
nAUC_NDCG@1_max(MIRACL) 65.77787399715454
type value
nAUC_NDCG@1_std(MIRACL) 34.34726892885082
type value
nAUC_NDCG@20_diff1(MIRACL) 23.71231783125691
type value
nAUC_NDCG@20_max(MIRACL) 62.89676599488004
type value
nAUC_NDCG@20_std(MIRACL) 37.697052941884316
type value
nAUC_NDCG@3_diff1(MIRACL) 26.109027741640865
type value
nAUC_NDCG@3_max(MIRACL) 56.22356793638693
type value
nAUC_NDCG@3_std(MIRACL) 29.9437568508688
type value
nAUC_NDCG@5_diff1(MIRACL) 25.98644715327336
type value
nAUC_NDCG@5_max(MIRACL) 56.25032008404774
type value
nAUC_NDCG@5_std(MIRACL) 31.581899860862578
type value
nAUC_P@1000_diff1(MIRACL) -18.29912787064644
type value
nAUC_P@1000_max(MIRACL) 31.811344878776087
type value
nAUC_P@1000_std(MIRACL) 30.163820183304914
type value
nAUC_P@100_diff1(MIRACL) -18.299127870646405
type value
nAUC_P@100_max(MIRACL) 31.811344878776133
type value
nAUC_P@100_std(MIRACL) 30.163820183304956
type value
nAUC_P@10_diff1(MIRACL) -15.96416268531149
type value
nAUC_P@10_max(MIRACL) 36.989578896466526
type value
nAUC_P@10_std(MIRACL) 34.54507111688143
type value
nAUC_P@1_diff1(MIRACL) 35.605865569438556
type value
nAUC_P@1_max(MIRACL) 65.77787399715454
type value
nAUC_P@1_std(MIRACL) 34.34726892885082
type value
nAUC_P@20_diff1(MIRACL) -17.443963421383287
type value
nAUC_P@20_max(MIRACL) 34.309618168778385
type value
nAUC_P@20_std(MIRACL) 33.38820956485373
type value
nAUC_P@3_diff1(MIRACL) -8.533621861815652
type value
nAUC_P@3_max(MIRACL) 45.90408386776497
type value
nAUC_P@3_std(MIRACL) 34.50459351305535
type value
nAUC_P@5_diff1(MIRACL) -13.207968899314865
type value
nAUC_P@5_max(MIRACL) 40.37718282248973
type value
nAUC_P@5_std(MIRACL) 35.601417332196206
type value
nAUC_Recall@1000_diff1(MIRACL) 7.907304198177226
type value
nAUC_Recall@1000_max(MIRACL) 77.82197832361145
type value
nAUC_Recall@1000_std(MIRACL) 52.66957487246724
type value
nAUC_Recall@100_diff1(MIRACL) 7.907304198177226
type value
nAUC_Recall@100_max(MIRACL) 77.82197832361145
type value
nAUC_Recall@100_std(MIRACL) 52.66957487246724
type value
nAUC_Recall@10_diff1(MIRACL) 15.498121023488693
type value
nAUC_Recall@10_max(MIRACL) 62.24320529338724
type value
nAUC_Recall@10_std(MIRACL) 40.60221460946224
type value
nAUC_Recall@1_diff1(MIRACL) 43.23179150244192
type value
nAUC_Recall@1_max(MIRACL) 29.923943954188864
type value
nAUC_Recall@1_std(MIRACL) 7.447084370195121
type value
nAUC_Recall@20_diff1(MIRACL) 11.457044176116248
type value
nAUC_Recall@20_max(MIRACL) 70.3493054342368
type value
nAUC_Recall@20_std(MIRACL) 49.27124296325928
type value
nAUC_Recall@3_diff1(MIRACL) 25.12077828977941
type value
nAUC_Recall@3_max(MIRACL) 42.903379317937166
type value
nAUC_Recall@3_std(MIRACL) 20.324501722161497
type value
nAUC_Recall@5_diff1(MIRACL) 20.925701235197977
type value
nAUC_Recall@5_max(MIRACL) 49.85323960390812
type value
nAUC_Recall@5_std(MIRACL) 29.04484539530469
type
Reranking
dataset metrics task
config name revision split type
ru MTEB MIRACLRetrieval (ru) main dev miracl/mmteb-miracl
type value
main_score 71.882
type value
map_at_1 37.913000000000004
type value
map_at_10 62.604000000000006
type value
map_at_100 64.925
type value
map_at_1000 64.992
type value
map_at_20 64.081
type value
map_at_3 55.212
type value
map_at_5 59.445
type value
mrr_at_1 73.24281150159744
type value
mrr_at_10 81.65043866321825
type value
mrr_at_100 81.85391378818977
type value
mrr_at_1000 81.85753390802569
type value
mrr_at_20 81.81045606130179
type value
mrr_at_3 80.56443024494146
type value
mrr_at_5 81.30724174653893
type value
nauc_map_at_1000_diff1 26.962150235593356
type value
nauc_map_at_1000_max 29.234958037854568
type value
nauc_map_at_1000_std -2.4294465103633884
type value
nauc_map_at_100_diff1 26.92990252114163
type value
nauc_map_at_100_max 29.206328533120118
type value
nauc_map_at_100_std -2.437371090941197
type value
nauc_map_at_10_diff1 25.758265691179226
type value
nauc_map_at_10_max 26.949978490795317
type value
nauc_map_at_10_std -5.484961002106038
type value
nauc_map_at_1_diff1 34.70849461278043
type value
nauc_map_at_1_max 12.778570893623042
type value
nauc_map_at_1_std -13.018292652743938
type value
nauc_map_at_20_diff1 26.659923008218268
type value
nauc_map_at_20_max 28.341440871568185
type value
nauc_map_at_20_std -3.614549844913084
type value
nauc_map_at_3_diff1 27.197629021438203
type value
nauc_map_at_3_max 20.701094874050856
type value
nauc_map_at_3_std -12.062992301112041
type value
nauc_map_at_5_diff1 25.51793537203295
type value
nauc_map_at_5_max 23.80396771243794
type value
nauc_map_at_5_std -8.920465695323575
type value
nauc_mrr_at_1000_diff1 45.14819989592967
type value
nauc_mrr_at_1000_max 53.29202156141053
type value
nauc_mrr_at_1000_std 18.037336462510524
type value
nauc_mrr_at_100_diff1 45.15287600228451
type value
nauc_mrr_at_100_max 53.29979751928615
type value
nauc_mrr_at_100_std 18.04996604778386
type value
nauc_mrr_at_10_diff1 44.96865105944474
type value
nauc_mrr_at_10_max 53.53323465323092
type value
nauc_mrr_at_10_std 18.25001344917689
type value
nauc_mrr_at_1_diff1 46.16604946873163
type value
nauc_mrr_at_1_max 48.573651103547874
type value
nauc_mrr_at_1_std 13.764871626330915
type value
nauc_mrr_at_20_diff1 45.11925458479102
type value
nauc_mrr_at_20_max 53.35685123898342
type value
nauc_mrr_at_20_std 18.127344968819905
type value
nauc_mrr_at_3_diff1 45.377195452730234
type value
nauc_mrr_at_3_max 53.35146309217089
type value
nauc_mrr_at_3_std 17.47105877186237
type value
nauc_mrr_at_5_diff1 45.00525578771549
type value
nauc_mrr_at_5_max 53.76227254707128
type value
nauc_mrr_at_5_std 18.437290060746957
type value
nauc_ndcg_at_1000_diff1 31.19215594457491
type value
nauc_ndcg_at_1000_max 38.09555406458668
type value
nauc_ndcg_at_1000_std 7.225628621238009
type value
nauc_ndcg_at_100_diff1 30.726331247999934
type value
nauc_ndcg_at_100_max 37.81369589418277
type value
nauc_ndcg_at_100_std 7.242855238555071
type value
nauc_ndcg_at_10_diff1 27.514048333744835
type value
nauc_ndcg_at_10_max 33.10990399385253
type value
nauc_ndcg_at_10_std 0.3051899572112002
type value
nauc_ndcg_at_1_diff1 47.06089085235751
type value
nauc_ndcg_at_1_max 47.7300872370495
type value
nauc_ndcg_at_1_std 12.468605493613916
type value
nauc_ndcg_at_20_diff1 29.404215438764496
type value
nauc_ndcg_at_20_max 35.26967886796471
type value
nauc_ndcg_at_20_std 3.7214697890813353
type value
nauc_ndcg_at_3_diff1 29.448848639643067
type value
nauc_ndcg_at_3_max 33.85912412370657
type value
nauc_ndcg_at_3_std 0.895453646819452
type value
nauc_ndcg_at_5_diff1 26.916649012613526
type value
nauc_ndcg_at_5_max 30.899005979291644
type value
nauc_ndcg_at_5_std -1.0001575639156615
type value
nauc_precision_at_1000_diff1 -8.492004667432635
type value
nauc_precision_at_1000_max 14.970190384017679
type value
nauc_precision_at_1000_std 32.871386621137816
type value
nauc_precision_at_100_diff1 -8.287314133999967
type value
nauc_precision_at_100_max 17.794821961284736
type value
nauc_precision_at_100_std 35.092483550562
type value
nauc_precision_at_10_diff1 -7.594128993028063
type value
nauc_precision_at_10_max 24.691446370325732
type value
nauc_precision_at_10_std 30.126552282608493
type value
nauc_precision_at_1_diff1 47.06089085235751
type value
nauc_precision_at_1_max 47.7300872370495
type value
nauc_precision_at_1_std 12.468605493613916
type value
nauc_precision_at_20_diff1 -6.503872195775146
type value
nauc_precision_at_20_max 21.789730053141312
type value
nauc_precision_at_20_std 32.61349377558794
type value
nauc_precision_at_3_diff1 0.67417079971061
type value
nauc_precision_at_3_max 30.793871354370662
type value
nauc_precision_at_3_std 18.35266479252011
type value
nauc_precision_at_5_diff1 -7.088881730215777
type value
nauc_precision_at_5_max 26.539771712769006
type value
nauc_precision_at_5_std 24.116262291865834
type value
nauc_recall_at_1000_diff1 34.53263588412461
type value
nauc_recall_at_1000_max 63.54157869100173
type value
nauc_recall_at_1000_std 64.19854844792808
type value
nauc_recall_at_100_diff1 22.86564728642275
type value
nauc_recall_at_100_max 40.350507162549825
type value
nauc_recall_at_100_std 29.24492545863015
type value
nauc_recall_at_10_diff1 15.384818367225009
type value
nauc_recall_at_10_max 24.41108571453699
type value
nauc_recall_at_10_std -3.9216160585776323
type value
nauc_recall_at_1_diff1 34.70849461278043
type value
nauc_recall_at_1_max 12.778570893623042
type value
nauc_recall_at_1_std -13.018292652743938
type value
nauc_recall_at_20_diff1 18.122499000084208
type value
nauc_recall_at_20_max 26.63104220179424
type value
nauc_recall_at_20_std 3.969217732521512
type value
nauc_recall_at_3_diff1 21.413050725250116
type value
nauc_recall_at_3_max 16.18894988386887
type value
nauc_recall_at_3_std -15.24884339282375
type value
nauc_recall_at_5_diff1 16.35673072212927
type value
nauc_recall_at_5_max 18.607003829267846
type value
nauc_recall_at_5_std -10.463525876945454
type value
ndcg_at_1 72.923
type value
ndcg_at_10 71.882
type value
ndcg_at_100 77.09899999999999
type value
ndcg_at_1000 77.835
type value
ndcg_at_20 74.497
type value
ndcg_at_3 68.504
type value
ndcg_at_5 69.068
type value
precision_at_1 72.923
type value
precision_at_10 19.936
type value
precision_at_100 2.6310000000000002
type value
precision_at_1000 0.27799999999999997
type value
precision_at_20 11.33
type value
precision_at_3 45.927
type value
precision_at_5 33.131
type value
recall_at_1 37.913000000000004
type value
recall_at_10 78.365
type value
recall_at_100 94.348
type value
recall_at_1000 98.187
type value
recall_at_20 85.229
type value
recall_at_3 61.42999999999999
type value
recall_at_5 69.56700000000001
type
Retrieval
dataset metrics task
config name revision split type
ru MTEB MassiveIntentClassification (ru) 4672e20407010da34463acc759c162ca9734bca6 test mteb/amazon_massive_intent
type value
accuracy 79.11903160726294
type value
f1 76.22609082694545
type value
f1_weighted 77.81461248063566
type value
main_score 79.11903160726294
type
Classification
dataset metrics task
config name revision split type
ru MTEB MassiveScenarioClassification (ru) fad2c6e8459f9e1c45d9315f4953d921437d70f8 test mteb/amazon_massive_scenario
type value
accuracy 88.80632145258912
type value
f1 87.53157475314829
type value
f1_weighted 88.22733432521495
type value
main_score 88.80632145258912
type
Classification
dataset metrics task
config name revision split type
default MTEB RUParaPhraserSTS (default) 43265056790b8f7c59e0139acb4be0a8dad2c8f4 test merionum/ru_paraphraser
type value
cosine_pearson 72.70307124858925
type value
cosine_spearman 78.09439086920204
type value
euclidean_pearson 76.2033672014715
type value
euclidean_spearman 78.09439086920204
type value
main_score 78.09439086920204
type value
manhattan_pearson 76.11750470223116
type value
manhattan_spearman 78.01081063503413
type value
pearson 72.70307124858925
type value
spearman 78.09439086920204
type
STS
dataset metrics task
config name revision split type
default MTEB RiaNewsRetrieval (default) 82374b0bbacda6114f39ff9c5b925fa1512ca5d7 test ai-forever/ria-news-retrieval
type value
main_score 86.819
type value
map_at_1 78.79
type value
map_at_10 84.516
type value
map_at_100 84.68
type value
map_at_1000 84.685
type value
map_at_20 84.624
type value
map_at_3 83.722
type value
map_at_5 84.246
type value
mrr_at_1 78.78
type value
mrr_at_10 84.51815476190441
type value
mrr_at_100 84.68390840473289
type value
mrr_at_1000 84.68947095200002
type value
mrr_at_20 84.62958130822527
type value
mrr_at_3 83.74499999999964
type value
mrr_at_5 84.23849999999955
type value
nauc_map_at_1000_diff1 82.09914867708899
type value
nauc_map_at_1000_max 43.02024854784386
type value
nauc_map_at_1000_std -22.919695880762777
type value
nauc_map_at_100_diff1 82.09705922783733
type value
nauc_map_at_100_max 43.02697379581718
type value
nauc_map_at_100_std -22.90719212899522
type value
nauc_map_at_10_diff1 82.04404594672894
type value
nauc_map_at_10_max 43.06752103182731
type value
nauc_map_at_10_std -23.007870153273576
type value
nauc_map_at_1_diff1 83.89134152210333
type value
nauc_map_at_1_max 38.083626428503415
type value
nauc_map_at_1_std -25.817960401194252
type value
nauc_map_at_20_diff1 82.08534662247806
type value
nauc_map_at_20_max 43.074305042312346
type value
nauc_map_at_20_std -22.91785703613217
type value
nauc_map_at_3_diff1 81.7967508697558
type value
nauc_map_at_3_max 42.90927479098251
type value
nauc_map_at_3_std -24.01312203859392
type value
nauc_map_at_5_diff1 81.90704517505098
type value
nauc_map_at_5_max 43.05204677044616
type value
nauc_map_at_5_std -23.267331507554896
type value
nauc_mrr_at_1000_diff1 82.11902348082472
type value
nauc_mrr_at_1000_max 43.04118936353063
type value
nauc_mrr_at_1000_std -22.858804296830773
type value
nauc_mrr_at_100_diff1 82.11685562002263
type value
nauc_mrr_at_100_max 43.0482537895494
type value
nauc_mrr_at_100_std -22.84431127787993
type value
nauc_mrr_at_10_diff1 82.06909958688058
type value
nauc_mrr_at_10_max 43.07921689466605
type value
nauc_mrr_at_10_std -22.957623576663234
type value
nauc_mrr_at_1_diff1 83.91147637794326
type value
nauc_mrr_at_1_max 37.91917159543152
type value
nauc_mrr_at_1_std -26.141868289283266
type value
nauc_mrr_at_20_diff1 82.10314004731809
type value
nauc_mrr_at_20_max 43.09295406509764
type value
nauc_mrr_at_20_std -22.862091782178787
type value
nauc_mrr_at_3_diff1 81.82117067269036
type value
nauc_mrr_at_3_max 42.94628953323521
type value
nauc_mrr_at_3_std -23.852510312400714
type value
nauc_mrr_at_5_diff1 81.92857441701598
type value
nauc_mrr_at_5_max 43.129719354492934
type value
nauc_mrr_at_5_std -23.145342272624085
type value
nauc_ndcg_at_1000_diff1 81.75015729717991
type value
nauc_ndcg_at_1000_max 44.7266586308995
type value
nauc_ndcg_at_1000_std -20.60663899715267
type value
nauc_ndcg_at_100_diff1 81.6897808298767
type value
nauc_ndcg_at_100_max 44.99492791287099
type value
nauc_ndcg_at_100_std -20.09637266506936
type value
nauc_ndcg_at_10_diff1 81.46290312197337
type value
nauc_ndcg_at_10_max 45.30218378452244
type value
nauc_ndcg_at_10_std -20.70393523891777
type value
nauc_ndcg_at_1_diff1 83.89134152210333
type value
nauc_ndcg_at_1_max 38.083626428503415
type value
nauc_ndcg_at_1_std -25.817960401194252
type value
nauc_ndcg_at_20_diff1 81.61080772657213
type value
nauc_ndcg_at_20_max 45.36571800492172
type value
nauc_ndcg_at_20_std -20.278763852504042
type value
nauc_ndcg_at_3_diff1 80.95965359410461
type value
nauc_ndcg_at_3_max 44.756971949205834
type value
nauc_ndcg_at_3_std -23.07797617717319
type value
nauc_ndcg_at_5_diff1 81.12417712163976
type value
nauc_ndcg_at_5_max 45.15727381406512
type value
nauc_ndcg_at_5_std -21.52861766165519
type value
nauc_precision_at_1000_diff1 76.80566850396093
type value
nauc_precision_at_1000_max 82.45685370922442
type value
nauc_precision_at_1000_std 46.93570976777808
type value
nauc_precision_at_100_diff1 77.21645520953484
type value
nauc_precision_at_100_max 73.43604108309935
type value
nauc_precision_at_100_std 31.978176891671367
type value
nauc_precision_at_10_diff1 77.88251664302092
type value
nauc_precision_at_10_max 60.58112638995018
type value
nauc_precision_at_10_std -3.674424315180332
type value
nauc_precision_at_1_diff1 83.89134152210333
type value
nauc_precision_at_1_max 38.083626428503415
type value
nauc_precision_at_1_std -25.817960401194252
type value
nauc_precision_at_20_diff1 78.16426786697438
type value
nauc_precision_at_20_max 66.0723612699222
type value
nauc_precision_at_20_std 6.121527084555938
type value
nauc_precision_at_3_diff1 77.43122492166451
type value
nauc_precision_at_3_max 52.50727288548085
type value
nauc_precision_at_3_std -19.036076920799427
type value
nauc_precision_at_5_diff1 77.1127254320532
type value
nauc_precision_at_5_max 56.100901899221135
type value
nauc_precision_at_5_std -12.009191140844198
type value
nauc_recall_at_1000_diff1 76.80566850396035
type value
nauc_recall_at_1000_max 82.45685370922577
type value
nauc_recall_at_1000_std 46.93570976777776
type value
nauc_recall_at_100_diff1 77.21645520953459
type value
nauc_recall_at_100_max 73.43604108310011
type value
nauc_recall_at_100_std 31.978176891671993
type value
nauc_recall_at_10_diff1 77.88251664302089
type value
nauc_recall_at_10_max 60.58112638994999
type value
nauc_recall_at_10_std -3.6744243151805427
type value
nauc_recall_at_1_diff1 83.89134152210333
type value
nauc_recall_at_1_max 38.083626428503415
type value
nauc_recall_at_1_std -25.817960401194252
type value
nauc_recall_at_20_diff1 78.16426786697409
type value
nauc_recall_at_20_max 66.07236126992217
type value
nauc_recall_at_20_std 6.121527084555941
type value
nauc_recall_at_3_diff1 77.43122492166454
type value
nauc_recall_at_3_max 52.507272885480816
type value
nauc_recall_at_3_std -19.036076920799776
type value
nauc_recall_at_5_diff1 77.11272543205318
type value
nauc_recall_at_5_max 56.10090189922128
type value
nauc_recall_at_5_std -12.009191140843809
type value
ndcg_at_1 78.79
type value
ndcg_at_10 86.819
type value
ndcg_at_100 87.599
type value
ndcg_at_1000 87.761
type value
ndcg_at_20 87.208
type value
ndcg_at_3 85.222
type value
ndcg_at_5 86.164
type value
precision_at_1 78.79
type value
precision_at_10 9.384
type value
precision_at_100 0.975
type value
precision_at_1000 0.099
type value
precision_at_20 4.769
type value
precision_at_3 29.842999999999996
type value
precision_at_5 18.362000000000002
type value
recall_at_1 78.79
type value
recall_at_10 93.84
type value
recall_at_100 97.45
type value
recall_at_1000 98.76
type value
recall_at_20 95.37
type value
recall_at_3 89.53
type value
recall_at_5 91.81
type
Retrieval
dataset metrics task
config name revision split type
default MTEB RuBQReranking (default) 2e96b8f098fa4b0950fc58eacadeb31c0d0c7fa2 test ai-forever/rubq-reranking
type value
main_score 77.07394404835635
type value
map 77.07394404835635
type value
mrr 82.53144412718882
type value
nAUC_map_diff1 45.29805217456628
type value
nAUC_map_max 34.39894042439188
type value
nAUC_map_std 21.11309674418275
type value
nAUC_mrr_diff1 54.783994737367046
type value
nAUC_mrr_max 45.68526733900048
type value
nAUC_mrr_std 28.22466385500339
type
Reranking
dataset metrics task
config name revision split type
default MTEB RuBQRetrieval (default) e19b6ffa60b3bc248e0b41f4cc37c26a55c2a67b test ai-forever/rubq-retrieval
type value
main_score 72.392
type value
map_at_1 47.370000000000005
type value
map_at_10 65.503
type value
map_at_100 66.38
type value
map_at_1000 66.42099999999999
type value
map_at_20 66.071
type value
map_at_3 61.439
type value
map_at_5 63.922999999999995
type value
mrr_at_1 67.37588652482269
type value
mrr_at_10 76.0066747345116
type value
mrr_at_100 76.25754138969413
type value
mrr_at_1000 76.26968825657428
type value
mrr_at_20 76.17548265904622
type value
mrr_at_3 74.61583924349881
type value
mrr_at_5 75.46690307328608
type value
nauc_map_at_1000_diff1 42.52570720187294
type value
nauc_map_at_1000_max 37.40318318724238
type value
nauc_map_at_1000_std 0.6037788201535506
type value
nauc_map_at_100_diff1 42.493410029691226
type value
nauc_map_at_100_max 37.39802489244377
type value
nauc_map_at_100_std 0.6071359951887154
type value
nauc_map_at_10_diff1 42.09833519659916
type value
nauc_map_at_10_max 37.1184138958874
type value
nauc_map_at_10_std 0.4063543094010351
type value
nauc_map_at_1_diff1 49.56605205141156
type value
nauc_map_at_1_max 26.251096698710384
type value
nauc_map_at_1_std -4.580748485387834
type value
nauc_map_at_20_diff1 42.33372393482018
type value
nauc_map_at_20_max 37.416955604649985
type value
nauc_map_at_20_std 0.6050577802787294
type value
nauc_map_at_3_diff1 42.362234475441845
type value
nauc_map_at_3_max 34.56001379838821
type value
nauc_map_at_3_std -1.507636598929042
type value
nauc_map_at_5_diff1 42.0202264882535
type value
nauc_map_at_5_max 36.64306050200848
type value
nauc_map_at_5_std -0.09509025708798424
type value
nauc_mrr_at_1000_diff1 58.99601742026931
type value
nauc_mrr_at_1000_max 49.61561872452777
type value
nauc_mrr_at_1000_std 2.3956102974352356
type value
nauc_mrr_at_100_diff1 58.9865943101085
type value
nauc_mrr_at_100_max 49.6248111507265
type value
nauc_mrr_at_100_std 2.411155095066369
type value
nauc_mrr_at_10_diff1 58.81758131092919
type value
nauc_mrr_at_10_max 49.780365572616695
type value
nauc_mrr_at_10_std 2.7068696565195944
type value
nauc_mrr_at_1_diff1 61.67036882487055
type value
nauc_mrr_at_1_max 45.455271042821714
type value
nauc_mrr_at_1_std -0.9370526815458349
type value
nauc_mrr_at_20_diff1 58.93674818203478
type value
nauc_mrr_at_20_max 49.703218108625215
type value
nauc_mrr_at_20_std 2.4473106598190415
type value
nauc_mrr_at_3_diff1 59.046856598788445
type value
nauc_mrr_at_3_max 49.37161726123392
type value
nauc_mrr_at_3_std 1.5110936686701506
type value
nauc_mrr_at_5_diff1 58.92289378915668
type value
nauc_mrr_at_5_max 49.847638994134144
type value
nauc_mrr_at_5_std 2.420421880131702
type value
nauc_ndcg_at_1000_diff1 45.56062215161734
type value
nauc_ndcg_at_1000_max 41.507152286702
type value
nauc_ndcg_at_1000_std 2.79388283208751
type value
nauc_ndcg_at_100_diff1 44.84064192570408
type value
nauc_ndcg_at_100_max 41.50353573562353
type value
nauc_ndcg_at_100_std 3.1804999773629357
type value
nauc_ndcg_at_10_diff1 43.341482144213614
type value
nauc_ndcg_at_10_max 41.159590898395074
type value
nauc_ndcg_at_10_std 2.945242338240843
type value
nauc_ndcg_at_1_diff1 62.23623985611396
type value
nauc_ndcg_at_1_max 45.04945770947091
type value
nauc_ndcg_at_1_std -0.8804967656575725
type value
nauc_ndcg_at_20_diff1 43.905372612093664
type value
nauc_ndcg_at_20_max 41.797709837872446
type value
nauc_ndcg_at_20_std 3.1853356915569653
type value
nauc_ndcg_at_3_diff1 44.18163998834299
type value
nauc_ndcg_at_3_max 38.352891017864636
type value
nauc_ndcg_at_3_std -0.8235767021150929
type value
nauc_ndcg_at_5_diff1 43.41374688421302
type value
nauc_ndcg_at_5_max 40.390365601593956
type value
nauc_ndcg_at_5_std 1.6743650108127537
type value
nauc_precision_at_1000_diff1 -9.711058370691381
type value
nauc_precision_at_1000_max 6.97321343449286
type value
nauc_precision_at_1000_std 7.933531916622121
type value
nauc_precision_at_100_diff1 -8.247029644152319
type value
nauc_precision_at_100_max 10.86740140944616
type value
nauc_precision_at_100_std 9.581885544675918
type value
nauc_precision_at_10_diff1 -2.409043695429943
type value
nauc_precision_at_10_max 21.04733206074314
type value
nauc_precision_at_10_std 10.03334651647101
type value
nauc_precision_at_1_diff1 62.23623985611396
type value
nauc_precision_at_1_max 45.04945770947091
type value
nauc_precision_at_1_std -0.8804967656575725
type value
nauc_precision_at_20_diff1 -5.230303656931621
type value
nauc_precision_at_20_max 17.77799716919181
type value
nauc_precision_at_20_std 10.739127998618654
type value
nauc_precision_at_3_diff1 10.40376424999862
type value
nauc_precision_at_3_max 30.933333400254035
type value
nauc_precision_at_3_std 6.126209127968004
type value
nauc_precision_at_5_diff1 3.147398101830739
type value
nauc_precision_at_5_max 27.1746309955971
type value
nauc_precision_at_5_std 8.874723615388788
type value
nauc_recall_at_1000_diff1 5.055940692380908
type value
nauc_recall_at_1000_max 22.42031123370267
type value
nauc_recall_at_1000_std 27.75476692527869
type value
nauc_recall_at_100_diff1 17.86391178198642
type value
nauc_recall_at_100_max 34.776134863678955
type value
nauc_recall_at_100_std 18.96377158778504
type value
nauc_recall_at_10_diff1 24.863097695413597
type value
nauc_recall_at_10_max 37.697411651507444
type value
nauc_recall_at_10_std 9.519849994253967
type value
nauc_recall_at_1_diff1 49.56605205141156
type value
nauc_recall_at_1_max 26.251096698710384
type value
nauc_recall_at_1_std -4.580748485387834
type value
nauc_recall_at_20_diff1 22.440602811005636
type value
nauc_recall_at_20_max 39.538861316515
type value
nauc_recall_at_20_std 11.363269553121468
type value
nauc_recall_at_3_diff1 32.80302839873736
type value
nauc_recall_at_3_max 32.53105685012729
type value
nauc_recall_at_3_std -0.7140166410605693
type value
nauc_recall_at_5_diff1 29.375386639154865
type value
nauc_recall_at_5_max 36.91045781164083
type value
nauc_recall_at_5_std 4.725419050262578
type value
ndcg_at_1 67.13900000000001
type value
ndcg_at_10 72.392
type value
ndcg_at_100 75.25800000000001
type value
ndcg_at_1000 75.982
type value
ndcg_at_20 73.783
type value
ndcg_at_3 67.269
type value
ndcg_at_5 69.807
type value
precision_at_1 67.13900000000001
type value
precision_at_10 13.327
type value
precision_at_100 1.5559999999999998
type value
precision_at_1000 0.164
type value
precision_at_20 7.119000000000001
type value
precision_at_3 35.599
type value
precision_at_5 23.936
type value
recall_at_1 47.370000000000005
type value
recall_at_10 82.16
type value
recall_at_100 93.34
type value
recall_at_1000 98.202
type value
recall_at_20 86.687
type value
recall_at_3 69.319
type value
recall_at_5 75.637
type
Retrieval
dataset metrics task
config name revision split type
default MTEB RuReviewsClassification (default) f6d2c31f4dc6b88f468552750bfec05b4b41b05a test ai-forever/ru-reviews-classification
type value
accuracy 75.0537109375
type value
f1 74.00523205209554
type value
f1_weighted 74.00436782840376
type value
main_score 75.0537109375
type
Classification
dataset metrics task
config name revision split type
default MTEB RuSTSBenchmarkSTS (default) 7cf24f325c6da6195df55bef3d86b5e0616f3018 test ai-forever/ru-stsbenchmark-sts
type value
cosine_pearson 81.10255413476487
type value
cosine_spearman 81.40020843157141
type value
euclidean_pearson 81.25155479902466
type value
euclidean_spearman 81.40020831064922
type value
main_score 81.40020843157141
type value
manhattan_pearson 81.1493715249014
type value
manhattan_spearman 81.30973667941649
type value
pearson 81.10255413476487
type value
spearman 81.40020843157141
type
STS
dataset metrics task
config name revision split type
default MTEB RuSciBenchGRNTIClassification (default) 673a610d6d3dd91a547a0d57ae1b56f37ebbf6a1 test ai-forever/ru-scibench-grnti-classification
type value
accuracy 69.8974609375
type value
f1 68.57837564785511
type value
f1_weighted 68.59030489460784
type value
main_score 69.8974609375
type
Classification
dataset metrics task
config name revision split type
default MTEB RuSciBenchGRNTIClusteringP2P (default) 673a610d6d3dd91a547a0d57ae1b56f37ebbf6a1 test ai-forever/ru-scibench-grnti-classification
type value
main_score 67.03880348548029
type value
v_measure 67.03880348548029
type value
v_measure_std 0.6126278133139618
type
Clustering
dataset metrics task
config name revision split type
default MTEB RuSciBenchOECDClassification (default) 26c88e99dcaba32bb45d0e1bfc21902337f6d471 test ai-forever/ru-scibench-oecd-classification
type value
accuracy 54.63378906250001
type value
f1 51.34306420274629
type value
f1_weighted 51.33495867493914
type value
main_score 54.63378906250001
type
Classification
dataset metrics task
config name revision split type
default MTEB RuSciBenchOECDClusteringP2P (default) 26c88e99dcaba32bb45d0e1bfc21902337f6d471 test ai-forever/ru-scibench-oecd-classification
type value
main_score 56.55947121159027
type value
v_measure 56.55947121159027
type value
v_measure_std 0.5498882006880662
type
Clustering
dataset metrics task
config name revision split type
ru MTEB STS22 (ru) de9d86b3b84231dc21f76c7b7af1f28e2f57f6e3 test mteb/sts22-crosslingual-sts
type value
cosine_pearson 61.833294921667914
type value
cosine_spearman 63.53967536726357
type value
euclidean_pearson 60.382865218855805
type value
euclidean_spearman 63.53967536726357
type value
main_score 63.53967536726357
type value
manhattan_pearson 60.24879015304578
type value
manhattan_spearman 63.42305760430092
type value
pearson 61.833294921667914
type value
spearman 63.53967536726357
type
STS
dataset metrics task
config name revision split type
default MTEB SensitiveTopicsClassification (default) 416b34a802308eac30e4192afc0ff99bb8dcc7f2 test ai-forever/sensitive-topics-classification
type value
accuracy 39.8193359375
type value
f1 55.46591740935434
type value
lrap 66.50980631510454
type value
main_score 39.8193359375
type
MultilabelClassification
dataset metrics task
config name revision split type
default MTEB TERRa (default) 7b58f24536063837d644aab9a023c62199b2a612 dev ai-forever/terra-pairclassification
type value
cosine_accuracy 66.77524429967427
type value
cosine_accuracy_threshold 55.58975338935852
type value
cosine_ap 66.4567219323658
type value
cosine_f1 70.64676616915423
type value
cosine_f1_threshold 45.55969536304474
type value
cosine_precision 57.028112449799195
type value
cosine_recall 92.81045751633987
type value
dot_accuracy 66.77524429967427
type value
dot_accuracy_threshold 55.589759349823
type value
dot_ap 66.4567219323658
type value
dot_f1 70.64676616915423
type value
dot_f1_threshold 45.55969536304474
type value
dot_precision 57.028112449799195
type value
dot_recall 92.81045751633987
type value
euclidean_accuracy 66.77524429967427
type value
euclidean_accuracy_threshold 94.24455165863037
type value
euclidean_ap 66.4567219323658
type value
euclidean_f1 70.64676616915423
type value
euclidean_f1_threshold 104.34587001800537
type value
euclidean_precision 57.028112449799195
type value
euclidean_recall 92.81045751633987
type value
main_score 66.4567219323658
type value
manhattan_accuracy 66.77524429967427
type value
manhattan_accuracy_threshold 2865.5345916748047
type value
manhattan_ap 66.26659863769075
type value
manhattan_f1 70.8542713567839
type value
manhattan_f1_threshold 3212.3912811279297
type value
manhattan_precision 57.55102040816327
type value
manhattan_recall 92.15686274509804
type value
max_accuracy 66.77524429967427
type value
max_ap 66.4567219323658
type value
max_f1 70.8542713567839
type value
max_precision 57.55102040816327
type value
max_recall 92.81045751633987
type value
similarity_accuracy 66.77524429967427
type value
similarity_accuracy_threshold 55.58975338935852
type value
similarity_ap 66.4567219323658
type value
similarity_f1 70.64676616915423
type value
similarity_f1_threshold 45.55969536304474
type value
similarity_precision 57.028112449799195
type value
similarity_recall 92.81045751633987
type
PairClassification
mit
ru
en
mteb
transformers
sentence-transformers
ai-forever/FRED-T5-1.7B feature-extraction
ai-forever/solyanka

Model Card for FRIDA

FRIDA is a full-scale finetuned general text embedding model inspired by denoising architecture based on T5. The model is based on the encoder part of FRED-T5 model and continues research of text embedding models (ruMTEB, ru-en-RoSBERTa). It has been pre-trained on a Russian-English dataset and fine-tuned for improved performance on the target task.

For more model details please refer to our article (RU).

Usage

The model can be used as is with prefixes. It is recommended to use CLS pooling. The choice of prefix and pooling depends on the task.

We use the following basic rules to choose a prefix:

  • "search_query: " and "search_document: " prefixes are for answer or relevant paragraph retrieval
  • "paraphrase: " prefix is for symmetric paraphrasing related tasks (STS, paraphrase mining, deduplication)
  • "categorize: " prefix is for asymmetric matching of document title and body (e.g. news, scientific papers, social posts)
  • "categorize_sentiment: " prefix is for any tasks that rely on sentiment features (e.g. hate, toxic, emotion)
  • "categorize_topic: " prefix is intended for tasks where you need to group texts by topic
  • "categorize_entailment: " prefix is for textual entailment task (NLI)

To better tailor the model to your needs, you can fine-tune it with relevant high-quality Russian and English datasets.

Below are examples of texts encoding using the Transformers and SentenceTransformers libraries.

Transformers

import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, T5EncoderModel


def pool(hidden_state, mask, pooling_method="cls"):
    if pooling_method == "mean":
        s = torch.sum(hidden_state * mask.unsqueeze(-1).float(), dim=1)
        d = mask.sum(axis=1, keepdim=True).float()
        return s / d
    elif pooling_method == "cls":
        return hidden_state[:, 0]

inputs = [
    # 
    "paraphrase: В Ярославской области разрешили работу бань, но без посетителей",
    "categorize_entailment: Женщину доставили в больницу, за ее жизнь сейчас борются врачи.",
    "search_query: Сколько программистов нужно, чтобы вкрутить лампочку?",
    # 
    "paraphrase: Ярославским баням разрешили работать без посетителей",
    "categorize_entailment: Женщину спасают врачи.",
    "search_document: Чтобы вкрутить лампочку, требуется три программиста: один напишет программу извлечения лампочки, другой — вкручивания лампочки, а третий проведет тестирование."
]

tokenizer = AutoTokenizer.from_pretrained("ai-forever/FRIDA")
model = T5EncoderModel.from_pretrained("ai-forever/FRIDA")

tokenized_inputs = tokenizer(inputs, max_length=512, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    outputs = model(**tokenized_inputs)
    
embeddings = pool(
    outputs.last_hidden_state, 
    tokenized_inputs["attention_mask"],
    pooling_method="cls" # or try "mean"
)

embeddings = F.normalize(embeddings, p=2, dim=1)
sim_scores = embeddings[:3] @ embeddings[3:].T
print(sim_scores.diag().tolist())
# [0.9360030293464661, 0.8591322302818298, 0.728583037853241]

SentenceTransformers

from sentence_transformers import SentenceTransformer

inputs = [
    # 
    "paraphrase: В Ярославской области разрешили работу бань, но без посетителей",
    "categorize_entailment: Женщину доставили в больницу, за ее жизнь сейчас борются врачи.",
    "search_query: Сколько программистов нужно, чтобы вкрутить лампочку?",
    # 
    "paraphrase: Ярославским баням разрешили работать без посетителей",
    "categorize_entailment: Женщину спасают врачи.",
    "search_document: Чтобы вкрутить лампочку, требуется три программиста: один напишет программу извлечения лампочки, другой — вкручивания лампочки, а третий проведет тестирование."
]

# loads model with CLS pooling
model = SentenceTransformer("ai-forever/FRIDA")

# embeddings are normalized by default
embeddings = model.encode(inputs, convert_to_tensor=True)

sim_scores = embeddings[:3] @ embeddings[3:].T
print(sim_scores.diag().tolist())
# [0.9360026717185974, 0.8591331243515015, 0.7285830974578857]

or using prompts (sentence-transformers>=2.4.0):

from sentence_transformers import SentenceTransformer

# loads model with CLS pooling
model = SentenceTransformer("ai-forever/FRIDA")

paraphrase = model.encode(["В Ярославской области разрешили работу бань, но без посетителей", "Ярославским баням разрешили работать без посетителей"], prompt_name="paraphrase")
print(paraphrase[0] @ paraphrase[1].T) # 0.9360032

categorize_entailment = model.encode(["Женщину доставили в больницу, за ее жизнь сейчас борются врачи.", "Женщину спасают врачи."], prompt_name="categorize_entailment")
print(categorize_entailment[0] @ categorize_entailment[1].T) # 0.8591322

query_embedding = model.encode("Сколько программистов нужно, чтобы вкрутить лампочку?", prompt_name="search_query")
document_embedding = model.encode("Чтобы вкрутить лампочку, требуется три программиста: один напишет программу извлечения лампочки, другой — вкручивания лампочки, а третий проведет тестирование.", prompt_name="search_document")
print(query_embedding @ document_embedding.T) # 0.7285831

Authors

Citation

@misc{TODO
}

Limitations

The model is designed to process texts in Russian, the quality in English is unknown. Maximum input text length is limited to 512 tokens.