初始化项目,由ModelHub XC社区提供模型

Model: cross-encoder/stsb-TinyBERT-L4
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-13 16:48:25 +08:00
commit 8884cc30ff
24 changed files with 74131 additions and 0 deletions

10
.gitattributes vendored Normal file
View File

@@ -0,0 +1,10 @@
*.bin.* filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tar.gz filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
model.safetensors filter=lfs diff=lfs merge=lfs -text

View File

@@ -0,0 +1,7 @@
epoch,steps,Pearson_Correlation,Spearman_Correlation
0,-1,0.8545983323224908,0.8468392535285587
1,-1,0.8571942532186558,0.8520208839729133
2,-1,0.8613338704883177,0.8538568557640089
3,-1,0.8616480884450483,0.8550005160746958
4,-1,0.8605050883898332,0.8543731967733662
5,-1,0.8612005934368199,0.8541962412946662
1 epoch steps Pearson_Correlation Spearman_Correlation
2 0 -1 0.8545983323224908 0.8468392535285587
3 1 -1 0.8571942532186558 0.8520208839729133
4 2 -1 0.8613338704883177 0.8538568557640089
5 3 -1 0.8616480884450483 0.8550005160746958
6 4 -1 0.8605050883898332 0.8543731967733662
7 5 -1 0.8612005934368199 0.8541962412946662

31
README.md Normal file
View File

@@ -0,0 +1,31 @@
---
license: apache-2.0
datasets:
- sentence-transformers/stsb
language:
- en
pipeline_tag: text-ranking
library_name: sentence-transformers
tags:
- transformers
---
# Cross-Encoder for Semantic Textual Similarity
This model was trained using [SentenceTransformers](https://sbert.net) [Cross-Encoder](https://www.sbert.net/examples/applications/cross-encoder/README.html) class.
## Training Data
This model was trained on the [STS benchmark dataset](http://ixa2.si.ehu.eus/stswiki/index.php/STSbenchmark). The model will predict a score between 0 and 1 how for the semantic similarity of two sentences.
## Usage and Performance
Pre-trained models can be used like this:
```python
from sentence_transformers import CrossEncoder
model = CrossEncoder('cross-encoder/stsb-TinyBERT-L4')
scores = model.predict([('Sentence 1', 'Sentence 2'), ('Sentence 3', 'Sentence 4')])
```
The model will predict scores for the pairs `('Sentence 1', 'Sentence 2')` and `('Sentence 3', 'Sentence 4')`.
You can use this model also without sentence_transformers and by just using Transformers ``AutoModel`` class

28
config.json Normal file
View File

@@ -0,0 +1,28 @@
{
"_name_or_path": "output/TinyBERT_L-4-nli/",
"architectures": [
"BertForSequenceClassification"
],
"attention_probs_dropout_prob": 0.1,
"gradient_checkpointing": false,
"hidden_act": "gelu",
"hidden_dropout_prob": 0.1,
"hidden_size": 312,
"id2label": {
"0": "LABEL_0"
},
"initializer_range": 0.02,
"intermediate_size": 1200,
"label2id": {
"LABEL_0": 0
},
"layer_norm_eps": 1e-12,
"max_position_embeddings": 512,
"model_type": "bert",
"num_attention_heads": 12,
"num_hidden_layers": 4,
"pad_token_id": 0,
"position_embedding_type": "absolute",
"type_vocab_size": 2,
"vocab_size": 30522
}

3
flax_model.msgpack Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ec046a6abe4d2302f8ac8a06ad8d90957d1e6e937411753d7a865860bc696c9c
size 57404914

3
model.safetensors Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4fa70fae4092a08c5d1dcffc720318e5cbd4f8208b2213ebdb2d9525f89aabf5
size 57414738

3
onnx/model.onnx Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8a083b28b32cca196730541bb3d0874c73cb11ec6e8a192273d088e1fbac2388
size 57510828

3
onnx/model_O1.onnx Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a1300e70a3510de85dbcdfeda810b99379f67f173c3c813bbbfe0b74fa0b62a6
size 57478044

3
onnx/model_O2.onnx Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9d73135beeca73f916a4db31da46e3d551b496480b2173ac71819d7f2e1491b0
size 57420067

3
onnx/model_O3.onnx Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4e99a59ad7b0dfcd597deba6beb74c97195ab3b0e45b6faf59c972ca3e95f931
size 57420025

3
onnx/model_O4.onnx Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4b6ce4d7445228291d6de39e27c824b9571141694624c444ed8b6f196d38d264
size 28764362

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:75eb42eceab4bded3be613c125803863f37312dd1740180b0b7db4c684454764
size 14656536

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:75eb42eceab4bded3be613c125803863f37312dd1740180b0b7db4c684454764
size 14656536

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:75eb42eceab4bded3be613c125803863f37312dd1740180b0b7db4c684454764
size 14656536

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ea95077b3644ca53770b83af890a7758311270809c2832c8b47146454847a506
size 14656535

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1d1ce9851caab609548ab56661d46b1ed873bbf1abf2ebb07b17438e98ebd344
size 57406500

4753
openvino/openvino_model.xml Normal file

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8d825a0ca34c76d4e8617c5f5d9dc169baa11074c46e9a7ee43acf38d9aadfa0
size 14612160

File diff suppressed because it is too large Load Diff

3
pytorch_model.bin Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c81276416deab713ab841b7d4afa2484270d5d23d735d1baa27a7bc1501080e2
size 57436041

7
special_tokens_map.json Normal file
View File

@@ -0,0 +1,7 @@
{
"cls_token": "[CLS]",
"mask_token": "[MASK]",
"pad_token": "[PAD]",
"sep_token": "[SEP]",
"unk_token": "[UNK]"
}

30672
tokenizer.json Normal file

File diff suppressed because it is too large Load Diff

58
tokenizer_config.json Normal file
View File

@@ -0,0 +1,58 @@
{
"added_tokens_decoder": {
"0": {
"content": "[PAD]",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"100": {
"content": "[UNK]",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"101": {
"content": "[CLS]",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"102": {
"content": "[SEP]",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"103": {
"content": "[MASK]",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
}
},
"clean_up_tokenization_spaces": true,
"cls_token": "[CLS]",
"do_basic_tokenize": true,
"do_lower_case": true,
"extra_special_tokens": {},
"mask_token": "[MASK]",
"model_max_length": 512,
"never_split": null,
"pad_token": "[PAD]",
"sep_token": "[SEP]",
"strip_accents": null,
"tokenize_chinese_chars": true,
"tokenizer_class": "BertTokenizer",
"unk_token": "[UNK]"
}

30522
vocab.txt Normal file

File diff suppressed because it is too large Load Diff