76d3bebabb2166b410062e5583bb1ff70190c130
Model: cross-encoder/quora-roberta-large Source: Original Platform
license, datasets, language, base_model, pipeline_tag, library_name, tags
| license | datasets | language | base_model | pipeline_tag | library_name | tags | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| apache-2.0 |
|
|
|
text-ranking | sentence-transformers |
|
Cross-Encoder for Quora Duplicate Questions Detection
This model was trained using SentenceTransformers Cross-Encoder class.
Training Data
This model was trained on the Quora Duplicate Questions dataset. The model will predict a score between 0 and 1 how likely the two given questions are duplicates.
Note: The model is not suitable to estimate the similarity of questions, e.g. the two questions "How to learn Java" and "How to learn Python" will result in a rather low score, as these are not duplicates.
Usage and Performance
Pre-trained models can be used like this:
from sentence_transformers import CrossEncoder
model = CrossEncoder('cross-encoder/quora-roberta-large')
scores = model.predict([('Question 1', 'Question 2'), ('Question 3', 'Question 4')])
You can use this model also without sentence_transformers and by just using Transformers AutoModel class
Description
Languages
CSV
100%