
---
license: mit
language:
- en
library_name: transformers
tags:
- text-generation
- information-retrieval
- ranking
- reranking
- blockrank
- mistral
base_model: mistralai/Mistral-7B-Instruct-v0.3
datasets:
- quicktensor/blockrank-msmarco-train-10p
metrics:
- ndcg
- mrr
---

# BlockRank-Mistral-7B: Scalable In-context Ranking with Generative Models


BlockRank-Mistral-7B is a fine-tuned version of Mistral-7B-Instruct-v0.3 optimized for efficient in-context document ranking. It implements BlockRank, a method that makes LLMs efficient and scalable for ranking by aligning their internal attention mechanisms with the structure of the ranking task.
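
Below is a minimal usage sketch built only on standard transformers APIs. The prompt layout, digit document identifiers, and last-token logit scoring are illustrative assumptions rather than the authors' documented recipe, and BlockRank's efficient attention-based scoring path may require the authors' own inference code rather than vanilla generation.

```python
# Hedged usage sketch: rank candidate passages by the last-token logits
# over digit identifiers. Prompt format and identifier scheme are
# assumptions, not the authors' documented recipe.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "quicktensor/blockrank-msmarco-mistral-7b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"  # needs `accelerate`
)

query = "what is the capital of france"
docs = [
    "Paris is the capital and largest city of France.",
    "The Eiffel Tower was completed in 1889.",
]

# In-context ranking prompt: identified documents first, then the query.
prompt = "".join(f"[{i}] {d}\n" for i, d in enumerate(docs))
prompt += f"Query: {query}\nMost relevant document: ["

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    next_logits = model(**inputs).logits[0, -1]  # next-token distribution

# Score each document by the logit of its identifier digit
# (single digits are individual tokens in the Mistral vocabulary).
scores = {
    i: next_logits[tokenizer.convert_tokens_to_ids(str(i))].item()
    for i in range(len(docs))
}
print(sorted(scores, key=scores.get, reverse=True))  # e.g. [0, 1]
```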

*Figure: BlockRank architecture.*

## Key Features

- **Linear Complexity Attention:** structured sparse attention reduces complexity from O(n²) to O(n) in the number of documents (see the mask sketch after this list)
- **2-4× Faster Inference:** attention-based scoring eliminates autoregressive decoding (also sketched below)
- **Auxiliary Contrastive Loss:** a mid-layer contrastive objective sharpens relevance signals (a hedged sketch follows the mask example)
- **Strong Zero-shot Generalization:** state-of-the-art performance on BEIR benchmarks
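
The first two features are easiest to see in a toy mask. The sketch below assumes a context laid out as [instruction | doc blocks | query], with `True` marking allowed attention; block boundaries, the choice of retrieval layer, and the scoring rule are illustrative assumptions, not the paper's exact construction.

```python
# Illustrative block-sparse mask and attention-based scoring, under the
# assumptions stated above. Not the authors' implementation.
import torch

def blockrank_mask(instr_len: int, doc_lens: list[int], query_len: int) -> torch.Tensor:
    """Boolean (n, n) mask; True = attention allowed."""
    n = instr_len + sum(doc_lens) + query_len
    allowed = torch.zeros(n, n, dtype=torch.bool)

    # Shared instruction prefix: plain causal attention.
    allowed[:instr_len, :instr_len] = torch.ones(instr_len, instr_len).tril().bool()

    offset = instr_len
    for d in doc_lens:
        # Each document block sees the instruction and itself (causally),
        # but not other documents, so cost is linear in the number of docs.
        allowed[offset:offset + d, :instr_len] = True
        allowed[offset:offset + d, offset:offset + d] = torch.ones(d, d).tril().bool()
        offset += d

    # Query tokens attend causally over the entire context.
    allowed[offset:, :] = torch.ones(query_len, n).tril(diagonal=offset).bool()
    return allowed

def score_docs(attn_probs: torch.Tensor, instr_len: int, doc_lens: list[int]) -> list[float]:
    """Rank documents by the attention mass that query tokens place on each
    block (attn_probs: (query_len, n) rows from a chosen retrieval layer),
    so no autoregressive decoding is needed to produce scores."""
    scores, offset = [], instr_len
    for d in doc_lens:
        scores.append(attn_probs[:, offset:offset + d].sum().item())
        offset += d
    return scores

print(blockrank_mask(instr_len=2, doc_lens=[3, 3], query_len=2).int())
```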

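The auxiliary objective can be illustrated with a standard InfoNCE-style loss over pooled mid-layer states. The pooling scheme, layer index, and temperature below are assumptions for illustration, not the paper's exact choices.

```python
# Hedged sketch of a mid-layer auxiliary contrastive loss: pull a pooled
# query representation toward the gold document's representation taken
# from some intermediate layer. All hyperparameters are illustrative.
import torch
import torch.nn.functional as F

def mid_layer_contrastive_loss(query_vec, doc_vecs, gold_idx, temperature=0.05):
    """query_vec: (d,); doc_vecs: (num_docs, d); gold_idx: index of the
    relevant document. Standard InfoNCE over cosine similarities."""
    q = F.normalize(query_vec, dim=-1)
    d = F.normalize(doc_vecs, dim=-1)
    logits = (d @ q) / temperature          # (num_docs,) similarity scores
    target = torch.tensor([gold_idx])
    return F.cross_entropy(logits.unsqueeze(0), target)

# Toy call with random activations standing in for mid-layer states.
print(mid_layer_contrastive_loss(torch.randn(64), torch.randn(10, 64), gold_idx=3).item())
```
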
## Citation

If you use this model, please cite:

```bibtex
@article{gupta2025blockrank,
  title={Scalable In-context Ranking with Generative Models},
  author={Gupta, Nilesh and You, Chong and Bhojanapalli, Srinadh and Kumar, Sanjiv and Dhillon, Inderjit and Yu, Felix},
  journal={arXiv preprint arXiv:2510.05396},
  year={2025}
}
```

## Model Card Contact

For questions or issues, please open an issue on GitHub.

## Additional Resources

- Paper: [arXiv:2510.05396](https://arxiv.org/abs/2510.05396)
- Base model: mistralai/Mistral-7B-Instruct-v0.3
- Training data: quicktensor/blockrank-msmarco-train-10p

## License

This model is released under the MIT License. See LICENSE for details.
