license, datasets, language, metrics, base_model, pipeline_tag, tags
license datasets language metrics base_model pipeline_tag tags
apache-2.0
RUC-NLPIR/FlashRAG_datasets
en
f1
recall
Qwen/Qwen2.5-7B-Instruct
question-answering
ambiguity
agent
reinforcement-learning
  • This repository contains the RL-trained model accompanying our paper, A^2Search: Ambiguity-Aware Question Answering with Reinforcement Learning. More details are available at https://github.com/zfj1998/A2Search
Description
Model synced from source: zfj1998/A2Search-7B-Instruct
Readme 4.2 MiB