82 lines
2.8 KiB
Markdown
82 lines
2.8 KiB
Markdown
|
|
---
|
|||
|
|
license: mit
|
|||
|
|
language:
|
|||
|
|
- en
|
|||
|
|
tags:
|
|||
|
|
- text-rewriting
|
|||
|
|
- web
|
|||
|
|
- generative-engine-optimization
|
|||
|
|
- geo
|
|||
|
|
- reinforcement-learning
|
|||
|
|
- grpo
|
|||
|
|
- qwen3
|
|||
|
|
- transformers
|
|||
|
|
- safetensors
|
|||
|
|
library_name: transformers
|
|||
|
|
pipeline_tag: text-generation
|
|||
|
|
base_model: Qwen/Qwen3-1.7B
|
|||
|
|
datasets:
|
|||
|
|
- cx-cmu/Researchy-GEO
|
|||
|
|
---
|
|||
|
|
|
|||
|
|
# AutoGEO<sub>Mini</sub> (Qwen1.7B, Researchy-GEO)
|
|||
|
|
|
|||
|
|
AutoGEO<sub>Mini</sub> (Qwen1.7B, Researchy-GEO) is a GEO model designed to improve how web document is incorporated into answers generated by **LLM-based generative engines**.
|
|||
|
|
|
|||
|
|
The model rewrites a given document to better match the preferences of generative engines (e.g., GPT, Gemini, Claude), with the goal of increasing the document’s **visibility and coverage** in generated responses, while **preserving the original meaning and factual content**.
|
|||
|
|
|
|||
|
|
⚠️ This model is trained for the generative engine powered by `gemini-2.5-flash-lite` on dataset `Researchy-GEO`. If you intend to use AutoGEO<sub>Mini</sub> with other types of generative engines or datasets, you must post-train `Qwen/Qwen3-1.7B` using [our code](https://github.com/cxcscmu/AutoGEO).
|
|||
|
|
|
|||
|
|
This model is part of the **AutoGEO** framework proposed in the paper
|
|||
|
|
|
|||
|
|
📄 **Paper:** ["What Generative Search Engines Like and How to Optimize Web Content Cooperatively"](https://arxiv.org/abs/2510.11438)
|
|||
|
|
👥 **Authors:** Yujiang Wu*, Shanshan Zhong*, Yubin Kim, Chenyan Xiong (*Equal contribution)
|
|||
|
|
🚀 **Code:** [AutoGEO on GitHub](https://github.com/cxcscmu/AutoGEO)
|
|||
|
|
|
|||
|
|
## Usage
|
|||
|
|
|
|||
|
|
This model is designed to be used through the [**AutoGEO framework**](https://github.com/cxcscmu/AutoGEO). Try it out in [huggingface Space](https://huggingface.co/spaces/cx-cmu/AutoGEO_Mini) or
|
|||
|
|
|
|||
|
|
Quick starts:
|
|||
|
|
|
|||
|
|
```python
|
|||
|
|
from autogeo.rewriters import rewrite_document
|
|||
|
|
|
|||
|
|
rewritten_text = rewrite_document(
|
|||
|
|
document="Input text.",
|
|||
|
|
dataset="Researchy-GEO",
|
|||
|
|
engine_llm="gemini",
|
|||
|
|
model_path="cx-cmu/AutoGEO_mini_Qwen1.7B_ResearchyGEO",
|
|||
|
|
)
|
|||
|
|
```
|
|||
|
|
|
|||
|
|
Evaluation:
|
|||
|
|
|
|||
|
|
```bash
|
|||
|
|
python -m autogeo.evaluate \
|
|||
|
|
--model autogeo_mini \
|
|||
|
|
--model_path cx-cmu/AutoGEO_mini_Qwen1.7B_ResearchyGEO \
|
|||
|
|
--dataset Researchy-GEO
|
|||
|
|
```
|
|||
|
|
|
|||
|
|
|
|||
|
|
|
|||
|
|
## Related Resources
|
|||
|
|
|
|||
|
|
* **Paper:** [https://arxiv.org/abs/2510.11438](https://arxiv.org/abs/2510.11438)
|
|||
|
|
* **Code:** [https://github.com/cxcscmu/AutoGEO](https://github.com/cxcscmu/AutoGEO)
|
|||
|
|
* **Demo:** [https://huggingface.co/spaces/cx-cmu/AutoGEO_Mini](https://huggingface.co/spaces/cx-cmu/AutoGEO_Mini)
|
|||
|
|
* **Dataset:** [https://huggingface.co/datasets/cx-cmu/Researchy-GEO](https://huggingface.co/datasets/cx-cmu/Researchy-GEO)
|
|||
|
|
|
|||
|
|
## Citation
|
|||
|
|
|
|||
|
|
If you use this model, please cite:
|
|||
|
|
|
|||
|
|
```bibtex
|
|||
|
|
@article{wu2025generative,
|
|||
|
|
title={What Generative Search Engines Like and How to Optimize Web Content Cooperatively},
|
|||
|
|
author={Wu, Yujiang and Zhong, Shanshan and Kim, Yubin and Xiong, Chenyan},
|
|||
|
|
journal={arXiv preprint arXiv:2510.11438},
|
|||
|
|
year={2025}
|
|||
|
|
}
|
|||
|
|
```
|