初始化项目，由ModelHub XC社区提供模型

Model: giants2026/GIANTS-4B Source: Original Platform
2026-06-06 15:07:49 +08:00
commit b167a86046
14 changed files with 152365 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,63 @@
+---
+license: cc-by-4.0
+library_name: transformers
+base_model: Qwen/Qwen3-4B
+tags:
+  - insight-anticipation
+  - scientific-literature
+---
+
+# GIANTS-4B
+
+GIANTS-4B is a language model for **insight anticipation** from scientific literature, introduced in the paper:
+
+> **GIANTS: Generative Insight Anticipation from Scientific Literature**
+
+Given summaries of two parent papers, GIANTS-4B generates the key insight of a downstream paper that builds on both parent papers. It is fine-tuned from [Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B).
+
+## Training Data
+
+This model was trained on [GiantsBench-train](https://huggingface.co/datasets/giants2026/GiantsBench-train).
+
+## Usage
+
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+from datasets import load_dataset
+
+# Load model and tokenizer
+model_name = "giants2026/GIANTS-4B"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")
+
+# Load a sample prompt from the GiantsBench test set
+dataset = load_dataset("giants2026/GiantsBench-test", split="train")
+query = dataset[0]["query"]
+
+# Format as chat messages
+messages = [{"role": "user", "content": query}]
+text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+inputs = tokenizer(text, return_tensors="pt").to(model.device)
+
+# Generate
+output = model.generate(
+    **inputs,
+    max_new_tokens=2048,
+    temperature=0.6,
+    top_p=0.95,
+    top_k=20,
+    min_p=0.0,
+)
+
+# Decode and print the generated insight
+response = tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
+print(response)
+```
+
+## Evaluation
+
+See [GiantsBench-test](https://huggingface.co/datasets/giants2026/GiantsBench-test) for the evaluation benchmark.
+
+## License
+
+This model is released under the [Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/) license.