初始化项目,由ModelHub XC社区提供模型
Model: giants2026/GIANTS-4B Source: Original Platform
This commit is contained in:
63
README.md
Normal file
63
README.md
Normal file
@@ -0,0 +1,63 @@
|
||||
---
|
||||
license: cc-by-4.0
|
||||
library_name: transformers
|
||||
base_model: Qwen/Qwen3-4B
|
||||
tags:
|
||||
- insight-anticipation
|
||||
- scientific-literature
|
||||
---
|
||||
|
||||
# GIANTS-4B
|
||||
|
||||
GIANTS-4B is a language model for **insight anticipation** from scientific literature, introduced in the paper:
|
||||
|
||||
> **GIANTS: Generative Insight Anticipation from Scientific Literature**
|
||||
|
||||
Given summaries of two parent papers, GIANTS-4B generates the key insight of a downstream paper that builds on both parent papers. It is fine-tuned from [Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B).
|
||||
|
||||
## Training Data
|
||||
|
||||
This model was trained on [GiantsBench-train](https://huggingface.co/datasets/giants2026/GiantsBench-train).
|
||||
|
||||
## Usage
|
||||
|
||||
```python
|
||||
from transformers import AutoModelForCausalLM, AutoTokenizer
|
||||
from datasets import load_dataset
|
||||
|
||||
# Load model and tokenizer
|
||||
model_name = "giants2026/GIANTS-4B"
|
||||
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
||||
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")
|
||||
|
||||
# Load a sample prompt from the GiantsBench test set
|
||||
dataset = load_dataset("giants2026/GiantsBench-test", split="train")
|
||||
query = dataset[0]["query"]
|
||||
|
||||
# Format as chat messages
|
||||
messages = [{"role": "user", "content": query}]
|
||||
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
|
||||
inputs = tokenizer(text, return_tensors="pt").to(model.device)
|
||||
|
||||
# Generate
|
||||
output = model.generate(
|
||||
**inputs,
|
||||
max_new_tokens=2048,
|
||||
temperature=0.6,
|
||||
top_p=0.95,
|
||||
top_k=20,
|
||||
min_p=0.0,
|
||||
)
|
||||
|
||||
# Decode and print the generated insight
|
||||
response = tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
|
||||
print(response)
|
||||
```
|
||||
|
||||
## Evaluation
|
||||
|
||||
See [GiantsBench-test](https://huggingface.co/datasets/giants2026/GiantsBench-test) for the evaluation benchmark.
|
||||
|
||||
## License
|
||||
|
||||
This model is released under the [Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/) license.
|
||||
Reference in New Issue
Block a user