初始化项目,由ModelHub XC社区提供模型

Model: BEE-spoke-data/Mixtral-GQA-400m-v2
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-24 01:55:15 +08:00
commit 8eae555d0b
10 changed files with 2044 additions and 0 deletions

41
README.md Normal file
View File

@@ -0,0 +1,41 @@
---
license: apache-2.0
language:
- en
---
# BEE-spoke-data/Mixtral-GQA-400m-v2
## testing code
```python
# !pip install -U -q transformers datasets accelerate sentencepiece
import pprint as pp
from transformers import pipeline
pipe = pipeline(
"text-generation",
model="BEE-spoke-data/Mixtral-GQA-400m-v2",
device_map="auto",
)
pipe.model.config.pad_token_id = pipe.model.config.eos_token_id
prompt = "My favorite movie is Godfather because"
res = pipe(
prompt,
max_new_tokens=256,
top_k=4,
penalty_alpha=0.6,
use_cache=True,
no_repeat_ngram_size=4,
repetition_penalty=1.1,
renormalize_logits=True,
)
pp.pprint(res[0])
```