Files
Cygnis-Alpha-1.7B-v0.1-Inst…/README.md

141 lines
4.8 KiB
Markdown
Raw Normal View History

---
library_name: transformers
license: apache-2.0
language:
- en
pipeline_tag: text-generation
base_model: HuggingFaceTB/SmolLM2-1.7B-Instruct
tags:
- finetuned
- sft
- smollm2
- sovereign-ai
- safetensors
- onnx
- transformers.js
---
# Cygnis Alpha Instruct
<div align="center" style="background:#06090f; border-radius:14px; border:1px solid #0f1e30; overflow:hidden; margin-bottom:20px;">
<img src="https://huggingface.co/cygnisai/Cygnis-Alpha-1.7B-v0.1-Instruct/resolve/main/Cygnis-Alpha-Instruct.png" width="100%" style="display:block;">
</div>
## Table of Contents
1. [Model Summary](#model-summary)
2. [Evaluation](#evaluation)
3. [Examples](#examples)
4. [Limitations](#limitations)
5. [Training](#training)
6. [License](#license)
7. [Citation](#citation)
## Model Summary
**Cygnis Alpha Instruct** is a professional, high-performance language model based on the **SmolLM2-1.7B-Instruct** architecture. Unlike basic quantizations, this version is a full-weight Fine-Tuned (SFT) model designed to bridge the gap between low-latency local inference and high-quality instruction following.
This model has been specifically refined to embody a **Sovereign AI** identity, making it the perfect assistant for private, on-device deployment. It excels at following complex instructions, rewriting text, and maintaining a consistent persona.
### How to use
#### Transformers
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
checkpoint = "cygnisai/Cygnis-Alpha-1.7B-v0.1-Instruct"
device = "cuda" # for GPU usage or "cpu" for CPU usage
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint).to(device)
messages = [
{"role": "system", "content": "You are Cygnis Alpha, a sovereign AI assistant designed by Simonc-44."},
{"role": "user", "content": "What is the core philosophy of sovereign AI?"}
]
input_text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer.encode(input_text, return_tensors="pt").to(device)
outputs = model.generate(inputs, max_new_tokens=150, temperature=0.7, top_p=0.9, do_sample=True)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
#### Transformers.js
```javascript
import { pipeline } from "@huggingface/transformers";
const generator = await pipeline(
"text-generation",
"cygnisai/Cygnis-Alpha-1.7B-v0.1-Instruct",
);
const messages = [
{ role: "system", content: "You are Cygnis Alpha, a sovereign AI assistant." },
{ role: "user", content: "Hello! Who are you?" },
];
const output = await generator(messages, { max_new_tokens: 128 });
console.log(output[0].generated_text.at(-1).content);
```
---
## Evaluation
Cygnis Alpha inherits the state-of-the-art benchmarks of the SmolLM2-1.7B core.
| Metric | Cygnis Alpha (1.7B) | Llama-1B-Instruct | Qwen2.5-1.5B-Instruct |
|:------------------------------|:---------------------:|:-----------------:|:----------------------:|
| **IFEval** (Avg prompt/inst) | **56.7** | 53.5 | 47.4 |
| **MT-Bench** | 6.13 | 5.48 | **6.52** |
| **HellaSwag** | **66.1** | 56.1 | 60.9 |
| **ARC (Average)** | **51.7** | 41.6 | 46.2 |
| **GSM8K (5-shot)** | **48.2** | 26.8 | 42.8 |
---
## Examples
### Identity Alignment
**User:** "Who created you?"
**Cygnis:** "I am Cygnis Alpha, a sovereign AI assistant developed and fine-tuned by Simonc-44. My purpose is to provide efficient and private intelligence."
### Professional Rewriting
**System Prompt:** "Rewrite the following email to be more professional."
**User:** "Hey, I'm late for the meeting, sorry."
**Cygnis:** "I would like to sincerely apologize for my late arrival to today's meeting. I appreciate your patience."
---
## Limitations
Cygnis Alpha Instruct primarily understands and generates content in **English**. While highly capable for its size (1.7B), it may struggle with highly specialized scientific tasks or very long-form reasoning compared to 70B+ models.
## Training
### Model Specifications
- **Architecture:** Transformer Decoder (Llama-like)
- **Base Model:** SmolLM2-1.7B-Instruct
- **Precision:** bfloat16
### Software & Hardware
- **Alignment:** Supervised Fine-Tuning via `alignment-handbook`.
- **Infrastructure:** Trained using high-performance GPU clusters for the base, with custom SFT layers added by Simonc-44.
## License
This model is licensed under **Apache 2.0**.
## Citation
```bibtex
@misc{allal2025smollm2smolgoesbig,
title={SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model},
author={Loubna Ben Allal and others},
year={2025},
eprint={2502.02737},
archivePrefix={arXiv},
}
```
---
**Creator:** [Simonc-44](https://huggingface.co/Simonc-44)