117 lines
4.0 KiB
Markdown
117 lines
4.0 KiB
Markdown
---
|
||
language:
|
||
- en
|
||
- fr
|
||
- de
|
||
- it
|
||
- es
|
||
pipeline_tag: text-generation
|
||
tags:
|
||
- cygnisai
|
||
- llama-3.1
|
||
- model-merge
|
||
- unsloth
|
||
- enterprise-ready
|
||
- finetuned
|
||
- safety-aligned
|
||
license: apache-2.0
|
||
datasets:
|
||
- cygnisai/Cygnis-Alpha-2-Instruct-Mix
|
||
base_model:
|
||
- unsloth/Meta-Llama-3.1-8B-bnb-4bit
|
||
library_name: transformers
|
||
---
|
||
|
||
# Cygnis-Alpha-2-8B-v0.2 (Merge Version)
|
||
|
||
<div align="center" style="background:#06090f; border-radius:14px; border:1px solid #0f1e30; overflow:hidden; margin-bottom:20px;">
|
||
<img src="https://huggingface.co/cygnisai/Cygnis-Alpha-2-8B-v0.1/resolve/main/Cygnis-alpha-2.png" width="100%" style="display:block;">
|
||
</div>
|
||
|
||
**Cygnis-Alpha-2-8B-v0.2** is a next-generation large language model developed by **CygnisAI**. This version is the result of a strategic **Model Merge**, engineered to surpass the native reasoning capabilities of Llama 3.1 8B while maintaining optimal computational efficiency for enterprise-scale deployments.
|
||
|
||
## Architecture & Development
|
||
|
||
The model is built upon the **Llama 3.1** architecture (Auto-regressive Transformer). The CygnisAI team leveraged **Unsloth** for accelerated **Supervised Fine-Tuning (SFT)** and applied advanced **Model Merging** techniques to sharpen technical and professional response accuracy.
|
||
|
||
| Feature | Specification |
|
||
| :--- | :--- |
|
||
| **Developer** | [CygnisAI](https://huggingface.co/cygnisai) |
|
||
| **Base Model** | Meta Llama 3.1 8B (Finetuned via Unsloth) |
|
||
| **Context Length** | 128k tokens |
|
||
| **Optimization** | Multilingual (Primary focus: EN/FR) |
|
||
| **Knowledge Cutoff** | December 2023 |
|
||
| **License** | Apache 2.0 |
|
||
|
||
## Performance & Benchmarks
|
||
|
||
Cygnis v0.2 has been rigorously tested against internal benchmarks to ensure production-grade stability. It shows significant improvements in **Instruction Following** for complex, multi-step tasks compared to the v0.1 release.
|
||
|
||
* **MMLU (Reasoning)**: ~69.8%
|
||
* **IFEval (Strict Instruction)**: ~81.2%
|
||
* **HumanEval (Coding)**: ~72.9%
|
||
|
||
## Quick Start
|
||
|
||
### Via Transformers (Pipeline)
|
||
```python
|
||
import transformers
|
||
import torch
|
||
|
||
model_id = "cygnisai/Cygnis-Alpha-2-8B-v0.2"
|
||
|
||
pipeline = transformers.pipeline(
|
||
"text-generation",
|
||
model=model_id,
|
||
model_kwargs={"torch_dtype": torch.bfloat16},
|
||
device_map="auto"
|
||
)
|
||
|
||
messages = [
|
||
{"role": "system", "content": "You are Cygnis, the official enterprise AI assistant."},
|
||
{"role": "user", "content": "Analyze the scalability requirements for our LLM infrastructure."}
|
||
]
|
||
|
||
print(pipeline(messages, max_new_tokens=256))
|
||
```
|
||
|
||
### Official Prompt Format
|
||
The model follows the standard Llama 3.1 chat template:
|
||
```text
|
||
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
|
||
|
||
{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>
|
||
|
||
{user_prompt}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
|
||
```
|
||
|
||
## Responsibility & Safety
|
||
|
||
In alignment with **CygnisAI’s** commitment to responsible AI:
|
||
1. **Guardrails**: The model integrates filters to mitigate malicious content, disinformation, and unintended biases.
|
||
2. **Transparency**: While highly capable, Cygnis should not be used without human oversight for critical decision-making (Medical, Legal, Financial).
|
||
3. **License**: This model is released under the **Apache License 2.0**, offering maximum flexibility for commercial and private use.
|
||
|
||
## Legal Notice
|
||
This model was finetuned using the **Unsloth** library. While the underlying architecture is based on Llama 3.1, the specific weights of Cygnis-Alpha-2-8B-v0.2 are provided under the Apache 2.0 terms.
|
||
*Built with Llama.*
|
||
|
||
---
|
||
|
||
## Citation
|
||
|
||
```bibtex
|
||
@misc{cygnis_alpha_2_v0.1,
|
||
author = {Simonc-44},
|
||
title = {The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink},
|
||
year = {2026},
|
||
publisher = {Hugging Face},
|
||
url = {https://huggingface.co/Simonc-44/Cygnis-Alpha-2-8B-v0.1}
|
||
eprint={2204.05149},
|
||
archivePrefix={arXiv},
|
||
}
|
||
```
|
||
|
||
---
|
||
**Professional Contact & Support**: [cygnisai/contact](https://huggingface.co/cygnisai)
|
||
*Driven by innovation and corporate-grade rigor at CygnisAI.* |