116 lines
4.1 KiB
Markdown
116 lines
4.1 KiB
Markdown
---
|
|
license: apache-2.0
|
|
base_model:
|
|
- prithivMLmods/Sombrero-Opus-14B-Elite6
|
|
pipeline_tag: text-generation
|
|
tags:
|
|
- SFT
|
|
- text-generation-inference
|
|
- abliterated
|
|
- trl
|
|
- code
|
|
- moderately abliterated
|
|
- math
|
|
language:
|
|
- en
|
|
library_name: transformers
|
|
---
|
|
|
|

|
|
|
|
# **Sombrero-Opus-14B-Elite13**
|
|
|
|
> Sombrero-Opus-14B-Elite13 builds upon the Qwen 2.5 14B modality architecture, elevating reasoning performance in mid- to large-scale models. This iteration focuses on enhancing general-purpose comprehension, structured intelligence, and interactive versatility. Fine-tuned with an advanced reasoning chain and carefully curated datasets, Elite13 offers improved contextual understanding, logical coherence, and multi-step problem-solving.
|
|
|
|
Key improvements include:
|
|
|
|
1. **Expanded Domain Fluency**: Delivers refined general knowledge across disciplines for more accurate and coherent answers.
|
|
2. **Advanced Instruction Parsing**: Enhanced capacity to interpret and execute complex instructions while preserving structure and clarity.
|
|
3. **Robust Prompt Flexibility**: Excels in adapting to diverse interaction styles, from casual inquiries to formal requests.
|
|
4. **Extended Context Window**: Handles up to 128K tokens of input and generates up to 8K tokens in a single output — ideal for detailed reasoning and expansive replies.
|
|
5. **Global Linguistic Range**: Offers proficiency in 29+ languages, including English, Chinese, French, Spanish, Japanese, Arabic, and more.
|
|
|
|
---
|
|
|
|
# **Quickstart with Transformers**
|
|
|
|
Use the following snippet to load and test the model using `transformers` and `apply_chat_template`:
|
|
|
|
```python
|
|
from transformers import AutoModelForCausalLM, AutoTokenizer
|
|
|
|
model_name = "prithivMLmods/Sombrero-Opus-14B-Elite13"
|
|
|
|
model = AutoModelForCausalLM.from_pretrained(
|
|
model_name,
|
|
torch_dtype="auto",
|
|
device_map="auto"
|
|
)
|
|
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
|
|
|
prompt = "What are the key principles of general-purpose AI?"
|
|
messages = [
|
|
{"role": "system", "content": "You are a helpful assistant capable of answering a wide range of questions."},
|
|
{"role": "user", "content": prompt}
|
|
]
|
|
text = tokenizer.apply_chat_template(
|
|
messages,
|
|
tokenize=False,
|
|
add_generation_prompt=True
|
|
)
|
|
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
|
|
|
|
generated_ids = model.generate(
|
|
**model_inputs,
|
|
max_new_tokens=512
|
|
)
|
|
generated_ids = [
|
|
output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
|
|
]
|
|
|
|
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
|
```
|
|
|
|
---
|
|
|
|
# **Intended Use**
|
|
|
|
1. **Cognitive Reasoning & General Q\&A**
|
|
Designed to support high-level thinking and accurate responses across general domains.
|
|
|
|
2. **Education & Research Support**
|
|
Suitable for generating study guides, academic summaries, and informative explanations.
|
|
|
|
3. **Conversational Intelligence**
|
|
Powers AI assistants and chatbots with memory-aware, context-sensitive dialogues.
|
|
|
|
4. **Cross-Language Communication**
|
|
Useful in multilingual environments for translation, communication, and content creation.
|
|
|
|
5. **Data-Aware Structuring**
|
|
Capable of converting unstructured data into meaningful formats like JSON or tabular summaries.
|
|
|
|
6. **Lengthy Content Generation**
|
|
Suitable for drafting articles, technical documents, or creative prose with sustained coherence.
|
|
|
|
---
|
|
|
|
# **Limitations**
|
|
|
|
1. **Resource-Intensive Execution**
|
|
Requires robust computational infrastructure (e.g., ≥48GB VRAM) to run efficiently.
|
|
|
|
2. **Residual Biases**
|
|
Though tuned for neutrality, occasional bias may surface from inherited training data.
|
|
|
|
3. **Creative Variability**
|
|
Creative outputs such as fiction or poetry may vary in quality and style coherence.
|
|
|
|
4. **Lack of Real-Time Knowledge**
|
|
The model operates with a static knowledge base and lacks access to current world events.
|
|
|
|
5. **Drift in Extended Outputs**
|
|
Long responses may introduce cumulative inaccuracies or lose focus over time.
|
|
|
|
6. **Prompt Dependence**
|
|
Output quality is sensitive to the clarity and specificity of the initial prompt. |