初始化项目,由ModelHub XC社区提供模型
Model: prithivMLmods/Sombrero-Opus-14B-Elite13 Source: Original Platform
This commit is contained in:
116
README.md
Normal file
116
README.md
Normal file
@@ -0,0 +1,116 @@
|
||||
---
|
||||
license: apache-2.0
|
||||
base_model:
|
||||
- prithivMLmods/Sombrero-Opus-14B-Elite6
|
||||
pipeline_tag: text-generation
|
||||
tags:
|
||||
- SFT
|
||||
- text-generation-inference
|
||||
- abliterated
|
||||
- trl
|
||||
- code
|
||||
- moderately abliterated
|
||||
- math
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
---
|
||||
|
||||

|
||||
|
||||
# **Sombrero-Opus-14B-Elite13**
|
||||
|
||||
> Sombrero-Opus-14B-Elite13 builds upon the Qwen 2.5 14B modality architecture, elevating reasoning performance in mid- to large-scale models. This iteration focuses on enhancing general-purpose comprehension, structured intelligence, and interactive versatility. Fine-tuned with an advanced reasoning chain and carefully curated datasets, Elite13 offers improved contextual understanding, logical coherence, and multi-step problem-solving.
|
||||
|
||||
Key improvements include:
|
||||
|
||||
1. **Expanded Domain Fluency**: Delivers refined general knowledge across disciplines for more accurate and coherent answers.
|
||||
2. **Advanced Instruction Parsing**: Enhanced capacity to interpret and execute complex instructions while preserving structure and clarity.
|
||||
3. **Robust Prompt Flexibility**: Excels in adapting to diverse interaction styles, from casual inquiries to formal requests.
|
||||
4. **Extended Context Window**: Handles up to 128K tokens of input and generates up to 8K tokens in a single output — ideal for detailed reasoning and expansive replies.
|
||||
5. **Global Linguistic Range**: Offers proficiency in 29+ languages, including English, Chinese, French, Spanish, Japanese, Arabic, and more.
|
||||
|
||||
---
|
||||
|
||||
# **Quickstart with Transformers**
|
||||
|
||||
Use the following snippet to load and test the model using `transformers` and `apply_chat_template`:
|
||||
|
||||
```python
|
||||
from transformers import AutoModelForCausalLM, AutoTokenizer
|
||||
|
||||
model_name = "prithivMLmods/Sombrero-Opus-14B-Elite13"
|
||||
|
||||
model = AutoModelForCausalLM.from_pretrained(
|
||||
model_name,
|
||||
torch_dtype="auto",
|
||||
device_map="auto"
|
||||
)
|
||||
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
||||
|
||||
prompt = "What are the key principles of general-purpose AI?"
|
||||
messages = [
|
||||
{"role": "system", "content": "You are a helpful assistant capable of answering a wide range of questions."},
|
||||
{"role": "user", "content": prompt}
|
||||
]
|
||||
text = tokenizer.apply_chat_template(
|
||||
messages,
|
||||
tokenize=False,
|
||||
add_generation_prompt=True
|
||||
)
|
||||
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
|
||||
|
||||
generated_ids = model.generate(
|
||||
**model_inputs,
|
||||
max_new_tokens=512
|
||||
)
|
||||
generated_ids = [
|
||||
output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
|
||||
]
|
||||
|
||||
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
# **Intended Use**
|
||||
|
||||
1. **Cognitive Reasoning & General Q\&A**
|
||||
Designed to support high-level thinking and accurate responses across general domains.
|
||||
|
||||
2. **Education & Research Support**
|
||||
Suitable for generating study guides, academic summaries, and informative explanations.
|
||||
|
||||
3. **Conversational Intelligence**
|
||||
Powers AI assistants and chatbots with memory-aware, context-sensitive dialogues.
|
||||
|
||||
4. **Cross-Language Communication**
|
||||
Useful in multilingual environments for translation, communication, and content creation.
|
||||
|
||||
5. **Data-Aware Structuring**
|
||||
Capable of converting unstructured data into meaningful formats like JSON or tabular summaries.
|
||||
|
||||
6. **Lengthy Content Generation**
|
||||
Suitable for drafting articles, technical documents, or creative prose with sustained coherence.
|
||||
|
||||
---
|
||||
|
||||
# **Limitations**
|
||||
|
||||
1. **Resource-Intensive Execution**
|
||||
Requires robust computational infrastructure (e.g., ≥48GB VRAM) to run efficiently.
|
||||
|
||||
2. **Residual Biases**
|
||||
Though tuned for neutrality, occasional bias may surface from inherited training data.
|
||||
|
||||
3. **Creative Variability**
|
||||
Creative outputs such as fiction or poetry may vary in quality and style coherence.
|
||||
|
||||
4. **Lack of Real-Time Knowledge**
|
||||
The model operates with a static knowledge base and lacks access to current world events.
|
||||
|
||||
5. **Drift in Extended Outputs**
|
||||
Long responses may introduce cumulative inaccuracies or lose focus over time.
|
||||
|
||||
6. **Prompt Dependence**
|
||||
Output quality is sensitive to the clarity and specificity of the initial prompt.
|
||||
Reference in New Issue
Block a user