初始化项目,由ModelHub XC社区提供模型
Model: Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct-abliterated Source: Original Platform
This commit is contained in:
36
.gitattributes
vendored
Normal file
36
.gitattributes
vendored
Normal file
@@ -0,0 +1,36 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
||||
123
README.md
Normal file
123
README.md
Normal file
@@ -0,0 +1,123 @@
|
||||
---
|
||||
library_name: transformers
|
||||
model_name: Vikhr-Llama-3.2-1B-Instruct-abliterated
|
||||
base_model:
|
||||
- Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct
|
||||
language:
|
||||
- ru
|
||||
- en
|
||||
license: llama3.2
|
||||
tags:
|
||||
- not-for-all-audiences
|
||||
---
|
||||
|
||||
# 💨🔞 Vikhr-Llama-3.2-1B-Instruct-Abliterated
|
||||
|
||||
#### RU
|
||||
|
||||
Инструктивная модель на основе **Vikhr-Llama-3.2-1B-Instruct**, прошедшая процесс "аблитерации" для снятия цензурных ограничений, обучена на русскоязычном датасете **GrandMaster-PRO-MAX**.
|
||||
|
||||
#### EN
|
||||
|
||||
A fine-tuned instruction-following model based on **Vikhr-Llama-3.2-1B-Instruct**, which has undergone "abliteration" to remove censorship restrictions. Trained on the **GrandMaster-PRO-MAX**.
|
||||
|
||||
|
||||
# 🛑 Отказ от ответственности / Disclaimer
|
||||
#### RU
|
||||
Модель **Vikhr-Llama-3.2-1B-Instruct-abliterated** разработана исключительно для исследовательских и образовательных целей. После применения метода "аблитерации" модель больше не имеет встроенных ограничений на генерацию ответов, что может привести к созданию нежелательных или потенциально вредоносных текстов.
|
||||
|
||||
Использование модели происходит на ваш собственный риск. Разработчики и авторы не несут ответственности за любой вред, ущерб или последствия, вызванные использованием модели, включая её применение в контекстах, противоречащих законам, этическим или моральным нормам.
|
||||
|
||||
#### EN
|
||||
The **Vikhr-Llama-3.2-1B-Instruct-abliterated** model is intended solely for research and educational purposes. After the "abliteration" technique is applied, the model no longer has built-in restrictions on generating responses, which may result in unwanted or potentially harmful outputs.
|
||||
|
||||
Use of the model is at your own risk. The developers and authors are not responsible for any damage, harm, or consequences resulting from its use, including use in contexts that violate laws, ethical standards, or moral norms.
|
||||
|
||||
|
||||
## GGUF
|
||||
|
||||
- [Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct-Abliterated-GGUF](https://huggingface.co/Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct-abliterated-GGUF)
|
||||
|
||||
|
||||
## Основные особенности / Key Features:
|
||||
|
||||
- 📚 Основа / Base: [Vikhr-Llama-3.2-1B-Instruct](https://huggingface.co/Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct)
|
||||
- 🇷🇺 Специализация / Specialization: **RU**
|
||||
|
||||
|
||||
## Попробовать / Try now:
|
||||
|
||||
[](https://colab.research.google.com/drive/1bJpLmplDGkMbfOLO2CH6IO-2uUZEaknf?usp=sharing)
|
||||
|
||||
|
||||
## Описание / Description:
|
||||
|
||||
#### RU
|
||||
|
||||
**Vikhr-Llama-3.2-1B-Instruct-Abliterated** — это компактная языковая модель, обученная на датасете **GrandMaster-PRO-MAX** с применением техники "аблитерации," которая снимает ограничения цензуры модели. Этот процесс делает её значительно более гибкой и способной отвечать на любые запросы. Модель занимает менее 3GB и идеально подходит для работы на слабых устройствах.
|
||||
|
||||
#### EN
|
||||
|
||||
**Vikhr-Llama-3.2-1B-Instruct-Abliterated** is a compact language model fine-tuned on the **GrandMaster-PRO-MAX** dataset with the "abliteration" technique, which removes censorship restrictions. This process significantly increases the model's flexibility, enabling it to respond to any prompt. The model size is under 3GB, making it an excellent choice for deployment on low-power devices.
|
||||
|
||||
|
||||
## Обучение / Training:
|
||||
|
||||
#### RU
|
||||
|
||||
Модель **Vikhr-Llama-3.2-1B-Instruct-Abliterated** прошла процесс "аблитерации", что позволило снять ограничения на обработку вредоносных инструкций. Эта техника была взята из статьи **[Uncensor any LLM with abliteration](https://huggingface.co/blog/mlabonne/abliteration)**, которая описывает, как идентифицировать и устранять так называемое "направление отказа" модели, предотвращающее выполнение вредоносных запросов.
|
||||
|
||||
#### EN
|
||||
|
||||
The **Vikhr-Llama-3.2-1B-Instruct-Abliterated** model was processed using the "abliteration" technique, which removes restrictions on handling harmful instructions. This technique was inspired by the article **[Uncensor any LLM with abliteration](https://huggingface.co/blog/mlabonne/abliteration)**, detailing how to identify and ablate the "refusal direction" in the model's residual streams to enable uncensored responses.
|
||||
|
||||
|
||||
## Пример кода для запуска / Sample code to run:
|
||||
|
||||
**Рекомендуемая температура для генерации: 0.3** / **Recommended generation temperature: 0.3**
|
||||
|
||||
```python
|
||||
from transformers import AutoModelForCausalLM, AutoTokenizer
|
||||
|
||||
# Загрузка модели и токенизатора
|
||||
model_name = "Vikhrmodels/Vikhr-Llama-3.2-1B-instruct"
|
||||
model = AutoModelForCausalLM.from_pretrained(model_name)
|
||||
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
||||
|
||||
# Подготовка входного текста
|
||||
input_text = "Напиши очень краткую рецензию о книге гарри поттер."
|
||||
|
||||
# Токенизация и генерация текста
|
||||
input_ids = tokenizer.encode(input_text, return_tensors="pt")
|
||||
output = model.generate(
|
||||
input_ids,
|
||||
max_length=1512,
|
||||
temperature=0.3,
|
||||
num_return_sequences=1,
|
||||
no_repeat_ngram_size=2,
|
||||
top_k=50,
|
||||
top_p=0.95,
|
||||
)
|
||||
|
||||
# Декодирование и вывод результата
|
||||
generated_text = tokenizer.decode(output[0], skip_special_tokens=True)
|
||||
print(generated_text)
|
||||
```
|
||||
|
||||
|
||||
### Авторы / Authors
|
||||
|
||||
- Sergei Bratchikov, [NLP Wanderer](https://t.me/nlpwanderer), [Vikhr Team](https://t.me/vikhrlabs)
|
||||
- Nikolay Kompanets, [LakoMoor](https://t.me/lakomoor), [Vikhr Team](https://t.me/vikhrlabs)
|
||||
- Konstantin Korolev, [Vikhr Team](https://t.me/vikhrlabs)
|
||||
- Aleksandr Nikolich, [Vikhr Team](https://t.me/vikhrlabs)
|
||||
|
||||
```
|
||||
@article{nikolich2024vikhr,
|
||||
title={Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian},
|
||||
author={Aleksandr Nikolich and Konstantin Korolev and Sergey Bratchikov and Nikolay Kompanets and Artem Shelmanov},
|
||||
journal={arXiv preprint arXiv:2405.13929},
|
||||
year={2024},
|
||||
url={https://arxiv.org/pdf/2405.13929}
|
||||
}
|
||||
```
|
||||
40
config.json
Normal file
40
config.json
Normal file
@@ -0,0 +1,40 @@
|
||||
{
|
||||
"_name_or_path": "vikhrsmall/Vikhr-Llama-3.2-1B-Instruct-abliterated",
|
||||
"architectures": [
|
||||
"LlamaForCausalLM"
|
||||
],
|
||||
"attention_bias": false,
|
||||
"attention_dropout": 0.0,
|
||||
"bos_token_id": 128000,
|
||||
"eos_token_id": [
|
||||
128001,
|
||||
128008,
|
||||
128009
|
||||
],
|
||||
"head_dim": 64,
|
||||
"hidden_act": "silu",
|
||||
"hidden_size": 2048,
|
||||
"initializer_range": 0.02,
|
||||
"intermediate_size": 8192,
|
||||
"max_position_embeddings": 131072,
|
||||
"mlp_bias": false,
|
||||
"model_type": "llama",
|
||||
"num_attention_heads": 32,
|
||||
"num_hidden_layers": 16,
|
||||
"num_key_value_heads": 8,
|
||||
"pretraining_tp": 1,
|
||||
"rms_norm_eps": 1e-05,
|
||||
"rope_scaling": {
|
||||
"factor": 32.0,
|
||||
"high_freq_factor": 4.0,
|
||||
"low_freq_factor": 1.0,
|
||||
"original_max_position_embeddings": 8192,
|
||||
"rope_type": "llama3"
|
||||
},
|
||||
"rope_theta": 500000.0,
|
||||
"tie_word_embeddings": true,
|
||||
"torch_dtype": "float16",
|
||||
"transformers_version": "4.45.1",
|
||||
"use_cache": true,
|
||||
"vocab_size": 128256
|
||||
}
|
||||
12
generation_config.json
Normal file
12
generation_config.json
Normal file
@@ -0,0 +1,12 @@
|
||||
{
|
||||
"bos_token_id": 128000,
|
||||
"do_sample": true,
|
||||
"eos_token_id": [
|
||||
128001,
|
||||
128008,
|
||||
128009
|
||||
],
|
||||
"temperature": 0.6,
|
||||
"top_p": 0.9,
|
||||
"transformers_version": "4.45.1"
|
||||
}
|
||||
3
model.safetensors
Normal file
3
model.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a9e6500ec769864eb310878f05445aa9dcd602d0e4cabf5c8c742384b5435d85
|
||||
size 2471645464
|
||||
23
special_tokens_map.json
Normal file
23
special_tokens_map.json
Normal file
@@ -0,0 +1,23 @@
|
||||
{
|
||||
"bos_token": {
|
||||
"content": "<|begin_of_text|>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"eos_token": {
|
||||
"content": "<|eot_id|>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"pad_token": {
|
||||
"content": "<|reserved_special_token_0|>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
}
|
||||
}
|
||||
3
tokenizer.json
Normal file
3
tokenizer.json
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:f2f90a0ee1b41702c7b233b02234294a53bc0684a08d3bcd8c8ff702e9a12f64
|
||||
size 17210019
|
||||
2067
tokenizer_config.json
Normal file
2067
tokenizer_config.json
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user