初始化项目,由ModelHub XC社区提供模型
Model: vaiv/GeM2-Llamion-14B-Chat Source: Original Platform
This commit is contained in:
21
README.md
Normal file
21
README.md
Normal file
@@ -0,0 +1,21 @@
|
||||
---
|
||||
license: apache-2.0
|
||||
---
|
||||
|
||||
# **GeM2-Llamion-14B**
|
||||
|
||||
We have released **Llamion** as **GeM 2.0**, the second series of generative models developed by VAIV Company to address the our principal business needs.
|
||||
|
||||
**Llamion** (Llamafied Orion) is derived from transforming the [Orion model](https://huggingface.co/OrionStarAI/Orion-14B-Chat)
|
||||
into [the standard LLaMA architecture](https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py)
|
||||
through parameter mapping and offline knowledge transfer.
|
||||
Further technical specifications and study results will be detailed in our upcoming paper, available on this page.
|
||||
|
||||
<!-- Note that this model has NOT been contaminated to artificially inflate its scores for the Open LLM Leaderboards,
|
||||
unlike some recent models which have been intentionally tainted. -->
|
||||
|
||||

|
||||
|
||||
### Contributors
|
||||
|
||||
- VAIV Company AI Lab ([vaiv.kr](https://www.vaiv.kr/))
|
||||
Reference in New Issue
Block a user