Model: agentlans/Gemma2-9B-AdvancedFuse Source: Original Platform
datasets, base_model, tags, language, pipeline_tag, license, model-index
| datasets | base_model | tags | language | pipeline_tag | license | model-index | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
text-generation | gemma |
|
Gemma2-9B-AdvancedFuse
Gemma2-9B-AdvancedFuse is an experimental, open-source large language model (LLM) with 9 billion parameters. It aims to combine the strengths of FuseAI/FuseChat-Gemma-2-9B-Instruct and jsgreenawalt/gemma-2-9B-it-advanced-v2.1 through additive linear merging, further fine-tuned on a 12K row dataset from agentlans/crash-course for enhanced chat and instruct performance, including math and multilingual prompts.
Capabilities
- Text Generation: Generates coherent emails, summaries, and notes. This model card was primarily generated by the model itself.
- Instruction Following: Demonstrates strong ability to understand and execute instructions in conversational settings.
- Roleplaying: Can engage in third-person narrative roleplay but may exhibit common GPT expressions or clichés.
Limitations
As with most large language models:
- Factual Errors: May generate incorrect or outdated information due to data biases.
- Mathematical Operations: Struggles with mathematical calculations requiring symbolic reasoning despite its finetuning data.
- Handling Unsafe Input: May generate unsafe, biased, or malicious content if provided inappropriate input. Careful prompt engineering is recommended.
Model Usage Guidelines
- Use clear and specific instructions for optimal performance.
- Verify generated outputs for factual accuracy when critical information is involved.
- Avoid providing inputs that could lead to harmful or unethical responses.
- Consider using human review, especially in high-stakes applications.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here! Summarized results can be found here!
| Metric | Value (%) |
|---|---|
| Average | 20.02 |
| IFEval (0-Shot) | 15.43 |
| BBH (3-Shot) | 40.52 |
| MATH Lvl 5 (4-Shot) | 7.55 |
| GPQA (0-shot) | 11.30 |
| MuSR (0-shot) | 11.99 |
| MMLU-PRO (5-shot) | 33.34 |
Description