初始化项目,由ModelHub XC社区提供模型
Model: Rustamshry/BioGenesis-ToT-GGUF Source: Original Platform
This commit is contained in:
36
.gitattributes
vendored
Normal file
36
.gitattributes
vendored
Normal file
@@ -0,0 +1,36 @@
|
|||||||
|
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.model filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||||
|
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
BioGenesis-ToT-f16.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
3
BioGenesis-ToT-f16.gguf
Normal file
3
BioGenesis-ToT-f16.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:871698edf28ac8c3ee692874fb142e2750c3aca66a60d1a046088a13fa24b6d2
|
||||||
|
size 3447349472
|
||||||
100
README.md
Normal file
100
README.md
Normal file
@@ -0,0 +1,100 @@
|
|||||||
|
---
|
||||||
|
license: apache-2.0
|
||||||
|
language:
|
||||||
|
- en
|
||||||
|
metrics:
|
||||||
|
- accuracy
|
||||||
|
base_model:
|
||||||
|
- khazarai/BioGenesis-ToT
|
||||||
|
pipeline_tag: text-generation
|
||||||
|
tags:
|
||||||
|
- biology
|
||||||
|
- medical
|
||||||
|
- science
|
||||||
|
- unsloth
|
||||||
|
- sft
|
||||||
|
---
|
||||||
|
|
||||||
|
# Model Card for BioGenesis-ToT
|
||||||
|
|
||||||
|
|
||||||
|

|
||||||
|
|
||||||
|
- **Overall Success Rate**:
|
||||||
|
- khazarai/BioGenesis-ToT: **51.45**
|
||||||
|
- Qwen/Qwen3-1.7B: **46.82**
|
||||||
|
|
||||||
|
- **Benchmark**: [emre/TARA_Turkish_LLM_Benchmark](https://huggingface.co/datasets/emre/TARA_Turkish_LLM_Benchmark)
|
||||||
|
|
||||||
|
|
||||||
|
GGUF version of https://huggingface.co/khazarai/BioGenesis-ToT
|
||||||
|
|
||||||
|
BioGenesis-ToT is a fine-tuned version of Qwen3-1.7B, optimized for mechanistic reasoning and explanatory understanding in biology.
|
||||||
|
This model has been trained on the [moremilk/ToT-Biology](https://huggingface.co/datasets/moremilk/ToT-Biology) dataset — a reasoning-rich collection of biology questions emphasizing why and how processes occur, rather than simply what happens.
|
||||||
|
|
||||||
|
The model demonstrates strong capabilities in:
|
||||||
|
- Structured biological explanation generation
|
||||||
|
- Logical and causal reasoning
|
||||||
|
- Chain-of-thought (ToT) reasoning in scientific contexts
|
||||||
|
- Interdisciplinary biological analysis (e.g., bioengineering, medicine, ecology)
|
||||||
|
|
||||||
|
## Uses
|
||||||
|
|
||||||
|
### 🚀 Intended Use
|
||||||
|
|
||||||
|
- Educational and scientific explanation generation
|
||||||
|
- Biological reasoning and tutoring applications
|
||||||
|
- Model interpretability research
|
||||||
|
- Training datasets for reasoning-focused LLMs
|
||||||
|
|
||||||
|
|
||||||
|
### ⚠️ Limitations
|
||||||
|
|
||||||
|
- Not a replacement for expert biological judgment
|
||||||
|
- May occasionally over-generalize or simplify complex phenomena
|
||||||
|
- Limited to reasoning quality within biological contexts (not trained for creative writing or coding)
|
||||||
|
|
||||||
|
|
||||||
|
## 🧪 Dataset: moremilk/ToT-Biology
|
||||||
|
|
||||||
|
The ToT-Biology dataset emphasizes mechanistic understanding and explanatory reasoning within biology.
|
||||||
|
It’s designed to help AI models develop interpretable, step-by-step reasoning abilities for complex biological systems.
|
||||||
|
|
||||||
|
It spans a wide range of biological subdomains:
|
||||||
|
- Foundational biology: Cell biology, genetics, evolution, and ecology
|
||||||
|
- Advanced topics: Systems biology, synthetic biology, computational biophysics
|
||||||
|
- Applied domains: Medicine, agriculture, bioengineering, and environmental science
|
||||||
|
|
||||||
|
Dataset features include:
|
||||||
|
|
||||||
|
- 🧩 Logical reasoning styles — deductive, inductive, abductive, causal, and analogical
|
||||||
|
- 🧠 Problem-solving techniques — decomposition, elimination, systems thinking, trade-off analysis
|
||||||
|
- 🔬 Real-world problem contexts — experiment design, pathway mapping, and data interpretation
|
||||||
|
- 🌍 Practical relevance — bridging theoretical reasoning and applied biological insight
|
||||||
|
- 🎓 Educational focus — for both AI training and human learning in scientific reasoning
|
||||||
|
|
||||||
|
|
||||||
|
## 🧭 Objective
|
||||||
|
|
||||||
|
This fine-tuning project aims to build an interpretable reasoning model capable of:
|
||||||
|
|
||||||
|
- Explaining biological mechanisms clearly and coherently
|
||||||
|
- Demonstrating transparent, step-by-step thought processes
|
||||||
|
- Applying logical reasoning techniques to biological and interdisciplinary problems
|
||||||
|
- Supporting educational and research use cases where reasoning transparency matters
|
||||||
|
|
||||||
|
|
||||||
|
## Citation
|
||||||
|
|
||||||
|
**BibTeX:**
|
||||||
|
```bibtex
|
||||||
|
@model{khazarai/BioGenesis-ToT,
|
||||||
|
title = {BioGenesis-ToT: A Fine-Tuned Model for Explanatory Biological Reasoning},
|
||||||
|
author = {Rustam Shiriyev},
|
||||||
|
year = {2025},
|
||||||
|
publisher = {Hugging Face},
|
||||||
|
base_model = {Qwen3-1.7B},
|
||||||
|
dataset = {moremilk/ToT-Biology},
|
||||||
|
license = {MIT}
|
||||||
|
}
|
||||||
|
```
|
||||||
BIN
benchmark/BioGenesis-ToT.png
Normal file
BIN
benchmark/BioGenesis-ToT.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 85 KiB |
Reference in New Issue
Block a user