commit bc758af904ec8ac1644f91516ff6352d70bde6fc Author: ModelHub XC Date: Mon Apr 20 02:58:05 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: Rustamshry/BioGenesis-ToT-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..f6a4483 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,36 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +BioGenesis-ToT-f16.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/BioGenesis-ToT-f16.gguf b/BioGenesis-ToT-f16.gguf new file mode 100644 index 0000000..fa9e26b --- /dev/null +++ b/BioGenesis-ToT-f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:871698edf28ac8c3ee692874fb142e2750c3aca66a60d1a046088a13fa24b6d2 +size 3447349472 diff --git a/README.md b/README.md new file mode 100644 index 0000000..eae47b4 --- /dev/null +++ b/README.md @@ -0,0 +1,100 @@ +--- +license: apache-2.0 +language: +- en +metrics: +- accuracy +base_model: +- khazarai/BioGenesis-ToT +pipeline_tag: text-generation +tags: +- biology +- medical +- science +- unsloth +- sft +--- + +# Model Card for BioGenesis-ToT + + +![alt="General Benchmark Comparison Chart"](benchmark/BioGenesis-ToT.png) + +- **Overall Success Rate**: + - khazarai/BioGenesis-ToT: **51.45** + - Qwen/Qwen3-1.7B: **46.82** + +- **Benchmark**: [emre/TARA_Turkish_LLM_Benchmark](https://huggingface.co/datasets/emre/TARA_Turkish_LLM_Benchmark) + + +GGUF version of https://huggingface.co/khazarai/BioGenesis-ToT + +BioGenesis-ToT is a fine-tuned version of Qwen3-1.7B, optimized for mechanistic reasoning and explanatory understanding in biology. +This model has been trained on the [moremilk/ToT-Biology](https://huggingface.co/datasets/moremilk/ToT-Biology) dataset — a reasoning-rich collection of biology questions emphasizing why and how processes occur, rather than simply what happens. + +The model demonstrates strong capabilities in: +- Structured biological explanation generation +- Logical and causal reasoning +- Chain-of-thought (ToT) reasoning in scientific contexts +- Interdisciplinary biological analysis (e.g., bioengineering, medicine, ecology) + +## Uses + +### 🚀 Intended Use + +- Educational and scientific explanation generation +- Biological reasoning and tutoring applications +- Model interpretability research +- Training datasets for reasoning-focused LLMs + + +### ⚠️ Limitations + +- Not a replacement for expert biological judgment +- May occasionally over-generalize or simplify complex phenomena +- Limited to reasoning quality within biological contexts (not trained for creative writing or coding) + + +## 🧪 Dataset: moremilk/ToT-Biology + +The ToT-Biology dataset emphasizes mechanistic understanding and explanatory reasoning within biology. +It’s designed to help AI models develop interpretable, step-by-step reasoning abilities for complex biological systems. + +It spans a wide range of biological subdomains: +- Foundational biology: Cell biology, genetics, evolution, and ecology +- Advanced topics: Systems biology, synthetic biology, computational biophysics +- Applied domains: Medicine, agriculture, bioengineering, and environmental science + +Dataset features include: + +- 🧩 Logical reasoning styles — deductive, inductive, abductive, causal, and analogical +- 🧠 Problem-solving techniques — decomposition, elimination, systems thinking, trade-off analysis +- 🔬 Real-world problem contexts — experiment design, pathway mapping, and data interpretation +- 🌍 Practical relevance — bridging theoretical reasoning and applied biological insight +- 🎓 Educational focus — for both AI training and human learning in scientific reasoning + + +## 🧭 Objective + +This fine-tuning project aims to build an interpretable reasoning model capable of: + +- Explaining biological mechanisms clearly and coherently +- Demonstrating transparent, step-by-step thought processes +- Applying logical reasoning techniques to biological and interdisciplinary problems +- Supporting educational and research use cases where reasoning transparency matters + + +## Citation + +**BibTeX:** +```bibtex +@model{khazarai/BioGenesis-ToT, + title = {BioGenesis-ToT: A Fine-Tuned Model for Explanatory Biological Reasoning}, + author = {Rustam Shiriyev}, + year = {2025}, + publisher = {Hugging Face}, + base_model = {Qwen3-1.7B}, + dataset = {moremilk/ToT-Biology}, + license = {MIT} +} +``` \ No newline at end of file diff --git a/benchmark/BioGenesis-ToT.png b/benchmark/BioGenesis-ToT.png new file mode 100644 index 0000000..cac7c52 Binary files /dev/null and b/benchmark/BioGenesis-ToT.png differ