初始化项目,由ModelHub XC社区提供模型

Model: mesabo/agri-chat-multilingual
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-02 00:10:53 +08:00
commit 3e46118f71
12 changed files with 151686 additions and 0 deletions

36
.gitattributes vendored Normal file
View File

@@ -0,0 +1,36 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
tokenizer.json filter=lfs diff=lfs merge=lfs -text

132
README.md Normal file
View File

@@ -0,0 +1,132 @@
---
license: apache-2.0
language:
- en
- fr
- es
- pt
- sw
- ja
- ar
base_model: Qwen/Qwen2-0.5B-Instruct
tags:
- agriculture
- multilingual
- chatbot
- crop-diseases
- farming
- west-africa
pipeline_tag: text-generation
---
# AgriChat Multilingual - Agricultural Assistant
A multilingual chatbot fine-tuned for agricultural assistance, specifically designed for farmers in West Africa and beyond.
## Model Description
- **Base Model:** Qwen/Qwen2-0.5B-Instruct
- **Fine-tuning Method:** LoRA (Low-Rank Adaptation)
- **Languages:** English, French, Spanish, Portuguese, Swahili, Japanese, Arabic
- **Domain:** Agricultural crop diseases, farming practices, pest management
- **License:** Apache 2.0
## Supported Languages
| Language | Code | Coverage |
|----------|------|----------|
| English | en | Full |
| French | fr | Full |
| Spanish | es | Full |
| Portuguese | pt | Full |
| Swahili | sw | Full |
| Japanese | ja | Full |
| Arabic | ar | Full |
## Use Cases
- **Crop Disease Identification:** Ask about symptoms and treatments for plant diseases
- **Farming Advice:** Get guidance on agricultural practices
- **Pest Management:** Learn about controlling pests affecting crops
- **Multilingual Support:** Communicate in 7 different languages
## Quick Start
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
# Load model
model = AutoModelForCausalLM.from_pretrained("mesabo/agri-chat-multilingual")
tokenizer = AutoTokenizer.from_pretrained("mesabo/agri-chat-multilingual")
# Chat example
messages = [
{"role": "user", "content": "How do I identify cassava mosaic disease?"}
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=256, temperature=0.7)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
```
## Training Details
- **Training Data:** 38 curated Q&A examples across 7 languages
- **Epochs:** 3
- **LoRA Parameters:** 2.16M trainable (0.44% of total)
- **Training Loss:** 2.54
- **Hardware:** NVIDIA RTX 3090 (25.3 GB)
- **Training Time:** ~17 seconds
## Covered Topics
### Crop Diseases
- Cassava mosaic disease
- Maize leaf blight
- Tomato bacterial wilt
- Cashew anthracnose
- Rice blast disease
### Farming Practices
- Organic pest control
- Soil health management
- Crop rotation benefits
- Water conservation
## Limitations
- Fine-tuned on limited agricultural domain data
- Best suited for common crop diseases in West Africa
- May not cover specialized or rare conditions
- Responses should be verified with local agricultural experts
## Intended Use
This model is designed for:
- Agricultural extension workers
- Small-scale farmers
- Agricultural education platforms
- Farming assistance applications
## Citation
```bibtex
@misc{agri-chat-multilingual,
author = {mesabo},
title = {AgriChat Multilingual - Agricultural Assistant},
year = {2024},
publisher = {Hugging Face},
url = {https://huggingface.co/mesabo/agri-chat-multilingual}
}
```
## Related Models
- [mesabo/agri-plant-disease-resnet50](https://huggingface.co/mesabo/agri-plant-disease-resnet50) - Plant disease image classification (95%+ accuracy)
## Contact
For questions or issues, please open a discussion on the model page.

5
added_tokens.json Normal file
View File

@@ -0,0 +1,5 @@
{
"<|endoftext|>": 151643,
"<|im_end|>": 151645,
"<|im_start|>": 151644
}

6
chat_template.jinja Normal file
View File

@@ -0,0 +1,6 @@
{% for message in messages %}{% if loop.first and messages[0]['role'] != 'system' %}{{ '<|im_start|>system
You are a helpful assistant.<|im_end|>
' }}{% endif %}{{'<|im_start|>' + message['role'] + '
' + message['content'] + '<|im_end|>' + '
'}}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant
' }}{% endif %}

28
config.json Normal file
View File

@@ -0,0 +1,28 @@
{
"architectures": [
"Qwen2ForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 151643,
"eos_token_id": 151645,
"hidden_act": "silu",
"hidden_size": 896,
"initializer_range": 0.02,
"intermediate_size": 4864,
"max_position_embeddings": 32768,
"max_window_layers": 24,
"model_type": "qwen2",
"num_attention_heads": 14,
"num_hidden_layers": 24,
"num_key_value_heads": 2,
"rms_norm_eps": 1e-06,
"rope_scaling": null,
"rope_theta": 1000000.0,
"sliding_window": 32768,
"tie_word_embeddings": true,
"torch_dtype": "float32",
"transformers_version": "4.52.4",
"use_cache": true,
"use_sliding_window": false,
"vocab_size": 151936
}

14
generation_config.json Normal file
View File

@@ -0,0 +1,14 @@
{
"bos_token_id": 151643,
"do_sample": true,
"eos_token_id": [
151645,
151643
],
"pad_token_id": 151643,
"repetition_penalty": 1.1,
"temperature": 0.7,
"top_k": 20,
"top_p": 0.8,
"transformers_version": "4.52.4"
}

151388
merges.txt Normal file

File diff suppressed because it is too large Load Diff

3
model.safetensors Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8ae2853c1a6cfce1519d303b28fca2dbd094a1c6fd1bbecc71af67ed9f8df1ca
size 1976163472

20
special_tokens_map.json Normal file
View File

@@ -0,0 +1,20 @@
{
"additional_special_tokens": [
"<|im_start|>",
"<|im_end|>"
],
"eos_token": {
"content": "<|im_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"pad_token": {
"content": "<|endoftext|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
}
}

3
tokenizer.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f5ae291beb14c5986404b36fdaabbcf3cb858121c6239ffbec6c1405cbafd9dc
size 11418544

50
tokenizer_config.json Normal file
View File

@@ -0,0 +1,50 @@
{
"add_prefix_space": false,
"added_tokens_decoder": {
"151643": {
"content": "<|endoftext|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151644": {
"content": "<|im_start|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"151645": {
"content": "<|im_end|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
}
},
"additional_special_tokens": [
"<|im_start|>",
"<|im_end|>"
],
"bos_token": null,
"clean_up_tokenization_spaces": false,
"eos_token": "<|im_end|>",
"errors": "replace",
"extra_special_tokens": {},
"max_length": 512,
"model_max_length": 32768,
"pad_to_multiple_of": null,
"pad_token": "<|endoftext|>",
"pad_token_type_id": 0,
"padding_side": "right",
"split_special_tokens": false,
"stride": 0,
"tokenizer_class": "Qwen2Tokenizer",
"truncation_side": "right",
"truncation_strategy": "longest_first",
"unk_token": null
}

1
vocab.json Normal file

File diff suppressed because one or more lines are too long