40 lines
956 B
Markdown
40 lines
956 B
Markdown
---
|
|
pipeline_tag: text-generation
|
|
inference: false
|
|
license: apache-2.0
|
|
library_name: mlx
|
|
tags:
|
|
- language
|
|
- granite-3.3
|
|
- mlx
|
|
base_model: ibm-granite/granite-3.3-2b-instruct
|
|
---
|
|
|
|
# mlx-community/granite-3.3-2b-instruct-fp16
|
|
|
|
This model [mlx-community/granite-3.3-2b-instruct-fp16](https://huggingface.co/mlx-community/granite-3.3-2b-instruct-fp16) was
|
|
converted to MLX format from [ibm-granite/granite-3.3-2b-instruct](https://huggingface.co/ibm-granite/granite-3.3-2b-instruct)
|
|
using mlx-lm version **0.22.5**.
|
|
|
|
## Use with mlx
|
|
|
|
```bash
|
|
pip install mlx-lm
|
|
```
|
|
|
|
```python
|
|
from mlx_lm import load, generate
|
|
|
|
model, tokenizer = load("mlx-community/granite-3.3-2b-instruct-fp16")
|
|
|
|
prompt = "hello"
|
|
|
|
if tokenizer.chat_template is not None:
|
|
messages = [{"role": "user", "content": prompt}]
|
|
prompt = tokenizer.apply_chat_template(
|
|
messages, add_generation_prompt=True
|
|
)
|
|
|
|
response = generate(model, tokenizer, prompt=prompt, verbose=True)
|
|
```
|