ModelHub XC 277ce53351 Initialize project; model provided by the ModelHub XC community
Model: mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16
Source: Original Platform
2026-05-11 15:49:34 +08:00

library_name: mlx
license: other
license_name: nvidia-open-model-license
license_link: https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
pipeline_tag: text-generation
language: en
tags: nvidia, llama-3, pytorch, mlx
base_model: nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
datasets: nvidia/Llama-Nemotron-Post-Training-Dataset

mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16

The model mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16 was converted to MLX format from nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1 using mlx-lm version 0.25.0.

Use with mlx

pip install mlx-lm

from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16")

prompt = "hello"

if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
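
The snippet above sends a single user message. As a minimal sketch of how the same flow could be wrapped for reuse, the example below adds an optional system message and a cap on generated tokens. The helper names `build_messages` and `run`, the example system prompt, and the default of 256 tokens are hypothetical and not part of the original README; passing `max_tokens` to `generate` is assumed to be supported by the installed mlx-lm version.

```python
def build_messages(user_prompt, system_prompt=None):
    """Build a chat-style message list (hypothetical helper).

    The system entry is included only when a system prompt is given.
    """
    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return messages


def run(user_prompt, system_prompt=None, max_tokens=256):
    """Load the model and generate a reply (hypothetical wrapper)."""
    # Imported lazily so build_messages can be used without mlx-lm installed.
    from mlx_lm import load, generate

    model, tokenizer = load("mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16")

    if tokenizer.chat_template is not None:
        # Same chat-template path as the README snippet, with the
        # optional system message prepended.
        prompt = tokenizer.apply_chat_template(
            build_messages(user_prompt, system_prompt),
            add_generation_prompt=True,
        )
    else:
        # No chat template available: fall back to the raw prompt.
        prompt = user_prompt

    # max_tokens is assumed to be accepted as a keyword argument here.
    return generate(
        model, tokenizer, prompt=prompt, max_tokens=max_tokens, verbose=True
    )
```

Calling `run("hello", system_prompt="You are a concise assistant.")` would download the weights on first use and requires Apple silicon with mlx-lm installed; `build_messages` on its own has no such requirement.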