mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16

ModelHub XC 277ce53351 Initialize project; model provided by the ModelHub XC community

Model: mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16
Source: Original Platform

2026-05-11 15:49:34 +08:00

1.2 KiB

library_name, license, license_name, license_link, pipeline_tag, language, tags, base_model, datasets

mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16

This model, mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16, was converted to MLX format from nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1 using mlx-lm version 0.25.0.

Use with mlx

pip install mlx-lm

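from mlx_lm import load, generate

# Download (if needed) and load the converted weights and the tokenizer.
model, tokenizer = load("mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16")

prompt = "hello"

# Wrap the prompt in the model's chat template when the tokenizer defines one.
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

# verbose=True prints the generated text as it is produced.
response = generate(model, tokenizer, prompt=prompt, verbose=True)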
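If you call the model from more than one place, the chat-template step above can be factored into a small helper. The following is a minimal sketch, not part of the original model card: the names `build_prompt` and `run`, the optional `system_text` argument, and the `max_tokens` value are all assumptions made for illustration. `mlx_lm` is imported lazily inside `run` so that the prompt-building logic can be exercised without loading the model.

```python
from typing import Any, Optional


def build_prompt(tokenizer: Any, user_text: str, system_text: Optional[str] = None):
    """Return a prompt suitable for mlx_lm.generate (hypothetical helper).

    Falls back to the raw text when the tokenizer has no chat template,
    mirroring the check in the snippet above.
    """
    if getattr(tokenizer, "chat_template", None) is None:
        return user_text

    messages = []
    # The system message is optional and is an assumption of this sketch.
    if system_text is not None:
        messages.append({"role": "system", "content": system_text})
    messages.append({"role": "user", "content": user_text})
    return tokenizer.apply_chat_template(messages, add_generation_prompt=True)


def run(user_text: str, system_text: Optional[str] = None, max_tokens: int = 256) -> str:
    """Load the model and generate a reply (hypothetical wrapper)."""
    # Imported here so build_prompt can be used without mlx-lm installed.
    from mlx_lm import load, generate

    model, tokenizer = load("mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16")
    prompt = build_prompt(tokenizer, user_text, system_text)
    return generate(model, tokenizer, prompt=prompt, max_tokens=max_tokens)
```

Calling `run("hello")` should behave like the snippet above without the verbose output. Note that it reloads the model on every call, so a longer-lived program would probably load once and reuse `model` and `tokenizer`.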
