---
language: en
license: apache-2.0
library_name: mlx
pipeline_tag: text-generation
tags:
- text-generation
- mlx
- apple-silicon
base_model: YoAbriel/KodaLite-1.3B
---

# KodaLite-1.3B - MLX (fp16)

MLX version of [YoAbriel/KodaLite-1.3B](https://huggingface.co/YoAbriel/KodaLite-1.3B), optimized for Apple Silicon (M1/M2/M3/M4).

**Size**: ~2.5 GB | **Precision**: bfloat16

## Usage

Install `mlx-lm`:

```bash
pip install mlx-lm
```

Then load the model and generate from Python:

```python
from mlx_lm import load, generate

# Download (or reuse the cached copy of) the model and its tokenizer.
model, tok = load("YoAbriel/KodaLite-1.3B-mlx")

# Format the conversation with the model's chat template.
prompt = tok.apply_chat_template(
    [{"role": "user", "content": "What is the capital of France?"}],
    tokenize=False,
    add_generation_prompt=True,
)

print(generate(model, tok, prompt=prompt, max_tokens=80))
```

Or from the command line:

```bash
mlx_lm.generate --model YoAbriel/KodaLite-1.3B-mlx \
  --prompt "<|user|>\nHello\n<|assistant|>\n" --max-tokens 80
```

## Other quantizations

- [YoAbriel/KodaLite-1.3B-mlx-8bit](https://huggingface.co/YoAbriel/KodaLite-1.3B-mlx-8bit): 1.4 GB, 8-bit

## Limitations

This is a small model (1.27B parameters) and it is undertrained (1.64B training tokens). See the [base model card](https://huggingface.co/YoAbriel/KodaLite-1.3B) for full details.

## License

Apache 2.0
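
## Checking the quoted sizes

The sizes quoted in this card follow roughly from the parameter count, and the short sketch below shows the arithmetic. It is not part of the model's tooling: `weight_size_gb` is a hypothetical helper, and it assumes decimal gigabytes (1 GB = 1e9 bytes) and that every parameter is stored at the stated bit width.

```python
def weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight size in decimal gigabytes (assumes 1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Parameter count quoted in the Limitations section of this card.
N_PARAMS = 1.27e9

# 16-bit weights (bfloat16): 2 bytes per parameter.
print(f"16-bit: {weight_size_gb(N_PARAMS, 16):.2f} GB")

# 8-bit weights: 1 byte per parameter, before any quantization metadata.
print(f"8-bit:  {weight_size_gb(N_PARAMS, 8):.2f} GB")
```

The 16-bit figure comes out at about 2.54 GB, in line with the ~2.5 GB quoted here. The 8-bit estimate of about 1.27 GB sits below the 1.4 GB listed for the 8-bit variant. The gap is probably quantization metadata (scales and offsets) plus any layers kept at higher precision, though the card does not confirm this.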