Files
joke-finetome-model-gguf-ph…/inference/llama_cli_examples.md
ModelHub XC 2c2240df42 初始化项目,由ModelHub XC社区提供模型
Model: Mathieu-Thomas-JOSSET/joke-finetome-model-gguf-phi4-20260112-081758
Source: Original Platform
2026-04-11 12:30:59 +08:00

238 B

Local inference (llama.cpp)

llama-cli -hf {REPO_ID}:q8_0 -cnv --chat-template phi4

Server (OpenAI-compatible)

llama-server -hf {REPO_ID}:q8_0
# /v1/chat/completions will be available (OpenAI-compatible)