xiezhongtao 36a77d3318 docs: 添加容器运行说明及注意事项
添加运行容器的命令示例,并强调必须使用 `--no-mmap` 参数以避免错误
2026-01-23 16:47:14 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00
2026-01-23 11:34:20 +08:00

enginex-bi_150-llama.cpp

运行于【天数智芯-天垓150】算力卡的【文本生成】引擎基于 llama.cpp (b7516) 引擎进行架构特别适配优化。

Build Docker Image

docker build -t enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150 .

最新镜像git.modelhub.org.cn:9443/enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150

运行容器 注意:必须使用 --no-mmap 参数关闭内存映射,否则会报错

docker run -it --rm \
-v <model_dir>:/app/models \
--privileged \
-e CUDA_VISIBLE_DEVICES=0 \
git.modelhub.org.cn:9443/enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150
/app/llama-cli -m /app/models/xxx.gguf --no-mmap -p "你好"
Description
运行于【天数智芯-天垓150】算力卡的【文本生成】引擎,基于 llama.cpp 引擎进行架构特别适配优化。
Readme MIT 26 MiB
Languages
C++ 56.1%
C 12.6%
Python 7.9%
Cuda 6.5%
HTML 4.6%
Other 12.2%