diff --git a/README.md b/README.md
index c51168e..814991c 100644
--- a/README.md
+++ b/README.md
@@ -8,4 +8,15 @@
 docker build -t enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150 .
 ```
 
-Latest image: git.modelhub.org.cn:9443/enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150
\ No newline at end of file
+Latest image: git.modelhub.org.cn:9443/enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150
+
+Run the container
+**Note**: you must pass `--no-mmap` to disable memory mapping; otherwise the command fails with an error.
+```bash
+docker run -it --rm \
+-v :/app/models \
+--privileged \
+-e CUDA_VISIBLE_DEVICES=0 \
+git.modelhub.org.cn:9443/enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150 \
+/app/llama-cli -m /app/models/xxx.gguf --no-mmap -p "你好"
+```
\ No newline at end of file