enginex-bi_150-llama.cpp/README.md at 8d3f9b9cb16a74abe7422af32847d6b76e4f740e - enginex-bi_150-llama.cpp - Gitea: Git with a cup of tea

EngineX-Iluvatar/enginex-bi_150-llama.cpp

Files

xiezhongtao 8d3f9b9cb1 fix(ggml-cuda): 修正CUDA编译标志和WARP_SIZE配置

更新CUDA编译标志以使用正确的fast-math和extended-lambda选项
调整WARP_SIZE为64以适配目标硬件
移除-Wmissing-noreturn警告选项
修复cudaStreamWaitEvent调用缺少参数的问题

2026-01-23 16:42:43 +08:00

11 lines

359 B

Markdown

Raw Blame History

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

 # enginex-bi_150-llama.cpp
 运行于【天数智芯-天垓150】算力卡的【文本生成】引擎，基于 llama.cpp (b7516) 引擎进行架构特别适配优化。
 ## Build Docker Image
 ```bash
 docker build -t enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150 .
 ```
 最新镜像：git.modelhub.org.cn:9443/enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150