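xiezhongtao 8d3f9b9cb1 fix(ggml-cuda): correct the CUDA compile flags and WARP_SIZE configuration
Update the CUDA compile flags to use the correct fast-math and extended-lambda options
Set WARP_SIZE to 64 to match the target hardware
Remove the -Wmissing-noreturn warning option
Fix the cudaStreamWaitEvent call that was missing an argument
2026-01-23 16:42:43 +08:00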

# enginex-bi_150-llama.cpp
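A text-generation engine for the Iluvatar CoreX (天数智芯) Tiangai 150 (天垓150) accelerator card, based on llama.cpp (b7516) with architecture-specific adaptations and optimizations.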
## Build Docker Image
```bash
docker build -t enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150 .
```
Latest image: git.modelhub.org.cn:9443/enginex-iluvatar/iluvatar-llama.cpp:b7516-bi150
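
As an alternative to building locally, the published image can presumably be pulled directly. The sketch below assembles the image reference given above and prints the commands it would run. The `run` helper and the `DRY_RUN` variable are hypothetical names introduced only for illustration. Whether the registry requires `docker login` first is not stated in this README, so treat that as an open assumption.

```bash
#!/usr/bin/env bash
# Sketch only: pull the prebuilt image instead of building it.
# Registry, repository and tag are copied from this README.
REGISTRY="git.modelhub.org.cn:9443"
REPO="enginex-iluvatar/iluvatar-llama.cpp"
TAG="b7516-bi150"
IMAGE="${REGISTRY}/${REPO}:${TAG}"

# Hypothetical helper: with DRY_RUN=1 (the default here) it only prints
# the command; with DRY_RUN=0 it actually executes it.
DRY_RUN="${DRY_RUN:-1}"
run() {
  if [ "${DRY_RUN}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Pull the published image (assumes the registry is reachable from this host).
run docker pull "${IMAGE}"

# Optionally add the short local name used by the build step above.
run docker tag "${IMAGE}" "${REPO}:${TAG}"
```

Save the block as a script and run it with `DRY_RUN=0` to execute the commands instead of printing them. Device passthrough options for starting a container on the accelerator are not covered here because this README does not specify them.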