2025-09-12 11:39:55 +08:00
|
|
|
|
# Kokoro-TTS
|
|
|
|
|
|
|
|
|
|
|
|
本项目基于 **Kokoro** 模型封装,提供简洁的 Docker 部署方式,支持 **SSML 输入**,输出 **PCM 原始音频**,可用于语音合成。
|
|
|
|
|
|
|
|
|
|
|
|
---
|
|
|
|
|
|
|
|
|
|
|
|
## Quickstart
|
|
|
|
|
|
|
|
|
|
|
|
### 1. 安装镜像
|
|
|
|
|
|
```bash
|
|
|
|
|
|
docker build -t tts:kokoro . -f Dockerfile_kokoro
|
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
|
|
### 2. 启动服务
|
|
|
|
|
|
```bash
|
|
|
|
|
|
metax-docker run -it --rm \
|
|
|
|
|
|
-v /models/Kokoro-82M-v1.1-zh:/mnt/models \
|
|
|
|
|
|
--gpus=[2] \
|
|
|
|
|
|
-p 8080:80 \
|
|
|
|
|
|
-e MODEL_DIR=/mnt/models \
|
|
|
|
|
|
-e MODEL_NAME=kokoro-v1_1-zh.pth \
|
|
|
|
|
|
tts:kokoro
|
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
|
|
参数说明:
|
|
|
|
|
|
- `MODEL_DIR`:模型所在目录(挂载到容器内 `/mnt/models`)
|
|
|
|
|
|
- `MODEL_NAME`:加载的模型文件名(通常为 `.safetensors`)
|
|
|
|
|
|
- `-p 8080:80`:将容器内服务端口映射到宿主机 `8080`
|
|
|
|
|
|
|
|
|
|
|
|
### 3. 测试服务
|
|
|
|
|
|
```bash
|
|
|
|
|
|
curl --request POST "http://localhost:8080/tts" \
|
|
|
|
|
|
--header 'Content-Type: application/ssml+xml' \
|
|
|
|
|
|
--header 'User-Agent: curl' \
|
|
|
|
|
|
--data-raw '<speak version="1.0" xml:lang="zh">
|
|
|
|
|
|
<voice xml:lang="zh" xml:gender="Female" name="zh">
|
|
|
|
|
|
今天天气很好,不知道明天天气怎么样。
|
|
|
|
|
|
</voice>
|
|
|
|
|
|
</speak>' \
|
|
|
|
|
|
--output sound.pcm
|
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
|
|
---
|
|
|
|
|
|
|
2025-09-12 15:42:17 +08:00
|
|
|
|
- Patch: 修复torch.istft 复数运算出错问题 ,修复 GPU “打字机”噪声
|