Skip to main content
Version: Next

Kokoro-FastAPI

Run Kokoro-FastAPI locally and connect it to OpenReader using the Custom OpenAI-Like provider.

warning

For Kokoro issues and support, use the upstream repository: remsky/Kokoro-FastAPI.

Run Kokoro

CPU:

docker run --name kokoro-tts \
--restart unless-stopped \
-d \
-p 8880:8880 \
-e ONNX_NUM_THREADS=8 \
-e ONNX_INTER_OP_THREADS=4 \
-e ONNX_EXECUTION_MODE=parallel \
-e ONNX_OPTIMIZATION_LEVEL=all \
-e ONNX_MEMORY_PATTERN=true \
-e ONNX_ARENA_EXTEND_STRATEGY=kNextPowerOfTwo \
-e API_LOG_LEVEL=DEBUG \
ghcr.io/remsky/kokoro-fastapi-cpu:v0.2.4

GPU (NVIDIA):

docker run --name kokoro-tts \
--restart unless-stopped \
-d \
--gpus all \
--user 1001:1001 \
-p 8880:8880 \
-e USE_GPU=true \
-e PYTHONUNBUFFERED=1 \
-e API_LOG_LEVEL=DEBUG \
ghcr.io/remsky/kokoro-fastapi-gpu:v0.2.4

Connect to OpenReader

Recommended (auth + admin): Settings → Admin → Shared providers

  1. Add a shared provider with type custom-openai.
  2. Set base URL to http://host.docker.internal:8880/v1.
  3. Leave API key blank unless required by your deployment.
  4. Set default model to Kokoro.

Legacy bootstrap seed (optional, first boot only):

API_BASE=http://host.docker.internal:8880/v1

Use host.docker.internal so the OpenReader container reaches Kokoro's published port on your host. The container name (kokoro-tts) only resolves if OpenReader and Kokoro share a Docker network, i.e. you started them with Docker Compose, --link kokoro-tts, or a shared --network. On native Linux Docker, host.docker.internal needs --add-host=host.docker.internal:host-gateway on the OpenReader container. Note that localhost/127.0.0.1 will not work, since inside the container that points at the container itself.

Users select the configured shared provider, model, and voice from Settings → TTS Provider.

References