Quick Start

Deploy Whissle Gateway

One command to run the full voice AI stack on any NVIDIA GPU:

docker run -d --gpus all -p 9000:9000 \
  -v whissle-trt:/tmp/trt_engines \
  whissleasr/whissle-gateway:standard

The gateway is ready at http://localhost:9000 after ~2 minutes (TensorRT engines build on first run, cached after).

What's Inside

Component	Details
ASR	23 languages, 440ms TTFT, TensorRT accelerated
LLM	3B params, 265 tok/s on GPU, OpenAI-compatible API
TTS	Human-quality Orpheus, EN + Hindi, 230ms TTFB
Diarization	ECAPA-TDNN speaker encoder, multi-speaker separation
Metadata	Emotion, intent, age, gender, entities per utterance

Verify

# Health check
curl http://localhost:9000/

# Quick ASR test
curl -X POST http://localhost:9000/asr/transcribe \
  -F "file=@audio.wav" -F "language=en"

Quick Start

Deploy Whissle Gateway

What's Inside

Verify

Ready to meet your personal AI?