Question 1

What is Whissle?

Accepted Answer

Whissle is a personal AI assistant that combines real-time voice intelligence, deep research, live call coaching, smart notes, and daily briefings. It is available on the web (lulu.whissle.ai), as a macOS desktop app, and as an API for developers.

Question 2

What metadata does the Whissle Speech-to-Text API extract?

Accepted Answer

Beyond transcription, Whissle STT extracts rich metadata in a single pass — intent detection, emotion recognition, named entity recognition (NER), speaker diarization, age and gender estimation, and punctuation. The same metadata is also available for text input via our Text Intelligence API.
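As a rough illustration of how this kind of per-segment metadata might be consumed, here is a minimal TypeScript sketch. The `Segment` shape, its field names, and the sample values are all hypothetical and chosen only for illustration. They are not the documented Whissle response schema, so check the API reference for the real field names.

```typescript
// Hypothetical shape of one transcribed segment. Field names are
// illustrative assumptions, NOT the documented Whissle response schema.
interface Entity {
  type: string;
  text: string;
}

interface Segment {
  text: string;
  speaker: string;
  intent: string;
  emotion: string;
  entities: Entity[];
}

// Count how often each emotion label appears for each speaker.
function emotionCountsBySpeaker(
  segments: Segment[]
): Map<string, Map<string, number>> {
  const result = new Map<string, Map<string, number>>();
  for (const seg of segments) {
    const counts = result.get(seg.speaker) ?? new Map<string, number>();
    counts.set(seg.emotion, (counts.get(seg.emotion) ?? 0) + 1);
    result.set(seg.speaker, counts);
  }
  return result;
}

// Made-up sample data, only to exercise the helper.
const sampleSegments: Segment[] = [
  {
    text: "I'd like to cancel my order.",
    speaker: "speaker_0",
    intent: "cancel_order",
    emotion: "frustrated",
    entities: [],
  },
  {
    text: "Sure, I can help with that.",
    speaker: "speaker_1",
    intent: "acknowledge",
    emotion: "neutral",
    entities: [],
  },
  {
    text: "It was placed on Monday.",
    speaker: "speaker_0",
    intent: "provide_info",
    emotion: "neutral",
    entities: [{ type: "DATE", text: "Monday" }],
  },
];

const emotionCounts = emotionCountsBySpeaker(sampleSegments);
```

Because transcription and metadata arrive together, a helper like this can run directly on the result without calling a second service for emotion or intent.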

Question 3

How does Whissle compare to other speech-to-text APIs?

Accepted Answer

Whissle's META-1 model performs transcription and metadata extraction simultaneously in a single pass, unlike traditional pipelines that require separate models for each task. This results in lower latency, lower cost, and richer output — all from one API call.

Question 4

Is Whissle free to use?

Accepted Answer

Yes — the personal AI assistant at lulu.whissle.ai is free to use. The Speech-to-Text and Intelligence APIs have usage-based pricing starting at $0.003 per minute. Self-hosting via Docker is also available at no cost.
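To get a feel for what usage-based pricing means in practice, here is a small TypeScript sketch that estimates a lower bound on API cost from the quoted starting rate. The function name and the assumption that billing scales linearly per minute are mine. Actual rates may be higher or tiered, since $0.003 is only the starting price.

```typescript
// Starting rate quoted in the answer above, in USD per audio minute.
const STARTING_RATE_USD_PER_MINUTE = 0.003;

// Lower-bound cost estimate, assuming simple linear per-minute billing.
// "Starting at" means the real rate for a given plan may be higher.
function estimateMinimumCostUsd(
  audioMinutes: number,
  ratePerMinute: number = STARTING_RATE_USD_PER_MINUTE
): number {
  if (!Number.isFinite(audioMinutes) || audioMinutes < 0) {
    throw new RangeError("audioMinutes must be a non-negative number");
  }
  return audioMinutes * ratePerMinute;
}

// Example: 10 hours of audio is 600 minutes.
console.log(estimateMinimumCostUsd(600).toFixed(2)); // prints "1.80"
```

So ten hours of audio would cost at least about $1.80 at the starting rate.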

Question 5

Can I self-host Whissle?

Accepted Answer

Absolutely. Whissle provides a full Docker Compose setup that runs the frontend, gateway (ASR + agent + proxy), and backend locally. It replaces cloud dependencies with SQLite and local storage, and requires only 16 GB of RAM and a Gemini API key.

Question 6

What is Live Assist / call coaching?

Accepted Answer

Live Assist provides real-time AI coaching during phone calls or meetings. It listens to the conversation, detects intent and emotion, and surfaces contextual suggestions, key points, and action items with low latency as the conversation unfolds.

Question 7

What languages does Whissle support?

Accepted Answer

Whissle currently offers high-accuracy English models (300M and 600M parameter variants). Spanish, German, French, and other major languages are supported through the 1.1B parameter model. Translation covers any-to-any language pairs.

Question 8

How do I integrate Whissle into my application?

Accepted Answer

You can integrate via the JavaScript SDK (npm install @whissle/sdk), the REST API (api.whissle.ai), or self-host using Docker. The SDK provides streaming ASR, voice intelligence, and behavioral profiling out of the box for web and Node.js applications.

Frequently Asked Questions