How Does Text to Speech Work?
The Web Speech API runs synthesis in your browser. Clicking Play triggers the browser's SpeechSynthesis interface, which reads the text using the voices your browser exposes, typically those installed on your operating system. With a locally installed voice, your text never leaves your device and no audio is processed on a remote server; some browsers also list network-backed voices, which send the text to a remote service for synthesis. The SpeechSynthesisUtterance object exposes three independent controls: rate (0.1–10×), pitch (0–2), and volume (0–1). In our testing, Chrome on Windows 11 lists 18 English voice options with regional accents, including US, UK, and Australian. Chrome on macOS surfaces Apple's high-quality neural voices instead. Voice availability varies by browser version and OS; the selector shows only the voices your device actually reports. With local voices there are no server calls, which means no rate limits, no quotas, and no account required.
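For readers who want to see roughly what happens when Play is clicked, here is a minimal TypeScript sketch. It is not this tool's actual code: the names VoiceInfo, SpeechSettings, clampSettings, voicesFor, and speak are hypothetical, and the choice to prefer on-device voices (those whose localService flag is true) is an assumption made for illustration. Only speechSynthesis, SpeechSynthesisUtterance, getVoices, and the rate, pitch, volume, voice, lang, and localService properties come from the Web Speech API itself.

```typescript
// Minimal shape of the fields we read from a SpeechSynthesisVoice.
interface VoiceInfo {
  name: string;
  lang: string;          // BCP 47 tag, e.g. "en-US"
  localService: boolean; // true when the voice is synthesized on-device
}

// Hypothetical settings object mirroring the three utterance controls.
interface SpeechSettings {
  rate: number;
  pitch: number;
  volume: number;
}

const clamp = (value: number, min: number, max: number): number =>
  Math.min(max, Math.max(min, value));

// Keep each setting inside the range the Web Speech API defines.
function clampSettings(s: SpeechSettings): SpeechSettings {
  return {
    rate: clamp(s.rate, 0.1, 10),
    pitch: clamp(s.pitch, 0, 2),
    volume: clamp(s.volume, 0, 1),
  };
}

// Voices matching a language prefix, with on-device voices listed first.
function voicesFor(voices: VoiceInfo[], langPrefix: string): VoiceInfo[] {
  const prefix = langPrefix.toLowerCase();
  return voices
    .filter((v) => v.lang.toLowerCase().startsWith(prefix))
    .sort((a, b) => Number(b.localService) - Number(a.localService));
}

// Browser-only: speaks the text and returns false where the API is missing.
function speak(text: string, settings: SpeechSettings, langPrefix = "en"): boolean {
  // Accessed through globalThis so the sketch also compiles outside a browser.
  const g = globalThis as any;
  if (!g.speechSynthesis || !g.SpeechSynthesisUtterance) return false;

  const utterance = new g.SpeechSynthesisUtterance(text);
  Object.assign(utterance, clampSettings(settings));

  // getVoices() can return an empty list until the voiceschanged event fires.
  const [voice] = voicesFor(g.speechSynthesis.getVoices(), langPrefix);
  if (voice) utterance.voice = voice;

  g.speechSynthesis.speak(utterance);
  return true;
}
```

In a browser, a call such as speak("Hello there", { rate: 1.2, pitch: 1, volume: 0.8 }) would read the text aloud with the first matching English voice. Checking localService before choosing a voice is one way to keep the text on the device, since a voice with localService set to false may be synthesized by a remote service.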