Voices
A voice is the speaker identity used to synthesize audio. Every text-to-speech request targets a specific voice via its voice_id. Voices come from three sources:
- Public catalog: curated voices that ship with Breeze. List them with GET /v1/voices.
- Designed voices: created from a text prompt with POST /v1/voice-previews/design and finalized with POST /v1/voice-previews/{generated_voice_id}/save.
- Cloned voices: created from audio samples with POST /v1/voice-previews/clone, previewed via GET /v1/voice-previews/{generated_voice_id}/stream, and finalized with POST /v1/voice-previews/{generated_voice_id}/save.
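The design-then-save flow for designed voices can be sketched as two request builders. This is a minimal sketch, not the official client: the host in BASE_URL, the Bearer authorization header, and the body fields "prompt" and "name" are assumptions made for illustration. Only the two endpoint paths come from the list above.

```python
import json
import urllib.request
from urllib.parse import quote

# Hypothetical placeholder host; substitute the real Breeze API base URL.
BASE_URL = "https://api.breeze.example"


def _post_json(path, payload, api_key):
    """Build (but do not send) a JSON POST request."""
    return urllib.request.Request(
        BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


def build_design_request(prompt, api_key):
    """Request a voice preview from a text prompt ("prompt" field is assumed)."""
    return _post_json("/v1/voice-previews/design", {"prompt": prompt}, api_key)


def build_save_request(generated_voice_id, name, api_key):
    """Finalize a preview as a saved voice ("name" field is assumed)."""
    path = f"/v1/voice-previews/{quote(generated_voice_id, safe='')}/save"
    return _post_json(path, {"name": name}, api_key)
```

Passing either request to urllib.request.urlopen would send it. The builders are kept separate from sending so the URL and payload can be inspected first.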
voice_type="default" covers the official Breeze catalog, and voice_type="personal" covers user-saved voices. Voice categories are premade, generated, and cloned.
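As an illustration of how voice_type might be used when listing voices, the sketch below builds a URL for GET /v1/voices. It assumes voice_type is accepted as a query parameter, which this page does not confirm, and the host in BASE_URL is a hypothetical placeholder.

```python
from urllib.parse import urlencode

# Hypothetical placeholder host; substitute the real Breeze API base URL.
BASE_URL = "https://api.breeze.example"

# The two voice_type values described above.
VOICE_TYPES = ("default", "personal")


def list_voices_url(voice_type=None):
    """Build the URL for GET /v1/voices, optionally filtered by voice_type.

    Sending voice_type as a query parameter is an assumption.
    """
    url = BASE_URL + "/v1/voices"
    if voice_type is None:
        return url
    if voice_type not in VOICE_TYPES:
        raise ValueError(f"voice_type must be one of {VOICE_TYPES}")
    return url + "?" + urlencode({"voice_type": voice_type})
```

Rejecting unknown values locally keeps a typo from turning into a confusing server-side error.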
Voice settings
Each voice stores default voice_settings. Override per call by passing voice_settings in the request body.
guidance_scale adjusts how strongly generation follows the prompt and reference voice.
Update a voice's persisted defaults with PATCH /v1/voices/{voice_id}/settings.
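The two ways of setting voice_settings, per call and persisted, can be sketched as follows. This is a sketch under stated assumptions: the host in BASE_URL, the Bearer header, the "text" field in the example body, the value 2.5, and sending the settings object directly as the PATCH body are all illustrative guesses. Only the voice_settings and guidance_scale names and the PATCH path come from this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Hypothetical placeholder host; substitute the real Breeze API base URL.
BASE_URL = "https://api.breeze.example"


def with_voice_settings(body, **overrides):
    """Return a copy of a request body with per-call voice_settings attached."""
    updated = dict(body)
    updated["voice_settings"] = dict(overrides)
    return updated


def build_settings_patch(voice_id, settings, api_key):
    """Build (but do not send) a PATCH that updates a voice's stored defaults.

    Sending the settings object as the whole JSON body is an assumption.
    """
    return urllib.request.Request(
        f"{BASE_URL}/v1/voices/{quote(voice_id, safe='')}/settings",
        data=json.dumps(settings).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="PATCH",
    )


# Per-call override: affects only the request that carries this body.
body = with_voice_settings({"text": "Hello"}, guidance_scale=2.5)
```

A per-call override leaves the stored defaults untouched, so it suits one-off adjustments. The PATCH request is the better fit when every later call should pick up the new value.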
Browsing voices
Use the voice library to browse the public catalog and preview samples.