Voices

A voice is the speaker identity used to synthesize audio. Every text-to-speech request targets a specific voice via its voice_id. Voices come from three sources:

  • voice_type="default": the official Breeze catalog.
  • voice_type="personal": voices saved by the user.

Each voice also has a category: premade, generated, or cloned.
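As a rough illustration, the voice_type values and categories above can be used to narrow a list of voices on the client side. This is only a sketch: the record layout, the voice_id values, and the `category` field name are assumptions made for the example, not the documented response format.

```python
# Hypothetical voice records. Only the voice_type values and the category
# names come from the text above; the field layout and IDs are made up.
voices = [
    {"voice_id": "v_001", "voice_type": "default", "category": "premade"},
    {"voice_id": "v_002", "voice_type": "personal", "category": "cloned"},
    {"voice_id": "v_003", "voice_type": "personal", "category": "generated"},
]


def filter_voices(records, voice_type=None, category=None):
    """Return the records that match the given voice_type and/or category.

    A filter left as None is ignored.
    """
    return [
        r
        for r in records
        if (voice_type is None or r["voice_type"] == voice_type)
        and (category is None or r["category"] == category)
    ]


# Collect the IDs of all user-saved voices.
personal_ids = [v["voice_id"] for v in filter_voices(voices, voice_type="personal")]
print(personal_ids)
```

If the API supports filtering by voice_type on the server side, that would normally be preferable to filtering after the fact.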

Voice settings

Each voice stores default voice_settings. To override them for a single call, pass voice_settings in the request body.

  • guidance_scale adjusts how strongly generation follows the prompt and reference voice.
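A per-call override might look like the sketch below, which only builds the JSON body and does not send anything. The `text` and `voice_id` field names, the helper name `build_tts_payload`, and the value 3.0 are assumptions for illustration; only `voice_settings` and `guidance_scale` come from the description above, and the valid range for guidance_scale is not specified here.

```python
import json


def build_tts_payload(text, voice_id, voice_settings=None):
    """Build a request body for a text-to-speech call (hypothetical layout).

    When voice_settings is omitted, the key is left out so the voice's
    stored defaults apply.
    """
    payload = {"text": text, "voice_id": voice_id}  # assumed field names
    if voice_settings:
        # Copy so later changes to the caller's dict do not leak in.
        payload["voice_settings"] = dict(voice_settings)
    return payload


# Override guidance_scale for this call only; 3.0 is a placeholder value.
payload = build_tts_payload(
    "Hello from Breeze.", "v_001", {"guidance_scale": 3.0}
)
body = json.dumps(payload)
print(body)
```

Leaving voice_settings out of the body, rather than sending an empty object, keeps it unambiguous that the stored defaults should be used.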

Update a voice's persisted defaults with PATCH /v1/voices/{voice_id}/settings.
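The following sketch prepares such a PATCH request with Python's standard library without sending it. The base URL, the Bearer authorization header, and the shape of the JSON body (settings at the top level) are assumptions; only the path /v1/voices/{voice_id}/settings and the guidance_scale setting come from this page.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://api.example.com"  # hypothetical host


def build_settings_patch(voice_id, settings, api_key):
    """Prepare (but do not send) a PATCH request for a voice's defaults."""
    # Percent-encode the ID so unusual characters cannot alter the path.
    url = f"{BASE_URL}/v1/voices/{quote(voice_id, safe='')}/settings"
    data = json.dumps(settings).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        method="PATCH",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
    )


req = build_settings_patch("v_001", {"guidance_scale": 3.0}, "YOUR_API_KEY")
print(req.get_method(), req.full_url)
```

Passing the prepared request to `urllib.request.urlopen(req)` would send it. Unlike a per-call override, this changes the defaults used by every later request that targets the voice.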

Browsing voices

Use the voice library to browse the public catalog and preview samples.