Skip to content

Expose Chatterbox TTS model arguments on the Chatterbox backend #5979

@rampa3

Description

@rampa3

Is your feature request related to a problem? Please describe.

Chatterbox TTS as a voice cloning TTS posses parameters which need to be provided together with the audio prompt for tuning the output voice produced. These parameters are: exageration, temperature and cfg_weight. Without access to them, tuning the output is impossible.

Describe the solution you'd like

I would like to propose exposing these parameters on the API or at least in model file to allow adjusting them on voice by voice basis.

Describe alternatives you've considered

Finding the best preset for most of user's voices using external Chatterbox install, and building the backend with this preset from source. (Applicable only for advanced users.)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions