
Max New Tokens (Audio Length)

Controls the maximum length of the generated audio (more tokens = longer audio).

Range: 860 to 3072

CFG Scale (Guidance Strength)

Higher values increase adherence to the text prompt.

Range: 1 to 5
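As a rough illustration of what a guidance scale does, the sketch below uses one common classifier-free guidance formulation: extrapolate from the unconditional logits toward the text-conditioned logits. The function name `apply_cfg` and the exact formula are assumptions for illustration; the model behind this interface may combine the two sets of logits differently.

```python
def apply_cfg(cond_logits, uncond_logits, cfg_scale):
    """Blend conditional and unconditional logits (hypothetical helper).

    Assumed formulation: uncond + scale * (cond - uncond).
    A scale of 1.0 returns the conditional logits unchanged; larger
    values push the result further toward the text-conditioned side.
    """
    if len(cond_logits) != len(uncond_logits):
        raise ValueError("logit lists must have the same length")
    return [u + cfg_scale * (c - u) for c, u in zip(cond_logits, uncond_logits)]


# Example: the gap between the two tokens widens as the scale grows.
guided = apply_cfg([2.0, 0.0], [1.0, 1.0], cfg_scale=3.0)
```

Under this formulation, raising the scale amplifies whatever the prompt changed relative to the unconditioned prediction, which is why higher values track the text more closely.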

Temperature (Randomness)

Lower values make the output more deterministic; higher values increase randomness.

Range: 1 to 1.5
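The effect of temperature can be sketched with a plain softmax in which the logits are divided by the temperature before normalizing. This is the standard textbook formulation, not code taken from this application; `softmax_with_temperature` is a hypothetical name.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Convert logits to probabilities after scaling by 1 / temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Example: the same logits at the two ends of the slider range.
probs_low = softmax_with_temperature([2.0, 1.0, 0.0], temperature=1.0)
probs_high = softmax_with_temperature([2.0, 1.0, 0.0], temperature=1.5)
```

At the higher temperature the most likely token loses probability mass and the least likely token gains some, so sampling becomes less predictable.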

Top P (Nucleus Sampling)

Restricts sampling to the smallest set of most likely tokens whose cumulative probability reaches P.

Range: 0.8 to 1
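A minimal sketch of nucleus (top-p) filtering over a list of probabilities follows. The helper name `top_p_filter` is hypothetical, and real implementations usually operate on tensors of logits rather than Python lists.

```python
def top_p_filter(probs, p):
    """Keep the most likely tokens until their cumulative mass reaches p.

    Tokens outside that set get probability 0; the kept tokens are
    renormalized so the result sums to 1.
    """
    if not 0.0 < p <= 1.0:
        raise ValueError("p must be in (0, 1]")
    # Token indices sorted from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept = []
    cumulative = 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    total = sum(probs[i] for i in kept)
    filtered = [0.0] * len(probs)
    for i in kept:
        filtered[i] = probs[i] / total
    return filtered


# Example: with p = 0.75 only the two most likely tokens survive.
nucleus = top_p_filter([0.5, 0.3, 0.15, 0.05], p=0.75)
```

With P set to 1 the full vocabulary is kept, so the slider's upper bound effectively disables the filter.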

CFG Filter Top K

Top-k filter applied during CFG guidance: only the k highest-scoring tokens are kept.

Range: 15 to 50
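Top-k filtering in general can be sketched as masking every logit below the k-th largest value. How this application combines the filter with CFG is not stated here, so the snippet shows only the generic filter; `top_k_filter` is a hypothetical name.

```python
def top_k_filter(logits, k):
    """Mask all but the k highest logits with negative infinity.

    Masked tokens receive zero probability after a softmax. If several
    logits tie at the threshold, all of them are kept.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    if k >= len(logits):
        return list(logits)
    threshold = sorted(logits, reverse=True)[k - 1]
    return [x if x >= threshold else float("-inf") for x in logits]


# Example: keep the two highest-scoring tokens out of four.
masked = top_k_filter([1.0, 3.0, 2.0, 0.5], k=2)
```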

Speed Factor

Adjusts the speed of the generated audio (1.0 = original speed).

Range: 0.8 to 1
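One simple way to change playback speed is to resample the waveform by linear interpolation, sketched below in plain Python. Whether this application uses this exact method is an assumption, and `change_speed` is a hypothetical helper. Note that naive resampling of this kind changes pitch along with duration.

```python
def change_speed(samples, speed_factor):
    """Resample a list of audio samples by linear interpolation.

    A speed_factor below 1.0 produces more samples (slower playback at
    the same sample rate); 1.0 leaves the length unchanged.
    """
    if speed_factor <= 0:
        raise ValueError("speed_factor must be positive")
    n = len(samples)
    target_len = int(n / speed_factor)
    if n < 2 or target_len < 2:
        return list(samples)
    # Spacing between output positions, measured in input samples.
    step = (n - 1) / (target_len - 1)
    out = []
    for j in range(target_len):
        pos = j * step
        lo = min(int(pos), n - 2)  # left neighbor, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[lo + 1] * frac)
    return out


# Example: a factor of 0.5 doubles the number of samples.
slowed = change_speed([0.0, 1.0, 2.0, 3.0], speed_factor=0.5)
```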

Nari Text-to-Speech Synthesis