random drop description and text prompt #185

Balladie · 2025-01-05T11:13:43Z

Add options to drop the description and the text prompt with a specified probability in the data collator, controlled by the following arguments:

p_drop_description: probability of dropping the description (an option that may improve disentanglement between speaker and description)
range_cond_drop_description: range of ratios for the index up to which the audio codes are not trained (an option to keep the initial part of the audio from being trained without a description)
p_drop_prompt: probability of dropping the text prompt (to randomly learn purely unconditioned audio codes)
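
To make the intended behaviour concrete, here is a minimal sketch of how the three arguments could interact on a single example. It is not the code from this PR: the helper name `apply_random_drop`, the dictionary keys (`description_ids`, `prompt_ids`, `labels`), and the reading of `range_cond_drop_description` as a `(low, high)` pair from which a ratio is sampled uniformly are all assumptions made for illustration. Masking uses `-100`, the default `ignore_index` of PyTorch's `CrossEntropyLoss`.

```python
import random

# Label value ignored by PyTorch's CrossEntropyLoss by default.
IGNORE_INDEX = -100


def apply_random_drop(example, p_drop_description=0.0,
                      range_cond_drop_description=(0.0, 0.0),
                      p_drop_prompt=0.0, rng=random):
    """Hypothetical helper: return a copy of `example` with random dropping applied.

    `example` is assumed to be a dict of token-id lists with the keys
    "description_ids", "prompt_ids" and "labels" (the audio codes to predict).
    """
    out = {key: list(value) for key, value in example.items()}

    if rng.random() < p_drop_description:
        # Drop the description entirely.
        out["description_ids"] = []
        # Assumption: sample a ratio from the given range and exclude the
        # audio codes before that relative position from the loss.
        low, high = range_cond_drop_description
        ratio = rng.uniform(low, high)
        cutoff = int(ratio * len(out["labels"]))
        out["labels"][:cutoff] = [IGNORE_INDEX] * cutoff

    if rng.random() < p_drop_prompt:
        # Drop the text prompt so the audio codes are learned unconditioned.
        out["prompt_ids"] = []

    return out
```

With all probabilities left at `0.0` the example passes through unchanged, so the options would be opt-in under this sketch.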

I'm not sure these options work well in all scenarios, but I've noticed some improvement in zero-shot capability with an empty description, so I wanted to expose them and see how they behave in more cases (e.g. applied during pretraining).

Thanks for the great work! Please let me know if there's a missing or better option (or if there is already work in progress related to this...)

Balladie added 2 commits January 5, 2025 09:13

add dropping probability of description and text prompt

3ae75a9

fix argument help

13306f1
