Global condition and Local conditioning

In the white paper, they mention conditioning to a particular speaker as an input they condition globally, and the TTS component as an up-sampled (deconvolution) conditioned locally. For the latter, they also mention that they tried just repeating the values, but found it worked less well than doing the deconvolutions.

Is there effort underway to implement either of these? Practically speaking, implementing the local conditioning would allow us to begin to have this implementation speak recognizable words.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Global condition and Local conditioning #112

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Global condition and Local conditioning #112

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions