Why there is no positional embeds in the flow decoder transformer layers inputs? #381

JohnHerry · 2024-09-11T07:18:33Z

the flow model in cosyvoice, its encoder Conformers contains position embeds while in its decoder transformers, I see no such addition. is that means no benifit here in flow-matching? sorry for it if I did not find the code.

aluminumbox · 2024-09-12T15:43:38Z

well decoder is from matha-tts, we haven't tested whether position embedding can help in decoder, maybe it can

JohnHerry · 2024-09-13T01:00:32Z

Yes there is no position in macha, may be it assume the intput hidden had contain such info?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why there is no positional embeds in the flow decoder transformer layers inputs? #381

Why there is no positional embeds in the flow decoder transformer layers inputs? #381

JohnHerry commented Sep 11, 2024

aluminumbox commented Sep 12, 2024

JohnHerry commented Sep 13, 2024

Why there is no positional embeds in the flow decoder transformer layers inputs? #381

Why there is no positional embeds in the flow decoder transformer layers inputs? #381

Comments

JohnHerry commented Sep 11, 2024

aluminumbox commented Sep 12, 2024

JohnHerry commented Sep 13, 2024