Abstract: Many recent Text-to-Speech (TTS) models employing zero-shot voice cloning techniques are capable of reproducing the emotional tone present in the reference speech. However, they frequently ...