To move beyond a "robotic" Wiseguy delivery, research suggests:

State-of-the-art models like Tacotron 2, FastSpeech, and VALL-E excel at naturalness but fail on the Wiseguy for three reasons: