SYNTHESIS OF SPEECH FROM TEXT IN A VOICE OF A TARGET SPEAKER USING NEURAL NETWORKS

Number of patents in Portfolio can not be more than 2000

United States of America

APP PUB NO 20250095630A1
SERIAL NO

18966088

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech synthesis. The methods, systems, and apparatus include actions of obtaining an audio representation of speech of a target speaker, obtaining input text for which speech is to be synthesized in a voice of the target speaker, generating a speaker vector by providing the audio representation to a speaker encoder engine that is trained to distinguish speakers from one another, generating an audio representation of the input text spoken in the voice of the target speaker by providing the input text and the speaker vector to a spectrogram generation engine that is trained using voices of reference speakers to generate audio representations, and providing the audio representation of the input text spoken in the voice of the target speaker for output.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
GOOGLE LLC1600 AMPHITHEATRE PARKWAY MOUNTAIN VIEW CA 94043

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Chen, Zhifeng Sunnyvale, US 109 1008
Jia, Ye Mountain View, US 21 40
Lopez, Moreno Ignacio Brooklyn, US 26 346
Nguyen, Patrick An Phu Palo Alto, US 21 783
Pang, Ruoming New York, US 60 341
Ren, Fei Mountain View, US 33 39
Shen, Jonathan Mountain View, US 14 88
Wang, Quan Hoboken, US 221 1849
Weiss, Ron J New York, US 40 927
Wu, Yonghui Fremont, US 87 554
Zhang, Yu Mountain View, US 1971 9206

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation