Scaling Multilingual Speech Synthesis with Zero Supervision of Found Data

United States of America

APP PUB NO 20250078805A1
SERIAL NO 18823661

Abstract

A method includes receiving training data that includes a plurality of sets of training utterances each associated with a respective language. Each training utterance includes a corresponding reference speech representation paired with a corresponding input text sequence. For each training utterance, the method includes generating a corresponding encoded textual representation for the corresponding input text sequence, generating a corresponding speech encoding for the corresponding reference speech representation, generating a shared encoder output, and determining a text-to-speech (TTS) loss based on the corresponding encoded textual representation, the corresponding speech encoding, and the shared encoder output. The method also includes training a TTS model based on the TTS losses determined for the training utterances in each set of the training utterances to teach the TTS model to learn how to synthesize speech in each of the respective languages.
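
The per-utterance steps in the abstract can be illustrated with a minimal Python sketch. Everything below is hypothetical and is not taken from the patent: the toy encoders (`encode_text`, `encode_speech`), the choice to feed the text encoding into `shared_encoder`, the embedding size `DIM`, and the placeholder `tts_loss` are assumptions made only so the control flow runs. The sketch computes one loss per training utterance from the three quantities the abstract names and averages the losses per language; it stops short of the parameter update an actual training step would perform.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List

DIM = 4  # hypothetical embedding size, chosen only for this sketch


@dataclass
class TrainingUtterance:
    language: str                  # language of the set this utterance belongs to
    input_text: str                # input text sequence
    reference_speech: List[float]  # toy stand-in for a reference speech representation


def fold(values: List[float]) -> List[float]:
    """Fold a variable-length sequence into a fixed-size vector (toy encoder)."""
    vec = [0.0] * DIM
    for i, v in enumerate(values):
        vec[i % DIM] += v
    return vec


def encode_text(text: str) -> List[float]:
    # Hypothetical text encoder: scaled character codes folded into DIM slots.
    return fold([ord(ch) / 1000.0 for ch in text])


def encode_speech(frames: List[float]) -> List[float]:
    # Hypothetical speech encoder: frames folded into DIM slots.
    return fold(frames)


def shared_encoder(text_enc: List[float]) -> List[float]:
    # Assumption: the shared encoder consumes the text encoding.
    # The abstract does not state what the shared encoder takes as input.
    return [0.5 * x for x in text_enc]


def mse(a: List[float], b: List[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def tts_loss(text_enc: List[float], speech_enc: List[float],
             shared_out: List[float]) -> float:
    # Placeholder loss that merely uses all three quantities named in the abstract.
    return mse(shared_out, speech_enc) + mse(text_enc, speech_enc)


def losses_per_language(data: List[TrainingUtterance]) -> Dict[str, float]:
    """Compute one loss per utterance, then average the losses within each language."""
    per_lang = defaultdict(list)
    for utt in data:
        text_enc = encode_text(utt.input_text)
        speech_enc = encode_speech(utt.reference_speech)
        shared_out = shared_encoder(text_enc)
        per_lang[utt.language].append(tts_loss(text_enc, speech_enc, shared_out))
    return {lang: sum(v) / len(v) for lang, v in per_lang.items()}


# Made-up example data: two languages, each with its own set of utterances.
data = [
    TrainingUtterance("en", "hello", [0.1, 0.2, 0.3]),
    TrainingUtterance("en", "world", [0.2, 0.1, 0.0]),
    TrainingUtterance("ja", "konnichiwa", [0.3, 0.3, 0.1, 0.2]),
]
mean_losses = losses_per_language(data)
print(mean_losses)
```

In a real system the per-language losses would drive gradient updates of a single TTS model shared across languages; here they are only printed.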

Patent Owner(s)

Patent Owner: GOOGLE LLC
Address: 1600 AMPHITHEATRE PARKWAY MOUNTAIN VIEW CA 94043

Inventor(s)

Inventor Name          Address              # of Filed Patents   Total Citations
Beaufays, Francoise    Mountain View, US    59                   3225
Ramabhadran, Bhuvana   Mt. Kisco, US        125                  2541
Rosenberg, Andrew M    Brooklyn, US         20                   52
Saeki, Takaaki         Mountain View, US    6                    51
