TEXT TO SPEECH SYNTHESIS WITHOUT USING PARALLEL TEXT-AUDIO DATA

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20240119922A1
SERIAL NO

17953851

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

An unsupervised text to speech system utilizing a lexicon to map input text to the phoneme sequence, which is expanded to the frame-level forced alignment with a speaker-dependent duration model. An alignment mapping module that converts the forced alignment to the unsupervised alignment (UA). Afterword, a Conditional Disentangled Sequential Variational Auto-encoder (C-DSVAE), serving as the self-supervised TTS AM, takes the predicted UA and a target speaker embedding to generate the mel spectrogram, which is ultimately converted to waveform with a neural vocoder.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
TENCENT AMERICA LLC2747 PARK BOULEVARD PALO ALTO CA 94306

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
LIAN, Jiachen Palo Alto, US 3 1
YU, Dong Palo Alto, US 354 6818
ZHANG, Chunlei Bellevue, US 86 685

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation