TEXT-TO-AUDIO CONVERSION WITH BYTE-ENCODING VECTORS

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20250104692A1
SERIAL NO

18407083

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A diffusion model may be used to generate an audio signal from text. The diffusion model may process received text and noise vectors to compute encoded audio vectors that correspond to the text. The encoded audio vectors may be decoded to generate an audio signal of a person speaking the text that may be presented to a user. The diffusion model may process a sequence of byte-encoding vectors corresponding to the text, and the use of the byte-encoding vectors may allow for the generation of higher quality audio signals. In some implementations, prompt audio of a person may also be used to generate an audio signal that resembles the speech of that person.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
ASAPP INCONE WORLD TRADE CENTER 80TH FLOOR NEW YORK NY 10007

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Kim, Kwangyoun San Jose, US 29 126
Lovelace, Justin Robert Ithaca, US 1 0
Ray, Soham Ithaca, US 2 0
Weinberger, Kilian Quirin Ithaca, US 17 398
Wu, Felix Issaquah, US 15 276

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation