PRONUNCIATION-AWARE EMBEDDING GENERATION FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS

Number of patents in Portfolio can not be more than 2000

United States of America

APP PUB NO 20250078827A1
SERIAL NO

18423224

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

One or more embodiments include: receiving a first frame of acoustic input and one or more prior textual tokens associated with a prior frame of the acoustic input, wherein the prior textual token represents one or more spoken word included in the acoustic input; generating a multi-dimensional embedding associated with the prior textual token, wherein each dimension of the embedding represents a different characteristic of the prior textual token, and at least one dimension of the embedding represents pronunciation information associated with the prior textual token; and generating a textual token associated with the first frame based at least on an encoded representation of the first frame and the multi-dimensional embedding associated with the prior textual token.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
NVIDIA CORPORATION2788 SAN TOMAS EXPRESSWAY SANTA CLARA CA 95051

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
CHEN, Zhehuai Odessa, US 21 26
GINSBURG, Boris Sunnyvale, US 18 106
JIA, Fei Santa Clara, US 12 26
XU, Hainan New York, US 3 3

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation