TEXTUAL KNOWLEDGE TRANSFER FOR IMPROVED SPEECH RECOGNITION AND UNDERSTANDING

Number of patents in Portfolio can not be more than 2000

United States of America

APP PUB NO 20240371361A1
SERIAL NO

18310598

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Systems, computer-implemented methods, and computer program products to facilitate fine-grained textual knowledge transfer to improve speech recognition and understanding are provided. According to an embodiment, a system can comprise a processor that executes components stored in memory. The computer executable components comprise deriving component that can derive one or more speech-based embeddings from an utterance via a speech encoder. The computer executable components can comprise a cross-attention component that can align, at a token level, one or more large language model (LLM) based sentence embeddings with the one or more speech-based embeddings. The computer executable components can comprise a loss component that can combine an alignment loss and an automatic speech recognition (ASR) loss.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
INTERNATIONAL BUSINESS MACHINES CORPORATIONNEW ORCHARD ROAD ARMONK NY 10504

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Fosler-Lussier, Eric Columbus, US 2 0
Kingsbury, Brian E D Cortlandt Manor, US 34 572
Kuo, Hong-Kwang Pleasantville, US 18 409
Saon, George Andrei Stamford, US 19 112
Sunder, Vishal Columbus, US 3 5
Thomas, Samuel White Plains, US 50 241

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation