Mixture Model Attention for Flexible Streaming and Non-Streaming Automatic Speech Recognition

Number of patents in Portfolio can not be more than 2000

United States of America

APP PUB NO 20250022458A1
SERIAL NO

18896830

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method for an automated speech recognition (ASR) model for unifying streaming and non-streaming speech recognition including receiving a sequence of acoustic frames. The method includes generating, using an audio encoder of an automatic speech recognition (ASR) model, a higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. The method further includes generating, using a joint encoder of the ASR model, a probability distribution over possible speech recognition hypothesis at the corresponding time step based on the higher order feature representation generated by the audio encoder at the corresponding time step. The audio encoder comprises a neural network that applies mixture model (MiMo) attention to compute an attention probability distribution function (PDF) using a set of mixture components of softmaxes over a context window.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
GOOGLE LLC1600 AMPHITHEATRE PARKWAY MOUNTAIN VIEW CA 94043

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Audhkhasi, Kartik Mount Kisco, US 25 364
Chen, Tongzhou Mountain View, US 7 11
Mengibar, Pedro J Moreno Jersey City, US 65 1165
Ramabhadran, Bhuvana Mt. Kisco, US 125 2541

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation