DYNAMIC LOW-RANK ESTIMATION FOR TRANSFORMER-BASED LANGUAGE MODELS

United States of America Patent

APP PUB NO: 20250029005A1
SERIAL NO: 18669413

Abstract

A method includes accessing a plurality of weight matrices of a machine learning model. The method also includes, for each weight matrix, decomposing the weight matrix into a U matrix, an S matrix, and a V matrix using singular value decomposition. The S matrix is a diagonal matrix, and a singular group corresponds to each element in the S matrix. The method further includes, for each weight matrix, determining an importance score of each singular group. The importance score of the singular group represents a change in loss if the singular group is removed from the machine learning model. The method also includes, for each weight matrix, ranking the singular groups across the plurality of weight matrices based on the importance scores. In addition, the method includes, for each weight matrix, identifying one or more of the singular groups to prune based on the ranking of the singular groups.
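The steps in the abstract can be illustrated with a minimal NumPy sketch. It is not the patented implementation: the abstract does not say how the importance score is computed, so this sketch assumes the simplest reading, which is to remove one singular group at a time and re-evaluate the loss directly. The function names (`decompose`, `rebuild`, `importance_scores`, `select_groups_to_prune`), the two-matrix linear toy model, and `toy_loss` are all hypothetical and chosen only for illustration.

```python
import numpy as np

def decompose(weights):
    """SVD each weight matrix so that W = U @ diag(S) @ Vt."""
    return [np.linalg.svd(W, full_matrices=False) for W in weights]

def rebuild(U, S, Vt, keep):
    """Reconstruct a matrix from the singular groups selected by boolean mask `keep`."""
    return (U[:, keep] * S[keep]) @ Vt[keep, :]

def importance_scores(factors, loss_fn):
    """Score each singular group as the loss change when that group alone is removed.

    Assumption: direct ablation. Returns a list of (score, matrix_index, group_index).
    """
    full = [rebuild(U, S, Vt, np.ones(len(S), dtype=bool)) for U, S, Vt in factors]
    base = loss_fn(full)
    scores = []
    for m, (U, S, Vt) in enumerate(factors):
        for i in range(len(S)):
            keep = np.ones(len(S), dtype=bool)
            keep[i] = False                      # drop singular group i of matrix m
            trial = list(full)
            trial[m] = rebuild(U, S, Vt, keep)
            scores.append((loss_fn(trial) - base, m, i))
    return scores

def select_groups_to_prune(scores, n_prune):
    """Rank groups across all matrices and return the n_prune least important ones."""
    ranked = sorted(scores, key=lambda t: t[0])  # smallest loss change first
    return [(m, i) for _, m, i in ranked[:n_prune]]

# Hypothetical toy model: two stacked linear layers and random calibration inputs.
rng = np.random.default_rng(0)
W1 = rng.normal(size=(6, 4))
W2 = rng.normal(size=(4, 3))
X = rng.normal(size=(32, 6))
target = X @ W1 @ W2

def toy_loss(ws):
    """Mean squared error between the modified model's output and the original output."""
    return float(np.mean((X @ ws[0] @ ws[1] - target) ** 2))

factors = decompose([W1, W2])
scores = importance_scores(factors, toy_loss)
to_prune = select_groups_to_prune(scores, n_prune=2)
print(to_prune)  # list of (matrix_index, singular_group_index) pairs
```

Because the ranking is global, the number of groups pruned from each matrix is not fixed in advance, so each matrix can end up with a different rank. Direct ablation costs one loss evaluation per singular group, which is why a practical system would likely use a cheaper approximation of the loss change; that choice is outside what the abstract specifies.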

Patent Owner(s)

Patent Owner                 Address
SAMSUNG ELECTRONICS CO LTD   SUWON-SI

Inventor(s)

Inventor Name    Address             # of Filed Patents   Total Citations
Gao, Shangqian   Mountain View, US   14                   32
Hsu, Yen-Chang   Fremont, US         11                   1
Hua, Ting        Cupertino, US       18                   7
Jin, Hongxia     San Jose, US        149                  1846
Li, Xiao         Ann Arbor, US       322                  4192
Shen, Yilin      Mountain View, US   62                   197
