LOSS-ERROR-AWARE QUANTIZATION OF A LOW-BIT NEURAL NETWORK

Number of patents in Portfolio can not be more than 2000

United States of America

APP PUB NO 20250117639A1
SERIAL NO

18886625

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Methods, apparatus, systems and articles of manufacture for loss-error-aware quantization of a low-bit neural network are disclosed. An example apparatus includes a network weight partitioner to partition unquantized network weights of a first network model into a first group to be quantized and a second group to be retrained. The example apparatus includes a loss calculator to process network weights to calculate a first loss. The example apparatus includes a weight quantizer to quantize the first group of network weights to generate low-bit second network weights. In the example apparatus, the loss calculator is to determine a difference between the first loss and a second loss. The example apparatus includes a weight updater to update the second group of network weights based on the difference. The example apparatus includes a network model deployer to deploy a low-bit network model including the low-bit second network weights.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
INTEL CORPORATION2200 MISSION COLLEGE BLVD SANTA CLARA CALIFORNIA 95054 UNITED STATES OF AMERICA

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Chen, Yurong Beijing, CN 121 1280
Wang, Kuan Beijing, CN 10 29
Yao, Anbang Beijing, CN 156 2323
Zhao, Hao Beijing, CN 53 166
Zhou, Aojun Beijing, CN 6 34

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation