TENSOR PROCESSING USING LOW PRECISION FORMAT

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20170372202A1
SERIAL NO

15624577

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Aspects of the present invention are directed to computer-implemented techniques for improving the training of artificial neural networks using a reduced precision (e.g., float16) data format. Embodiments of the present invention rescale tensor values prior to performing matrix operations (such as matrix multiplication or matrix addition) to prevent overflow and underflow. To preserve accuracy throughout the performance of the matrix operations, the scale factors are defined using a novel data format to represent tensors, wherein a matrix is represented by the tuple X, where X=(a, v[.]), wherein a is a float scale factor and v[.] are scaled values stored in the float16 format. The value of any element X[i] according to this data format would be equal to a*v[i].

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
NVIDIA CORPORATION2788 SAN TOMAS EXPRESSWAY SANTA CLARA CA 95051

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
FIT-FLOREA, Alex Belmont, US 7 81
GHOLAMINEJAD, Amir Santa Clara, US 5 89
GINSBURG, Boris Santa Clara, US 18 106
HOUSTON, Michael Saratoga, US 20 200
KIERAT, Slawomir Mountain View, US 4 78
KISWANI, Ahmad Santa Clara, US 1 61
NIKOLAEV, Sergei Santa Clara, US 7 135
WU, Hao Hanzhou, CN 930 3100

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation