AUGMENTING LEGACY NEURAL NETWORKS FOR FLEXIBLE INFERENCE

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20230325670A1
SERIAL NO

17820780

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A technique for dynamically configuring and executing an augmented neural network in real-time according to performance constraints also maintains the legacy neural network execution path. A neural network model that has been trained for a task is augmented with low-compute “shallow” phases paired with each legacy phase and the legacy phases of the neural network model are held constant (e.g., unchanged) while the shallow phases are trained. During inference, one or more of the shallow phases can be selectively executed in place of the corresponding legacy phase. Compared with the legacy phases, the shallow phases are typically less accurate, but have reduced latency and consume less power. Therefore, processing using one or more of the shallow phases in place of one or more of the legacy phases enables the augmented neural network to dynamically adapt to changes in the execution environment (e.g., processing load or performance requirement).

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
NVIDIA CORPORATION2701 SAN TOMAS EXPRESSWAY SANTA CLARA CA 95050

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Alvarez, Lopez Jose Manuel Mountain View, US 23 30
Clemons, Jason Lavar Leander, US 5 30
Frosio, Iuri Bergamo, IT 26 325
Keckler, Stephen W Austin, US 37 1250
Shen, Maying Fremont, US 6 6

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation