TRAINING AN UNSUPERVISED MEMORY-BASED PREDICTION SYSTEM TO LEARN COMPRESSED REPRESENTATIONS OF AN ENVIRONMENT

United States of America

APP PUB NO 20250051289A1
SERIAL NO 18929321

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a memory-based prediction system configured to receive an input observation characterizing a state of an environment interacted with by an agent and to process the input observation and data read from a memory to update data stored in the memory and to generate a latent representation of the state of the environment. The method comprises: for each of a plurality of time steps: processing an observation for the time step and data read from the memory to: (i) update the data stored in the memory, and (ii) generate a latent representation of the current state of the environment as of the time step; and generating a predicted return that will be received by the agent as a result of interactions with the environment after the observation for the time step is received.
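The per-time-step loop described in the abstract can be illustrated with a small sketch. This is not the patented system: the class `ToyMemoryPredictor`, the softmax-attention read, the circular-slot write, the random untrained weights, and the choice to compute the predicted return from the latent are all assumptions made purely for illustration.

```python
import math
import random


def matvec(w, x):
    """Multiply matrix w (a list of rows) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def rand_matrix(rng, rows, cols, scale=0.1):
    """Small random weights; a real system would learn these."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


class ToyMemoryPredictor:
    """Hypothetical stand-in for a memory-based prediction system."""

    def __init__(self, obs_dim, latent_dim, num_slots, seed=0):
        rng = random.Random(seed)
        self.latent_dim = latent_dim
        # Memory: num_slots rows, each holding one latent-sized vector.
        self.memory = [[0.0] * latent_dim for _ in range(num_slots)]
        self.write_index = 0
        self.w_query = rand_matrix(rng, latent_dim, obs_dim)
        self.w_latent = rand_matrix(rng, latent_dim, obs_dim + latent_dim)
        self.w_return = [rng.gauss(0.0, 0.1) for _ in range(latent_dim)]

    def read(self, obs):
        """Softmax-attention read over the slots (an assumed read scheme)."""
        query = matvec(self.w_query, obs)
        scores = [sum(q * m for q, m in zip(query, slot)) for slot in self.memory]
        top = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        return [
            sum(w / total * slot[i] for w, slot in zip(weights, self.memory))
            for i in range(self.latent_dim)
        ]

    def step(self, obs):
        """Process one observation together with data read from memory."""
        read_vec = self.read(obs)
        # (ii) latent representation of the current state of the environment.
        latent = [math.tanh(v) for v in matvec(self.w_latent, list(obs) + read_vec)]
        # (i) update the data stored in memory (assumed: overwrite slots in turn).
        self.memory[self.write_index] = latent
        self.write_index = (self.write_index + 1) % len(self.memory)
        # Predicted return, here a linear readout of the latent (an assumption).
        predicted_return = sum(w * z for w, z in zip(self.w_return, latent))
        return latent, predicted_return


def run_episode(model, observations):
    """Apply the per-time-step processing to a sequence of observations."""
    return [model.step(obs) for obs in observations]
```

For example, `run_episode(ToyMemoryPredictor(obs_dim=3, latent_dim=4, num_slots=2), [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])` returns one `(latent, predicted_return)` pair per time step. Training, which the abstract does not detail, is deliberately left out of the sketch.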

Patent Owner(s)

Patent Owner: DEEPMIND TECHNOLOGIES LIMITED
Address: 5 NEW STREET SQUARE LONDON EC4A 3TW

Inventor(s)

Inventor Name             Address                     # of Filed Patents   Total Citations
Ahuja, Arun               London, GB                  24                   663
Amos, David Antony        Hassocks West Sussex, GB    2                    2
Hung, Chia-Chun           London, GB                  47                   145
Lillicrap, Timothy Paul   London, GB                  28                   163
Mirza, Mohammadi Mehdi    London, GB                  2                    2
Wayne, Gregory Duncan     London, GB                  19                   105
