SYSTEM AND METHOD FOR MULTI-OBJECTIVE REINFORCEMENT LEARNING WITH GRADIENT MODULATION

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20230351201A1
SERIAL NO

18139330

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Systems are methods are provided for processing multiple input objectives by a reinforcement learning agent. The method may include: instantiating a reinforcement learning agent that maintains a reinforcement learning neural network and generates, according to outputs of the reinforcement learning neural network, signals for communicating task requests; receiving a plurality of input data representing a plurality of user objectives associated with a task request and a plurality of weights; generating a plurality of preferences based on the plurality of user objectives and the plurality of weights; computing a plurality of loss values; computing a plurality of first gradients based on the plurality of loss values; for a plurality of pairs of references, computing a plurality of similarity metrics; computing an updated gradient based on the first gradients and the plurality of similarity metrics; and updating the reinforcement learning neural network based on the updated gradient.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
ROYAL BANK OF CANADA6TH FLOOR SOUTH TOWER ROYAL BANK PLAZA 200 BAY STREET TORONTO M5J 2J5

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
AZAM, Muhammad Mustajab Mississauga, CA 1 0
CHMURA, Jacob Mississauga, CA 2 0
HUANG, Hongfeng Toronto, CA 6 444
YU, Zhuo Toronto, CA 6 17

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation