POLICY TRAINING DEVICE, POLICY TRAINING METHOD, AND COMMUNICATION SYSTEM

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20250103957A1
SERIAL NO

18814671

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A policy training device that trains, through first reinforcement learning, a first agent configured to output a first action of a control object according to an input of a first state of the control object, includes a memory, and processor circuitry coupled to the memory and configured to change a first parameter regarding a constraint condition in the first reinforcement learning for every predetermined number of times of a training operation in the first reinforcement learning, and train the first agent by using the first parameter as at least a part of the first state and by ensuring that the constraint condition is satisfied.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
FUJITSU LIMITED1-1 KAMIKODANAKA 4-CHOME NAKAHARA-KU KAWASAKI-SHI KANAGAWA 211-8588

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
ABE, Fumika Sasaguri, JP 2 0
ISHIKAWA, Natsuki Yamato, JP 9 0
OGAWA, Masatoshi Zama, JP 65 471
OKAWA, Yoshihiro Yokohama, JP 44 105
TERANISHI, Yuta Fukuoka, JP 21 48

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation