EXPLORATION METHOD BASED ON REWARD DECOMPOSITION IN MULTI-AGENT REINFORCEMENT LEARNING

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20240256885A1
SERIAL NO

18517931

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Provided is an exploration method based on reward decomposition in multi-agent reinforcement learning. The exploration method includes: generating a positive reward estimation model through neural network training based on training data including states of all agents, actions of all the agents, and a global reward true value; generating, for each of the agents, a first individual utility function based on the global reward true value and generating a second individual utility function using the positive reward estimation model; and determining an action of each of the agents using the first individual utility function and the second individual utility function based on the state of each of the agents.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTEDAEJEON

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
CHUNG, Euisok Daejeon, KR 12 50
HAN, Ran Daejeon, KR 18 28
KIM, Hyun Woo Daejeon, KR 139 807
SONG, Hwajeon Daejeon, KR 8 0
YANG, Jeongmin Daejeon, KR 14 56
YI, Sungwon Daejeon, KR 15 490
YOO, Byunghyun Daejeon, KR 5 0

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation