Analysis of Model-Free Reinforcement Learning Algorithm for Target Tracking

Fikry, Muhammad; Adek, Rizal Tjut; Hartanto, Subhan; Taufiqurrahman; Rinawati, Dyah Ika

Date

2022-04-01

Author

Fikry, Muhammad

Adek, Rizal Tjut

Hartanto, Subhan

Taufiqurrahman

Rinawati, Dyah Ika

Metadata

Show full item record

Abstract

Target tracking is a process that can find points in different domains. In tracking, some places contain prizes (positive or negative values) that the agent does not know at first. Therefore, the agent, which is a system, must learn to get the maximum value with various learning rates. Reinforcement learning is a machine learning technique in which agents learn through interaction with the environment using reward functions and probabilistic dynamics to allow agents to explore and learn about the environment through various iterations. Thus, for each action taken, the agent receives a reward from the environment, which determines positive or negative behavior. The agent's goal is to maximize the total reward received during the interaction. In this case, the agent will study three different modules, namely sidewalk, obstacle, and product, using the Q-learning algorithm. Each module will be training with various learning rates and rewards. Q-learning can work effectively with the highest final reward at a learning rate of 0.8 for 500 rounds with an epsilon of 0.9.

URI

https://ejournal.upi.edu/index.php/COELITE/article/view/43795

Collections

LP - Artikel Publikasi Nasional