Enhancing performance of reinforcement learning models in the presence of noisy rewards

Date

2019-04-08

Authors

Kumar, Aashish

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Reinforcement learning models learn the optimal policy by interacting with the environment and observing the states and rewards. If the rewards that the model observes are noisy then learning an optimal policy becomes a difficult task. We present an approach that can enhance the performance of reinforcement learning models in the presence of noisy rewards. Along with the standard reinforcement learning model, we propose to use a noise filter which estimates the true reward that the model should receive. The noise filter is designed using a non-linear approximator. Through various experiments we demonstrate that this approach improves the performance of the model in the presence of noisy rewards

Description

LCSH Subject Headings

Citation