Punit Pandey

  1. Reinforcement Learning by Comparing Immediate Reward.

    Authors: Shishir Kumar, Punit Pandey, Deepshikha Pandey
    Subjects: Learning
    Abstract

    This paper introduces an approach to Reinforcement Learning Algorithm by
    comparing their immediate rewards using a variation of Q-Learning algorithm.
    Unlike the conventional Q-Learning, the proposed algorithm compares current
    reward with immediate reward of past move and work accordingly. Relative reward
    based Q-learning is an approach towards interactive learning. Q-Learning is a
    model free reinforcement learning method that used to learn the agents.

RSS-материал