Reinforcement Learning Types

There are three main types of machine reinforcement learning: Value-based reinforcement learning Value-based reinforcement learning algorithms learn a value function, which maps from states to expected rewards. The value function is used to estimate the expected reward of taking a particular action in a particular state. Some of the most popular value-based reinforcement learning algorithms include Q-learning, SARSA, and Deep Q-learning. Policy-based reinforcement learning Policy-based reinforcement learning algorithms learn a policy, which maps from states to actions. The policy specifies the probability of taking each action in each state. Some of the most popular policy-based reinforcement learning algorithms include policy gradients, actor-critic methods, and trust region policy optimization. Model-based reinforcement learning Model-based reinforcement learning algorithms learn a model of the environment. The model is used to predict the state of the environment after taking an ac...