Quantcast
Channel: Cloud Training Program
Viewing all articles
Browse latest Browse all 1891

An Introduction To Reinforcement Learning

$
0
0

Reinforcement Learning is an influential branch of Machine Learning. It is used to solve interacting problems where the info observed up to time t is taken into account to decide which action to require at time t + 1. In this blog, we are going to cover everything about Reinforcement Learning.

In this blog, I’m going to cover:

  1. What Is Reinforcement Learning (RL)?
  2. What is Policy Search?
  3. What is Learning?
  4. How Is RL Similar to Traditional Controls?
  5. RL Workflow Overview.
  6. What is Environment?
  7. Model-Free RL Vs Model-Based RL.

What Is Reinforcement Learning (RL)?

  • In reinforcement learning, an agent (agent is our artificial intelligence) takes actions within a true or virtual environment, relying on feedback from rewards to find out the foremost suitable way to achieve its goal.
  • A machine or a robot using reinforcement learning to solve an identical problem in different ways.
  • It is also used for AI when training machines to perform tasks such as walking.
  • Required outcomes provide the AI with reward, undesired with punishment. Machines learn through trial and error.
  • The agent’s going to keep doing that was going to be taking actions changing the state, getting rewards changing action taking actions changing the state, and getting rewards.
  • And by doing that process it’s going to be learning about what was going to be exploring the environment understanding what actions lead to good rewards and favorable states and what actions the two rewards an unfavorable state.
  • The simplest way to think of reinforcement learning is like training a dog when you train the dog you to give it certain commands and if it obeys those commands then you give it a reach you give it like a biscuit or something if it doesn’t obey those commands you tell it that it’s a bad dog or you just don’t give it a treat.
  • Reinforcement Learning using a neural network policy.

reinforcement-learning-method

What is Policy Search?

  • The algorithm a software agent uses to search out its actions is called its policy.
  • The policy can be a neural network taking observations as inputs and outputting the action to require.
  • The ultimate aim of the agent is to use reinforcement learning algorithms to find out the foremost appropriate policy as it interacts with the environment.
  • So that, specified in any state, the agent will always take the most optimal action.

Reinforcement_Diagram-06

What is Learning?

  • In reinforcement learning, Learning is that the term given to the method of regularly adjusting those parameters to converge on the optimal policy.
  • We can specialize in putting in place an appropriate policy structure without manually tuning the function to induce the proper parameters.
  • The computer learns the parameters on its own through a process of fancy trial and error.

actions

How Is RL Almost Like Traditional Controls?

  • The target of reinforcement learning is comparable to the control problem; it’s just a special way and uses different terms to represent similar concepts.
  • With both approaches, we would like to see the proper inputs into a system that may generate the required system behavior.
  • We are trying to figure out how to construct the policy (or the controller) that maps the observed state of the environment to the best actions.
  • The feedback signal is the information from the environment, and the reference signal is built into both the environment observations and the reward function.

Reinforcement_Diagram-05

RL Workflow Overview

1) In reinforcement learning, we need an environment where our agents can learn. We need to choose what should exist within the environment and whether it’s a simulation or a physical setup.

rl-environment

2) We need to think about what we ultimately want our agent to do and craft a reward function that will incentivize the agent to do just that.

rl-reward

3) In reinforcement learning, we need to choose a way to represent the policy. Consider how we want to structure the parameters and logic that make up the decision-making part of the agent.

rl-policy

4) We need to choose an algorithm to train the agent that works to find the optimal policy parameters.

rl-training

5) Finally, we need to exploit the policy by deploying it in the field and verifying the results.

rl-deploy

What is Environment?

  • In reinforcement learning, the environment is everything that exists outside of the agent.
  • It is where the agent sends actions, and it is what generates rewards and observations.
  • The environment is everything but the agent. This includes the system dynamics. In this way, most of the system is actually part of the environment. The agent is just a bit of software that is generating the actions and updating the policy through learning.

environment

Model-Free RL Vs Model-Based RL

  • Model-based RL can lower the time it takes to learn an optimal policy because we can use the model to guide the agent away from areas of the state space that you know have low rewards.
  • Model-free reinforcement learning is the more general case. For example, the agent doesn’t need to know the dynamics or kinematics of the walking robot. It will still find out how to gather the most reward without knowing how the joints move or the lengths of the appendages.
  • With model-based RL, we don’t need to know the full environment model; we can provide the agent with just the parts of the environment we know.
  • Model-free RL is popular right now because people hope to use it to solve problems were developing a model even straightforward – is difficult.

Reinforcement_Diagram-07

Next Task For You

Begin your journey towards Introduction To Data Science and Machine Learning by joining our FREE Informative Class on Introduction To Data Science and Machine Learning by clicking on the below image.

AIML_ContentUpgrade-06

The post An Introduction To Reinforcement Learning appeared first on Cloud Training Program.


Viewing all articles
Browse latest Browse all 1891

Trending Articles