Reinforcement Learning: Fundamentals

6 min readOct 6, 2024

An agent interacting with its environment.

Table of Contents:
1. Overview
2. Multi-armed Bandits
3. Markov Decision Process
4. Returns and episodes
5. Value Functions
6. Bellman Equation
7. Policy Iteration

Overview

Reinforcement learning is learning what to do — how to map situations to actions — so as to maximize a numerical reward signal. The main sub-elements of a reinforcement learning system: are policy, reward signal, value function, agent, and environment. A policy is a mapping from perceived states of the environment to actions to be taken when in those states. A reward signal defines the goal of a reinforcement learning problem. The value of a state is the total amount of reward an agent can expect to accumulate over the future, starting from that state. We seek actions that bring about states of highest value, not highest reward. A state might always yield a low immediate reward but still have a high value because it is regularly followed by other states that yield high rewards. The learner and decision maker is called the agent and every thing else is environment.The agent exists in an environment described by some set of possible states S (shown in the above figure). It can perform any of a set of…

Reinforcement Learning: Fundamentals

Overview

Create an account to read the full story.

Written by Rahul Kumar

More from Rahul Kumar

Debugging Spark Job

Table of Contents: 1. Spark UI Basics 2. Slow Tasks or Stragglers 3. Slow Aggregations 4. Slow Joins 5. Slow Reads and Writes 6. Out Of…

Pandas to PySpark

Table of Contents : 1. Create/Read/Write files as Dataframes 2. Inspect Data 3. Filter/Queries 4. Drop or Fill Nulls and Duplicate Values…

Recommender Systems

Table of Contents 1. Content-based filtering 2. Collaborative filtering 3. Hybrid Models 4. Recommendation Evaluation metrics 5. Cold Start…

Hypothesis Evaluation

Table of Contents: 1. Introduction 2. Parametric Tests 3. Non-Parametric Tests 4. Stationarity Tests 5. Correlation Tests 6. Normality…

Recommended from Medium

Reinforcement Learning and Inverse Reinforcement Learning Notes

Reinforcement learning is about learning to act in an environment to achieve the best long-term outcomes through trial, feedback, and…

Stuck Without Data? — Five Python Libraries Can Help!

One of the most annoying issues in a data science project or academic research is having no data to work on. In many situations, we may…

Lists

Natural Language Processing

Practical Guides to Machine Learning

data science and AI

Staff Picks

7 GitHub Repos to Transform You into a Pro ML/AI Engineer

Hands-On Guides, Tools, and Frameworks to Fast-Track Your AI Journey

Reinforcement Learning

Reinforcement Learning (RL) is a subset of machine learning that enables an agent to learn in an interactive environment by trial and error…

Top Data Science Career Questions, Answered

I’ve been a data scientist for over 3 years. This is what most people want to know about the field.

Building a Maze Solver with Reinforcement Learning in Python

Step-by-Step guide with code snippets and explanations