The Cart-Pole, also known as an inverted pendulum, is a system with a center of gravity above its pivot point. It's unstable by nature, but can be controlled by moving the pivot point under the center of mass. The goal is to keep the Cart-Pole balanced by applying appropriate forces to the pivot point.
You can try it out yourself here before diving into the reinforcement learning (RL) algorithm used in this project, Q-Learning.
This project uses OpenAI Gym's version of the Cart-Pole environment, which you can find here if you want to learn more about how the environment works.
To run the project, simply execute the Q-Learning_Cartpole.py
file
The reward during training is plotted against the number of iterations:
The reward during validation is plotted against the number of iterations: