Project 3: Reinforcement Learning

This project implements model-based and model-free reinforcement learning algorithms.

Value Iteration Agent: It utilizes an MDP and runs value iteration for set iterations before the constructor returns. It implements both asynchronous & prioritized sweeping.
Q-Learning: A RL agent that learns by trial and error from interactions with the environment through its update(state, action, nextState, reward) method. Approximate Q-learning is also implemented

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
__pycache__		__pycache__
layouts		layouts
test_cases		test_cases
README.md		README.md
VERSION		VERSION
analysis.py		analysis.py
autograder.py		autograder.py
crawler.py		crawler.py
environment.py		environment.py
featureExtractors.py		featureExtractors.py
game.py		game.py
ghostAgents.py		ghostAgents.py
grading.py		grading.py
graphicsCrawlerDisplay.py		graphicsCrawlerDisplay.py
graphicsDisplay.py		graphicsDisplay.py
graphicsGridworldDisplay.py		graphicsGridworldDisplay.py
graphicsUtils.py		graphicsUtils.py
graphicsUtils_old.py		graphicsUtils_old.py
gridworld.py		gridworld.py
keyboardAgents.py		keyboardAgents.py
layout.py		layout.py
learningAgents.py		learningAgents.py
mdp.py		mdp.py
pacman.py		pacman.py
pacmanAgents.py		pacmanAgents.py
projectParams.py		projectParams.py
qlearningAgents.py		qlearningAgents.py
reinforcementTestClasses.py		reinforcementTestClasses.py
testClasses.py		testClasses.py
testParser.py		testParser.py
textDisplay.py		textDisplay.py
textGridworldDisplay.py		textGridworldDisplay.py
util.py		util.py
valueIterationAgents.py		valueIterationAgents.py

Provide feedback