This project aims to develop an AI agent that plays the mobile game Crossy Road using Reinforcement Learning (RL) and Computer Vision (CV) techniques. The agent's main objective is to navigate the game environment, avoiding obstacles and successfully crossing the road.
The source code and related artifacts can be found at: GitHub Repository
- Repository: Contains the main codebase for the project.
- Dataset: Custom dataset of annotated game screenshots.
- Script for Screenshots Collection: Python script for capturing game screenshots at regular intervals (a capture sketch follows this list).
- CV_Models: Models developed for object detection.
- RL_Models: Models developed for reinforcement learning.
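As a rough illustration of how the screenshot collection can work, here is a minimal sketch that grabs a fixed screen region at a fixed interval. It assumes the `mss` library and hypothetical region/interval values; the project's actual script may use different tools and parameters.

```python
# Sketch of periodic screenshot capture. The capture region, interval, and
# output directory are illustrative assumptions, not the project's settings.
import os
import time

import mss
import mss.tools

CAPTURE_REGION = {"top": 100, "left": 100, "width": 800, "height": 600}  # assumed game window
INTERVAL_SECONDS = 2.0                                                   # assumed capture rate

def collect_screenshots(n_frames: int, out_dir: str = "screenshots") -> None:
    os.makedirs(out_dir, exist_ok=True)
    with mss.mss() as sct:
        for i in range(n_frames):
            frame = sct.grab(CAPTURE_REGION)
            mss.tools.to_png(frame.rgb, frame.size, output=f"{out_dir}/frame_{i:05d}.png")
            time.sleep(INTERVAL_SECONDS)

if __name__ == "__main__":
    collect_screenshots(n_frames=100)
```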
In Crossy Road, the player controls a hen that must cross a busy road while avoiding obstacles. Our goal was to create a reinforcement learning agent capable of mastering this game. We experimented with various techniques, starting with raw screenshots and evolving to a grid-based representation of the game environment to enhance the agent's understanding.
The project consists of two main components, a reinforcement learning agent and a computer vision pipeline:
- Environment: A continuous scrolling road with various obstacles.
- State-space: Includes the game screenshot, the position of the hen, and the position of obstacles.
- Action-space: The agent can move left, right, backward, or forward (a minimal environment sketch follows this list).
- Reward Function Design:
- Positive Rewards: Encourages forward movement and strategic patience.
- Negative Rewards: Penalizes game over and inefficient movements.
- Object Detection with visual model: Identifies key elements in the game, including the hen and obstacles.
- Template Matching Method: Used to detect the restart button (a CV sketch follows the detection video link below).
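To make the state-space, action-space, and reward design above concrete, here is a minimal Gymnasium-style sketch. The class name, grid shape, reward magnitudes, and the `_read_game_state` helper are illustrative assumptions rather than the project's exact implementation.

```python
# Minimal sketch of the Crossy Road MDP described above. Reward values,
# grid size, and helper names are assumptions for illustration only.
import numpy as np
import gymnasium as gym
from gymnasium import spaces

class CrossyRoadEnv(gym.Env):
    ACTIONS = ["forward", "backward", "left", "right"]  # discrete action-space

    def __init__(self, grid_shape=(12, 9)):
        super().__init__()
        self.action_space = spaces.Discrete(len(self.ACTIONS))
        # Observation: grid of integer class labels (empty, road, car, hen, ...).
        self.observation_space = spaces.Box(low=0, high=10, shape=grid_shape, dtype=np.int64)

    def _read_game_state(self):
        # Placeholder for the CV pipeline: the real project derives this from
        # object detection on a screenshot. Here it returns a dummy state.
        return np.zeros(self.observation_space.shape, dtype=np.int64), False

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        grid, _ = self._read_game_state()
        return grid, {}

    def step(self, action):
        grid, game_over = self._read_game_state()
        if game_over:
            reward = -10.0                 # penalize game over
        elif self.ACTIONS[action] == "forward":
            reward = 1.0                   # reward forward movement
        elif self.ACTIONS[action] == "backward":
            reward = -0.5                  # penalize inefficient backward moves
        else:
            reward = 0.0                   # lateral moves: neutral here; the real design also rewards strategic patience
        return grid, reward, game_over, False, {}
```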
Watch the Crossy Road obstacle detection video.
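For the computer vision side, the following sketch pairs an object detector with OpenCV template matching for the restart button. The detector is assumed here to be an Ultralytics YOLO model trained on the custom dataset, and the file paths and match threshold are hypothetical; the project's actual models may differ.

```python
# Sketch of the CV pipeline: object detection for the hen/obstacles plus
# template matching for the restart button. Paths and threshold are assumptions.
import cv2
from ultralytics import YOLO

detector = YOLO("cv_models/crossy_road.pt")                          # hypothetical trained weights
restart_template = cv2.imread("templates/restart_button.png", cv2.IMREAD_GRAYSCALE)

def detect_objects(frame_bgr):
    """Return (class_id, [x1, y1, x2, y2]) pairs for the hen and obstacles."""
    results = detector(frame_bgr, verbose=False)[0]
    return [(int(box.cls), box.xyxy[0].tolist()) for box in results.boxes]

def find_restart_button(frame_bgr, threshold=0.8):
    """Return (visible, top_left) using normalized correlation-coefficient matching."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    scores = cv2.matchTemplate(gray, restart_template, cv2.TM_CCOEFF_NORMED)
    _, max_score, _, max_loc = cv2.minMaxLoc(scores)
    return max_score >= threshold, max_loc
```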
- 1st Iteration: Direct Screenshot Input
  - Input: Raw game screenshots.
  - Outcome: Ineffective; the model failed to converge.
- 2nd Iteration: Detected Objects
  - Input: Detected objects from the CV pipeline.
  - Outcome: Some improvement, but performance remained limited.
- 3rd Iteration: Grid Representation
  - Input: The game field transformed into a grid.
  - Outcome: Effective; improved training speed and model performance (a grid-encoding sketch follows this list).
- 4th Iteration: R-IQN with Enhanced Environment
  - Change: Replaced DQN with R-IQN and expanded the set of grid class labels.
  - Outcome: Highly effective; significant improvement in adaptability and performance.
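As a rough illustration of the grid representation introduced in the 3rd iteration, detections can be rasterized into a small integer grid that the agent observes. The grid size, cell mapping, and class labels below are assumptions, not the project's exact encoding.

```python
# Sketch of converting detected objects into a grid observation.
# Grid size, cell size, and the class-label mapping are illustrative assumptions.
import numpy as np

CLASS_LABELS = {"empty": 0, "hen": 1, "car": 2, "log": 3, "water": 4, "tree": 5}

def detections_to_grid(detections, frame_w, frame_h, rows=12, cols=9):
    """detections: list of (label, (x1, y1, x2, y2)) boxes in pixel coordinates."""
    grid = np.zeros((rows, cols), dtype=np.int64)
    cell_w, cell_h = frame_w / cols, frame_h / rows
    for label, (x1, y1, x2, y2) in detections:
        cx, cy = (x1 + x2) / 2, (y1 + y2) / 2          # box centre
        col = min(int(cx // cell_w), cols - 1)
        row = min(int(cy // cell_h), rows - 1)
        grid[row, col] = CLASS_LABELS.get(label, 0)
    return grid

# Example: a hen near the bottom centre and a car on an upper row.
example = [("hen", (350, 560, 410, 600)), ("car", (100, 120, 220, 160))]
print(detections_to_grid(example, frame_w=720, frame_h=640))
```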
The fourth iteration achieved the best performance, with the agent scoring 29 points in the Crossy Road game. The use of the R-IQN model allowed for better handling of sequential dependencies and improved decision-making.
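For reference, here is a minimal PyTorch sketch of the IQN building blocks behind the 4th iteration: the cosine quantile embedding and the quantile Huber loss. The recurrent part of R-IQN (e.g. an LSTM over consecutive grid states) is omitted, and all layer sizes are assumptions rather than the project's exact architecture.

```python
# Minimal IQN sketch (PyTorch assumed): cosine quantile embedding and the
# quantile Huber loss. Sizes and names are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class IQNHead(nn.Module):
    def __init__(self, feature_dim=128, n_actions=4, n_cos=64):
        super().__init__()
        self.n_cos = n_cos
        self.cos_embed = nn.Linear(n_cos, feature_dim)   # embeds sampled quantile fractions
        self.out = nn.Linear(feature_dim, n_actions)

    def forward(self, features, taus):
        # features: (B, feature_dim) state encoding; taus: (B, N) fractions in (0, 1)
        i = torch.arange(self.n_cos, device=taus.device).float()
        cos = torch.cos(torch.pi * i * taus.unsqueeze(-1))       # (B, N, n_cos)
        phi = F.relu(self.cos_embed(cos))                        # (B, N, feature_dim)
        return self.out(features.unsqueeze(1) * phi)             # (B, N, n_actions)

def quantile_huber_loss(pred, target, taus, kappa=1.0):
    # pred: (B, N) quantiles of the taken action; target: (B, N') target quantiles
    td = target.unsqueeze(1) - pred.unsqueeze(2)                 # pairwise TD errors (B, N, N')
    huber = torch.where(td.abs() <= kappa, 0.5 * td ** 2, kappa * (td.abs() - 0.5 * kappa))
    weight = (taus.unsqueeze(2) - (td.detach() < 0).float()).abs()
    return (weight * huber / kappa).sum(dim=1).mean()
```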
Watch the Crossy Road RL agent video.
The detailed report can be found here.
- Object detection inaccuracies may affect the agent's performance.
- The grid representation might oversimplify complex game dynamics.
- The RL training process was computationally intensive and constrained by the available resources.
For further reading and inspiration, explore the following related projects:
- Deep Learning Crossy Road by Yilong Song
- CS221 Poster: Reinforcement Learning for Crossy Road by Sajana Weerawardena, Alwyn Tan, and Nick Rubin
- Deep Q-Learning Crossy Road by Marlon Facey et al.
This project is licensed under the MIT License - see the LICENSE file for details.