Simple environment where a Reinforcement Learning agent has to find a treasure in a room with different obstacles.
Our agent lies in a room discretized in
- MOVE UP
- MOVE LEFT
- MOVE RIGHT
- MOVE DOWN
The room contains three type of special cells:
- A wall the agent cannot pass through
- A cell with a treasure that represents the goal of the game
- A cell with poison
And all of the other cells are empty, allowing regular movement of the agent inside the room.
The agent may start at any position except for the wall cell. If the agent reaches either the treasure or the poison block, the game is over.
The agent receives a reward of +10 if it reaches the treasure block, a reward of −10 if it reaches the poison block and −1 point for the other blocks.