docs: update README

This commit is contained in:
Ting-Jun Wang 2022-07-02 03:25:28 +08:00
parent dccbf639e5
commit 8accb27ccf
Signed by: snsd0805
GPG Key ID: 8DB0D22BC1217D33

View File

@ -1,6 +1,23 @@
# TetrisRL
- Implement a reinforcement learning model which can play Tetris
# Tetris Enviroment
- 10 actions
- 0: don't move
- 1: shift left 1 block
- 2: shift left 2 block
- 3: shift left 3 block
- 4: shift right 1 block
- 5: shift right 2 block
- 6: shift right 3 block
- 7: rotate once
- 8: rotate twice
- 9: rotate three times
- return
- pixel(10*20)
- block_id
- block_location(x, y)
# Version 0
- Policy Gradient Algorithm
- Reward Delay