docs: README

This commit is contained in:
Ting-Jun Wang 2022-07-02 03:21:00 +08:00
parent 1366659a41
commit dccbf639e5
Signed by: snsd0805
GPG Key ID: 8DB0D22BC1217D33

12
README.md Normal file
View File

@ -0,0 +1,12 @@
# TetrisRL
- Implement a reinforcement learning model which can play Tetris
# Version 0
- Policy Gradient Algorithm
- Reward Delay
- Bad Performance
# TODO
- change reward function
- reward baseline
- DQN