update todo

2018-03-23 09:33:02 +01:00 · 2018-03-23 09:33:02 +01:00 · 4765104c7a
commit 4765104c7a
parent 57cead431d
1 changed files with 3 additions and 1 deletions
--- a/4
+++ b/4
@ -5,12 +5,14 @@ however, feel free to copy any snippets of code you find useful.
 TODOs: (that i can remember right now)
 - finish implementing backprop
 - replace evolution strategy algorithm with
-  something that utilizes gradients like PPO
+  something that utilizes backprop like PPO
 - settle on a network architecture
 - normalize and/or embed sprite inputs
 - fix lag-frames skipped-inputs bug
 - detect frames when Mario is in a controllable state
 - fix offscreen sprites sometimes being visible to network
 - add some detection for enemies later in the game
 - compute how many input neurons the network needs instead of hardcoding
 naive:
 - learn any combination of buttons, starting from title screen