|
d87b8e7118
|
add mean adaptation hyperparameter
|
2018-06-10 16:38:25 +02:00 |
|
|
47eb173dac
|
add exists utility function
|
2018-06-10 16:36:15 +02:00 |
|
|
771650613c
|
move dot_mv to nn
|
2018-06-10 16:34:20 +02:00 |
|
|
0100934ac4
|
add antithetic sampling for xNES
|
2018-06-10 16:33:38 +02:00 |
|
|
695730335c
|
set experimental config
|
2018-06-09 19:19:01 +02:00 |
|
|
d6cc49cde1
|
fix learning without negate_trials
|
2018-06-09 18:56:10 +02:00 |
|
|
bcb6cb9da1
|
add xNES optimizer
|
2018-06-09 18:56:10 +02:00 |
|
|
fe9494b0d5
|
refactor ARS out of main (breaks a bunch of stuff)
|
2018-06-09 18:56:10 +02:00 |
|
|
d3e6441c40
|
reduce tile input to 5 per row using new layers
|
2018-06-09 16:20:20 +02:00 |
|
|
dd5ec3dbde
|
make network linear
|
2018-06-09 16:20:07 +02:00 |
|
|
2b4bffb401
|
add Reshape and DenseBroadcast layers
|
2018-06-09 16:17:52 +02:00 |
|
|
ae331ce60b
|
remove remnants of backwards pass
|
2018-06-09 15:24:12 +02:00 |
|
|
f03e80b1b6
|
rename notice
|
2018-06-09 09:47:33 +02:00 |
|
|
cbb094adc9
|
restore flagpole bonus, add missing overlay check
|
2018-06-09 04:35:09 +02:00 |
|
|
9fb98d3fe0
|
allow setting of world-level, plus random option
|
2018-06-09 04:34:21 +02:00 |
|
|
81d6b509d0
|
detect when mario is controllable
|
2018-06-09 01:43:22 +02:00 |
|
|
5f85b92b6d
|
simplify gameconfig button specification
|
2018-06-08 23:59:55 +02:00 |
|
|
9b23327df4
|
add score multiplier
|
2018-06-08 23:59:43 +02:00 |
|
|
431a591481
|
fix offscreen sprites sometimes being visible
|
2018-06-08 15:03:09 +02:00 |
|
|
f576a47282
|
make sprite inputs relative to center of screen
|
2018-06-08 14:52:04 +02:00 |
|
|
fec148fb79
|
don't turbo in playable mode, note overlay bug
|
2018-06-08 14:51:17 +02:00 |
|
|
c30f07f407
|
prevent reward gained from fireworks
|
2018-06-08 14:12:21 +02:00 |
|
|
c40e1f929d
|
fix skipped inputs on lag frames
|
2018-06-08 13:48:59 +02:00 |
|
|
912e114efe
|
update todo
|
2018-06-08 13:47:32 +02:00 |
|
|
e24c3d31a4
|
use argsort
|
2018-06-08 13:46:38 +02:00 |
|
|
d33bdfea62
|
add argsort function
|
2018-06-08 02:46:00 +02:00 |
|
|
374fa4d876
|
cleanup
|
2018-06-08 02:45:07 +02:00 |
|
|
37d404e77d
|
reduce embed layer to values actually used ingame
|
2018-06-07 22:40:31 +02:00 |
|
|
9c8c1ccd0c
|
add tanh activation
|
2018-05-14 08:27:20 +02:00 |
|
|
3030e83d00
|
refactor learn_from_epoch
|
2018-05-14 01:34:08 +02:00 |
|
|
ec19774af5
|
localize a couple more things
|
2018-05-12 23:08:00 +02:00 |
|
|
15f0292485
|
remove defer_prints option (now always true)
|
2018-05-12 22:56:04 +02:00 |
|
|
0fb3b1780f
|
remove some old comments
|
2018-05-12 22:55:04 +02:00 |
|
|
a836314b8b
|
refactor game and utility functions
|
2018-05-12 22:44:53 +02:00 |
|
|
7f34de8e7c
|
add cosine activation
|
2018-05-12 21:51:00 +02:00 |
|
|
7db43038ac
|
adjust range of timed inputs to stdev of roughly 1
|
2018-05-07 16:27:51 +02:00 |
|
|
7357c8ed62
|
move LayerNorm after Relu
|
2018-05-07 16:22:48 +02:00 |
|
|
e3a8a6b87f
|
tweak config
|
2018-05-07 16:22:02 +02:00 |
|
|
946f05bd3e
|
base timed inputs on start of trial time
|
2018-05-07 16:20:59 +02:00 |
|
|
3e7aeb3c91
|
config tweaks and fixes
|
2018-05-07 09:20:22 +02:00 |
|
|
ce64801368
|
fix some inputs
|
2018-05-07 09:19:24 +02:00 |
|
|
5201b75509
|
add Lipschitz heuristic/approximation
|
2018-05-07 05:57:52 +02:00 |
|
|
ee066154b2
|
add test trial logging
|
2018-05-07 05:57:18 +02:00 |
|
|
deb1ea7de0
|
add LayerNorm layer
|
2018-05-07 05:55:58 +02:00 |
|
|
feaf86dc6b
|
allow weights/params file to be configured
|
2018-05-04 21:02:08 +02:00 |
|
|
90922a2bc3
|
add AMSgrad optimizer and logging
|
2018-05-03 16:48:12 +02:00 |
|
|
c7c657513e
|
fix softmax
|
2018-05-03 16:48:12 +02:00 |
|
|
7831f534c9
|
tweaks
|
2018-05-03 16:48:12 +02:00 |
|
|
2bdd67b721
|
add playback_mode
|
2018-05-03 16:48:12 +02:00 |
|
|
b453438055
|
add graycode-like distribution option
|
2018-05-03 16:48:12 +02:00 |
|