Commit Graph

37 Commits

Author SHA1 Message Date
Connor Olding 102eefe98c add PowerSign momentum to ARS, antithetic by default 2018-06-21 05:15:05 +02:00
Connor Olding dc235f5d18 add hidden layer settings 2018-06-17 23:48:43 +02:00
Connor Olding ebc494fb60 overhaul presets system 2018-06-17 05:52:24 +02:00
Connor Olding 155f868f56 another attempt at fixing preset argument 2018-06-16 00:37:19 +02:00
Connor Olding e3695bfb84 rename weight* to param* outside of nn.lua 2018-06-16 00:33:47 +02:00
Connor Olding f3fc95404c overhaul learning rates:
- rename mean_adapt to weight_rate
- sigma and covar update rates can be specified separately
  (sigma_rate, covar_rate)
- base decays on current rates instead of initially configured rates
  (this might break stuff)
- base_rate takes the place of learning_rate
2018-06-16 00:29:15 +02:00
Connor Olding 474bac45b8 update xnes2 preset with better sample efficiency 2018-06-16 00:06:58 +02:00
Connor Olding 655cb6f5b3 add snes2 xnes2 presets, cleanup, tweaks etc. 2018-06-14 22:42:51 +02:00
Connor Olding f512f8ac3a add sigma decay to xNES 2018-06-14 22:40:39 +02:00
Connor Olding a1ec797de0 add playable preset 2018-06-13 22:46:57 +02:00
Connor Olding 6fa042eda5 fix preset failing to default 2018-06-13 06:08:32 +02:00
Connor Olding 7800510d1f add xNES preset, add options, allow preset specified by argument 2018-06-13 03:01:54 +02:00
Connor Olding 719edfa4ae remove unused functions from config 2018-06-13 02:07:14 +02:00
Connor Olding 16acc68850 add presets to config, update config in general 2018-06-13 02:02:56 +02:00
Connor Olding 5c64fcf395 overhaul SNES (importance sampling, adaptation sampling, etc) 2018-06-13 01:19:32 +02:00
Connor Olding b7a9360d6d add min_time setting (cap_time -> max_time) 2018-06-13 00:59:36 +02:00
Connor Olding ccce6a2d55 sigma tweaks 2018-06-12 05:39:22 +02:00
Connor Olding fa0287d966 add sigma decay; move printing to start of epoch 2018-06-10 19:34:17 +02:00
Connor Olding 19cd10382f use experimental config/network 2018-06-10 16:41:45 +02:00
Connor Olding d87b8e7118 add mean adaptation hyperparameter 2018-06-10 16:38:25 +02:00
Connor Olding 0100934ac4 add antithetic sampling for xNES 2018-06-10 16:33:38 +02:00
Connor Olding 695730335c set experimental config 2018-06-09 19:19:01 +02:00
Connor Olding bcb6cb9da1 add xNES optimizer 2018-06-09 18:56:10 +02:00
Connor Olding fe9494b0d5 refactor ARS out of main (breaks a bunch of stuff) 2018-06-09 18:56:10 +02:00
Connor Olding 9fb98d3fe0 allow setting of world-level, plus random option 2018-06-09 04:34:21 +02:00
Connor Olding 9b23327df4 add score multiplier 2018-06-08 23:59:43 +02:00
Connor Olding 374fa4d876 cleanup 2018-06-08 02:45:07 +02:00
Connor Olding 15f0292485 remove defer_prints option (now always true) 2018-05-12 22:56:04 +02:00
Connor Olding e3a8a6b87f tweak config 2018-05-07 16:22:02 +02:00
Connor Olding 3e7aeb3c91 config tweaks and fixes 2018-05-07 09:20:22 +02:00
Connor Olding 5201b75509 add Lipschitz heuristic/approximation 2018-05-07 05:57:52 +02:00
Connor Olding deb1ea7de0 add LayerNorm layer 2018-05-07 05:55:58 +02:00
Connor Olding 90922a2bc3 add AMSgrad optimizer and logging 2018-05-03 16:48:12 +02:00
Connor Olding 7831f534c9 tweaks 2018-05-03 16:48:12 +02:00
Connor Olding 2bdd67b721 add playback_mode 2018-05-03 16:48:12 +02:00
Connor Olding b453438055 add graycode-like distribution option 2018-05-03 16:48:12 +02:00
Connor Olding 545618c70b refactor config vars to their own files 2018-05-03 16:48:12 +02:00