Connor Olding
c81ce0afbb
rename stuff and add a couple missing imports
2018-01-21 22:16:36 +00:00
Connor Olding
bbdb91fcb1
merge and split modules into a package
2018-01-21 22:07:57 +00:00
Connor Olding
db65fbdd62
add Neumann optimizer
2018-01-12 15:42:04 +00:00
Connor Olding
1ebb897f14
use @ operator
2017-10-19 04:12:16 +00:00
Connor Olding
a85ee67780
allow CLRs to use optimizer's learning rate
2017-10-19 04:03:44 +00:00
Connor Olding
763246df98
add RMSpropCentered to model from config
2017-09-26 23:12:40 +00:00
Connor Olding
9bb26b1ec5
add Huber loss
2017-09-25 16:37:52 +00:00
Connor Olding
eb16377ba6
add Adagrad optimizer
2017-09-25 16:06:45 +00:00
Connor Olding
c964f143d2
not true
2017-09-25 07:12:19 +00:00
Connor Olding
5b6fd6259f
update example
2017-09-25 06:28:59 +00:00
Connor Olding
a760c4841b
add fallback to optim.lr in AnnealingLearner
2017-09-25 06:10:54 +00:00
Connor Olding
916c6fe1f0
assert that rituals have been prepared
2017-09-25 06:10:04 +00:00
Connor Olding
615f43c550
support AddSign and PowerSign in config
2017-09-25 06:09:36 +00:00
Connor Olding
fe54002671
remove DumbLearner
2017-09-25 06:09:07 +00:00
Connor Olding
9a7ffe5f0d
add AddSign and PowerSign optimizers
2017-09-25 04:02:17 +00:00
Connor Olding
5c83f063be
remove keras stuff so it won't show on google
...
bleh
2017-09-18 04:42:41 +00:00
Connor Olding
d38e2076f0
allow multi-input and multi-output models
2017-09-16 18:28:05 +00:00
Connor Olding
3386869b30
move actreg tweaking into if statement
...
i was getting division by zero.
2017-09-16 17:33:47 +00:00
Connor Olding
65ba80bb96
skip over irrelevant nodes
2017-09-16 17:31:39 +00:00
Connor Olding
dcbaef3032
use MomentumClip in warmup for stability
2017-09-16 17:30:52 +00:00
Connor Olding
7878f94f43
auto-increment loss filenames to stop clobbering
2017-09-16 17:30:02 +00:00
Connor Olding
e22316a4c9
move losses into Model and refactor methods
2017-09-16 17:05:25 +00:00
Connor Olding
910facf98d
move NLL to core
2017-08-05 10:59:05 +00:00
Connor Olding
0b9c1fe117
allow SGDR to anneal optimizer's learning rate
...
e.g. YellowFin
2017-08-05 10:43:38 +00:00
Connor Olding
dbd6c31ea5
fix final rate calculation
2017-08-05 10:43:18 +00:00
Connor Olding
915b39d783
allow Optimizers to inspect Models (currently unused)
...
the thing that takes advantage of this may or may not be committed,
so this may or may not get reverted.
2017-08-05 10:41:35 +00:00
Connor Olding
de5af4f7f4
allow argument passthru to normalizer in _mr_make_norm
2017-08-05 10:40:39 +00:00
Connor Olding
957ee86e20
add PolyLearner: polynomial learning scheduler
2017-08-05 10:40:06 +00:00
Connor Olding
cc89465adc
tweak comment
2017-08-05 10:39:59 +00:00
Connor Olding
058a779f6c
remove some unused arguments
2017-08-05 10:39:32 +00:00
Connor Olding
2e74c9160c
tweak CubicGB defaults
2017-08-03 03:39:25 +00:00
Connor Olding
001a997e09
correction: batches, not epochs.
2017-08-03 03:38:07 +00:00
Connor Olding
9138f73141
update mnist training
...
crank up the learning rate on emnist and use momentum with gradient clipping.
add a simple restart callback.
remove batch size adapation crap.
remove confidence measures.
2017-08-03 03:36:46 +00:00
Connor Olding
7ac67fba8f
fix Bias layer
2017-08-02 11:37:39 +00:00
Connor Olding
049d966710
remove biasing from Conv1Dper in favor of Bias layer
2017-08-02 11:30:08 +00:00
Connor Olding
4ee2181691
add standalone Bias layer
2017-08-02 11:28:41 +00:00
Connor Olding
e7c12c1f44
add ad-hoc weight-sharing method
2017-08-02 11:28:18 +00:00
Connor Olding
f507dc10f8
remove DenseOneLess
...
not useful.
2017-08-02 10:52:26 +00:00
Connor Olding
4d2251f69f
allow weight sharing; disableable gradient clearing
2017-08-02 10:29:58 +00:00
Connor Olding
89fcd25962
fix wording
2017-08-02 07:00:33 +00:00
Connor Olding
e4fa5bf63f
add positional control to convolution
2017-08-02 06:47:37 +00:00
Connor Olding
5074dcb2aa
add Decimate and Undecimate layers
2017-08-02 06:47:15 +00:00
Connor Olding
f28e8d3a54
add/remove comments and fix code style
2017-08-02 03:59:15 +00:00
Connor Olding
8b3b8d8288
add rough 1D circular convolution
2017-08-02 03:58:24 +00:00
Connor Olding
5d9efa71c1
move SquaredHalved to core
2017-07-25 22:14:17 +00:00
Connor Olding
f43063928e
rename Linear activation to Identity layer
2017-07-25 22:12:27 +00:00
Connor Olding
e5fd937ef6
remove cruft from YellowFin
...
i might just remove YellowFin itself because it isn't working for me.
2017-07-25 21:38:09 +00:00
Connor Olding
2cf38d4ece
finally fix learning rate scheduling for real
...
okay, this is a disaster, but i think i've got it under control now.
the way batch-based learners now work is:
the epoch we're working towards is the truncated part of the epoch variable,
and how far we are into the epoch is the fractional part.
epoch starts at 1, so subtract by 1 when doing periodic operations.
2017-07-25 04:25:35 +00:00
Connor Olding
93547b1974
add a linear (identity) activation for good measure
2017-07-25 04:24:32 +00:00
Connor Olding
6933e21e0e
update mnist example
2017-07-23 04:23:57 +00:00