Dropout WIP #535
base: master
Conversation
@@ -471,10 +471,16 @@ class SparseDistributedRepresentation : public Serializable
   * @param rng The random number generator to draw from. If not given, this
   * makes one using the magic seed 0.
   */
-  void addNoise(Real fractionNoise);
+  void addNoise(Real fractionNoise); //TODO the name is confusing, rename to shuffle ?
is it OK to rename this to shuffle() and have addNoise for the new fn? @ctrl-z-9000-times
I looked at your new addNoise function and I think it will have issues with keeping the sparsity at a reasonable level. I think the sparsity of an SDR will always tend towards 50% as this method is called on it.
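A minimal standalone sketch (not code from the PR) of why repeated per-bit flipping drives the expected sparsity toward 50%: if every bit is flipped with probability p on each pass, the expected sparsity follows s' = s(1-p) + (1-s)p, whose only fixed point is 0.5.

```cpp
#include <cstdio>

int main() {
  double s = 0.02;        // typical SDR sparsity (2%)
  const double p = 0.01;  // per-bit flip probability
  for (int step = 0; step < 1000; ++step)
    s = s * (1.0 - p) + (1.0 - s) * p;  // expected sparsity after one more flip pass
  std::printf("expected sparsity after 1000 flip passes: %f\n", s);  // approaches 0.5
  return 0;
}
```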
That would indeed be wrong. What I intended:
- take the SDR of the current input
- flip 0.01% of its bits
- take the next, new SDR
- flip 0.01% of its bits
So the sparsity would remain roughly the same (actually grow slightly, because there are many more off bits, so flipping a bit on is more probable than flipping one off). It should stay around the x% (2%) + 0.001%.
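(A quick check of that intuition, assuming each bit is flipped independently with probability p on a fresh input: a single pass changes the expected sparsity from s to s(1-p) + (1-s)p. With s = 2% and p = 0.01% that is 0.02 x 0.9999 + 0.98 x 0.0001, roughly 2.01%, so one pass per input only nudges the sparsity up slightly; the drift towards 50% only appears when the same SDR is flipped over and over.)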
void SparseDistributedRepresentation::addNoise2(const Real probability, Random& rng) {
  NTA_ASSERT( probability >= 0.0f and probability <= 1.0f );
  const ElemSparse numFlip = static_cast<ElemSparse>(size * probability);
  if (numFlip == 0) return;
I'm trying to write an efficient implementation, but this has a problem when the probability is so small relative to the SDR size that numFlip rounds to 0. Should we bother with such cases? Return, or assert?
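One possible way around the numFlip == 0 case, sketched here as standalone code rather than as the PR's SparseDistributedRepresentation method (std::mt19937 stands in for the project's Random class, which is an assumption): flip each bit independently with a Bernoulli draw, so arbitrarily small probabilities still have the correct expected effect, at the cost of a full pass over the dense SDR.

```cpp
#include <cstdint>
#include <random>
#include <vector>

// Flip each bit of a dense SDR independently with the given probability.
// Also works when probability * size < 1, where a precomputed flip count
// would round down to zero and do nothing.
void addNoiseBernoulli(std::vector<uint8_t>& dense, float probability,
                       std::mt19937& rng) {
  std::bernoulli_distribution flip(probability);
  for (auto& bit : dense)
    if (flip(rng)) bit = bit ? 0u : 1u;
}
```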
input.addNoise2(0.01f, rng_); //TODO apply at synapse level in Conn?
//TODO fix for probability << input.size
//TODO apply killCells to active output?
//TODO apply dropout to segments? (so all are: synapse, segment, cell/column)
Proof of concept: dropout applied to the input (as noise) and to the output (as killCells).
- I'd prefer this to be applied in Connections (in adaptSegment?)
- Where to apply it?
  - ideally to all of: SP, TM, and synapse, segment, cell/column
  - but that would be computationally infeasible, so..?

Deterministic tests are still expected to fail until we decide on the values and update the exact outputs.
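For illustration of the killCells idea mentioned above, here is one way output dropout could look, as a standalone sketch rather than the PR's actual API: the function name, the list-of-active-indices signature, and the use of std::mt19937 are all assumptions.

```cpp
#include <algorithm>
#include <cstdint>
#include <random>
#include <vector>

// Silence a random fraction of the active output cells for one timestep and
// return the surviving indices, kept sorted like an SDR's sparse index list.
std::vector<uint32_t> dropoutActiveCells(std::vector<uint32_t> active,
                                         float dropFraction,
                                         std::mt19937& rng) {
  std::shuffle(active.begin(), active.end(), rng);
  const auto keep =
      active.size() - static_cast<size_t>(active.size() * dropFraction);
  active.resize(keep);                      // drop the last dropFraction of cells
  std::sort(active.begin(), active.end());  // restore sorted order
  return active;
}
```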
Maybe I don't understand this change, but it seems this will make the HTM perform worse. While it's interesting that the HTM keeps working even when some of its components are disabled, I don't think this belongs in the mainline. Maybe instead you could make an example/demonstration of these fault-tolerance properties (like Numenta did in their SP paper).
It's commonly used in deep learning, where it improves results a lot; to be exact, dropout helps to prevent overfitting. While HTM is already more robust to that (sparse SDR for the output, stimulus threshold on input segments), I want to see whether this helps and by how much. I am looking for biological confirmation and for datasets to prove whether this works better. (It does slow things down a bit, but that is an implementation detail.)

Umm, no components are disabled permanently; this temporarily flips bits, adding noise to the input.
The Hotgym example internally uses dropout.
WIP dropout implementation
EDIT:
Motivation: I believe this change can be considered biological (noise on the signal during transfer) and also in line with deep learning practice (more robust representations). It should be supported by measurable SDR quality metrics, #155.