
Classifier: use map to allow sparse categories WIP #680

Open · wants to merge 1 commit into hotgym_predictor from classifier_map_wip

Conversation

@breznak (Member) commented Sep 20, 2019

Use a map internally for categories_, instead of a vector, which allows us to have sparse categories such as {1, 2, 999} (= 3 entries total). With a vector indexed by the raw label, we would instead have to allocate room for every index up to 999!

Do not merge; for review comments only. This is an attempt to switch the Classifier from contiguous label indices to non-contiguous labels.
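
To make the storage difference concrete, here is a minimal standalone sketch (illustrative names and sizes, not the actual htm.core Classifier internals):

```cpp
// Sketch: why a map suits sparse labels. With std::map, the labels {1, 2, 999}
// occupy exactly 3 entries; a vector indexed by the raw label value would have
// to allocate a slot for every index up to 999.
#include <cstddef>
#include <cstdint>
#include <map>
#include <vector>

int main() {
  const std::size_t sdrSize = 2048; // illustrative SDR width

  // Map keyed by the raw label: only the labels actually seen get a row.
  std::map<uint32_t, std::vector<float>> categories;
  for (uint32_t label : {1u, 2u, 999u})
    categories.emplace(label, std::vector<float>(sdrSize, 0.0f));
  // categories.size() == 3

  // Vector indexed by label: must be sized to the largest label.
  std::vector<std::vector<float>> dense(999 + 1); // 1000 slots for 3 labels
  return 0;
}
```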

@breznak mentioned this pull request Sep 20, 2019
@ctrl-z-9000-times (Collaborator)

I'm not against you changing the classifier to use sparse categories. There are pros and cons to both the map and vector solutions. Most importantly, the map solution is easier to use than the vector.

@Thanh-Binh

@breznak did you get any better performance after changing to use map?

@breznak (Member, Author) commented Nov 5, 2019

better performance after changing to use map?

I haven't got back to this PR yet, and haven't tested performance extensively.

OT: Coincidentally, I'm running performance benchmarks right now; the master branch is faster than when I last measured it (a comparison with Martin's GPU version). And I have prepared another PR that simplifies the TM and makes it faster. I'm now tuning PGO (but the results seem worse than without it (??))
CC @marty1885

@marty1885 commented Nov 5, 2019

Generally, linear searching through a std::vector is faster than the O(log n) lookup in a map when you have, say, < 120 elements. I usually code up an STL-compatible linear_map for small look-up tables in performance-critical applications. Well... I can't share the code, as my implementation was for a commercial project.

Side note: The SDRClassifier in my library is in fact CLAClassifier (which I believe has been deprecated in this repo). I'm hesitant to pull in NN stuff. That is a steep, slippery slope.
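
Since marty1885's implementation isn't shareable, here is a generic sketch of what such a flat, STL-style linear_map typically looks like (an assumption about the design, not his code): a vector of key/value pairs searched linearly, which wins for small N thanks to cache locality and the absence of per-node allocations.

```cpp
#include <algorithm>
#include <cstddef>
#include <utility>
#include <vector>

// Flat map: O(n) lookup, but contiguous memory and no per-node allocations,
// so it usually beats std::map for small element counts.
template <typename K, typename V>
class linear_map {
  std::vector<std::pair<K, V>> data_;

public:
  // Returns the value for `key`, default-constructing it on first access,
  // mirroring std::map::operator[].
  V &operator[](const K &key) {
    auto it = std::find_if(data_.begin(), data_.end(),
                           [&](const auto &kv) { return kv.first == key; });
    if (it != data_.end())
      return it->second;
    data_.emplace_back(key, V{});
    return data_.back().second;
  }

  std::size_t size() const { return data_.size(); }
  auto begin() { return data_.begin(); }
  auto end() { return data_.end(); }
};
```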

@breznak (Member, Author) commented Nov 5, 2019

std::vector is faster than the O(log n) lookup in a map when you have, say, < 120 elements. I usually code up an STL-compatible linear_map for small look-up tables in performance-critical applications

speaking in pseudocode, this is an internal switch:

if small: use vector
else: use hashmap

?

We might try that for Connections, but I'm not sure I'll go for such micro-optimization (yet); a rough sketch of the switch follows below.
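
A rough sketch of that switch (the 120-element cutoff is only marty1885's ballpark figure; `CategoryIndex` and its method are hypothetical names, not existing htm.core API):

```cpp
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <utility>
#include <vector>

// Maps raw labels to dense indices: linear scan over a small vector while the
// category count is low, migrating to a hash map once it grows past a cutoff.
class CategoryIndex {
  static constexpr std::size_t kCutoff = 120; // switch point; tune by benchmark
  std::vector<std::pair<uint32_t, std::size_t>> small_; // linear scan
  std::unordered_map<uint32_t, std::size_t> large_;     // O(1) average lookup
  bool useMap_ = false;

public:
  std::size_t indexOf(uint32_t label) {
    if (!useMap_) {
      for (const auto &kv : small_)
        if (kv.first == label)
          return kv.second;
      small_.emplace_back(label, small_.size()); // new label -> next index
      if (small_.size() > kCutoff) {             // outgrew the vector: migrate
        large_.insert(small_.begin(), small_.end());
        small_.clear();
        useMap_ = true;
        return large_.at(label);
      }
      return small_.back().second;
    }
    auto it = large_.find(label);
    if (it == large_.end())
      it = large_.emplace(label, large_.size()).first; // next dense index
    return it->second;
  }
};
```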

Side note: The SDRClassifier in my library is in fact CLAClassifier (which I believe has been deprecated in this repo). I'm hesitant to pull in NN stuff. That is a steep, slippery slope.

I'm not sure what the CLAClassifier was, but does the "NN stuff" mean you don't want to use anything other than HTM "NN stuff"? The current SDRClassifier is a simple softmax regression mapping an SDR -> result; a minimal sketch of what that means is below.
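
A minimal sketch of "softmax regression over an SDR" (illustrative code, not the htm.core SDRClassifier source): each category keeps one weight per input bit, the score is the sum of weights at the active bits, and softmax turns the scores into a PDF.

```cpp
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Single-layer inference: score[c] = sum of w[c][bit] over the active bits,
// followed by a softmax normalization. No hidden layers involved.
std::vector<double> infer(const std::vector<uint32_t> &activeBits,       // sparse SDR
                          const std::vector<std::vector<double>> &w) {   // [category][bit]
  std::vector<double> pdf(w.size(), 0.0);
  for (std::size_t c = 0; c < w.size(); ++c)
    for (uint32_t bit : activeBits)
      pdf[c] += w[c][bit];                  // linear score for category c
  double sum = 0.0;
  for (double &p : pdf) { p = std::exp(p); sum += p; }
  for (double &p : pdf) p /= sum;           // softmax: probabilities sum to 1
  return pdf;
}
```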

@Thanh-Binh

Thanks all

@marty1885 commented Nov 6, 2019

@breznak

speaking in pseudocode, this is an internal switch:

Exactly. I wonder if we will ever need a hash map, though. Even DNNs rarely do > 120-class classification.

As the current SDRClassifier is a simple softmax regression mapping SDR -> result

Maybe I'm wrong; I thought the SDRClassifier in NuPIC/HTM.core was a simple MLP with softmax activation. I could easily go too far and make Etaler support advanced Deep Learning. (Is that a good thing?)

I'm not sure what CLAClassifier was

Described here: https://www.youtube.com/watch?v=QZBtaP_gcn0
Then I guess it became the KNN Classifier in NuPIC: https://github.com/numenta/nupic/blob/master/src/nupic/algorithms/knn_classifier.py

@Thanh-Binh

@marty1885 you are right. Currently, the SDRClassifier has only one hidden layer + softmax. Maybe by adding more (recurrent) layers we could get better classification quality.

@breznak (Member, Author) commented Nov 6, 2019

will [we] ever need a hash map, though? Even DNNs rarely do > 120-class classification

  • OK, that's another valid point. Maybe I got carried away and we're fine with the vectors.
    • My other motivation for the hashmap was that we wouldn't have to transform labels to indices, but could use the raw values ("cat").

the SDRClassifier in NuPIC/HTM.core is a simple MLP with softmax activation

just a single layer + softmax, but right...

SDRClassifier has only one hidden layer + softmax. Maybe by adding more (recurrent) layers we could get better classification quality

Adding more layers (and recurrent ones only make sense for sequences) should not make any difference if we think HTM works. It is fair to test whether it makes things better (and I hope it should not). But if HTM works correctly, all the information (incl. temporal) is in the SDR and well distributed for trivial classification. So, from the point of view of the theory, improved classifiers should not (need to) exist in the repo.

Actually, my plan is to go in the opposite direction and introduce a biological classifier. It works like 1-NN. I plan to test it on MNIST: create "etalons" for each digit (0-9) -> SDR_0-9. Classification would then be nearest-neighbor (= max overlap) between the etalons and the tested sample.

@Thanh-Binh

@breznak

biological classifier.

Can you tell us more about it?

@breznak (Member, Author) commented Nov 6, 2019

It works like 1-NN. I plan to test it on MNIST: create "etalons" for each digit (0-9) -> SDR_0-9

It should be really simple and builds on the principles of SDR representations.
For a finite set of labels (classification, not regression) we can train HTM's SP, then produce an etalon SDR for each of the categories. Classification is then reduced to a simple "which looks closest to the representation I have just produced?" (max overlap). If the SP works correctly as described, this should be all that is needed for recognizing the discriminative information in the patterns.

@marty1885 commented Nov 6, 2019

@breznak
Could you describe the algorithm in more detail? It sounds like my implementation of CLAClassifier.

@breznak (Member, Author) commented Nov 6, 2019

Could you describe the algorithm in more detail? It sounds like my implementation of CLAClassifier.

Let's demonstrate it on a simple example, MNIST:

  1. unsupervised training of HTM (SP) on the train data
  2. for each label (0-9):
    • pick some data point (here, simply 1)
    • get the SDR of this data, and note the relation label ~ SDR_{label}
    • this is all the "training" the classifier needs
  3. classification:
    • for any data d
    • get SDR_d
    • go through the SDRs stored in step 2 and find the closest match (= max over SDR.overlap(SDR_d, SDR_{label}))
    • return the label associated with the closest-match SDR from above

Basically, it assumes all the needed info is already in the SDR, and that overlap describes closeness of representations, which translates to closeness/semantic similarity of their origins (the raw data).
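
A sketch of the recipe above (self-contained; `SparseSDR`, `overlap`, and `classify` are hypothetical helpers, not existing htm.core API):

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <iterator>
#include <utility>
#include <vector>

using SparseSDR = std::vector<uint32_t>; // sorted indices of active bits

// Overlap = number of active bits the two SDRs share.
std::size_t overlap(const SparseSDR &a, const SparseSDR &b) {
  SparseSDR common;
  std::set_intersection(a.begin(), a.end(), b.begin(), b.end(),
                        std::back_inserter(common));
  return common.size();
}

// Step 3: return the label whose stored etalon SDR (from step 2) has the
// maximum overlap with the SDR of the sample being classified.
int classify(const SparseSDR &sdr_d,
             const std::vector<std::pair<int, SparseSDR>> &etalons) {
  int bestLabel = -1;
  std::size_t bestOverlap = 0;
  for (const auto &e : etalons) {
    const std::size_t ov = overlap(sdr_d, e.second);
    if (ov >= bestOverlap) {
      bestOverlap = ov;
      bestLabel = e.first;
    }
  }
  return bestLabel;
}
```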

@marty1885

I am 60% sure they are the same algorithm.

Again, in an MNIST example:

  1. Train an SP unsupervised
    • assuming the SP generates an n-bit SDR
  2. Initialize 10 arrays a of size n
  3. For each data d in the training set (or a subset of it):
    • data d is associated with label l
    • add d to a[l] (vector addition)
  4. Classification:
    • for any data d' and a threshold th, 0 < th <= 1
    • th is a threshold to reduce the effect of noise
    • go through all the arrays from steps 2/3 and compute s[i] = SDR.overlap(d', a[i] > max(a[i])*th)
    • return argmax(s), which is the best match we can find
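
A sketch of that accumulate-then-threshold variant (hypothetical helper names; `SparseSDR` as in the sketch above):

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

using SparseSDR = std::vector<uint32_t>; // sorted indices of active bits

// Step 3: add one training SDR (d = sp.compute(x)) into the integer
// accumulator array of its label (element-wise vector addition).
void accumulate(const SparseSDR &d, std::vector<uint32_t> &a) {
  for (uint32_t bit : d)
    a[bit] += 1;
}

// Step 4: binarize a[] at max(a)*th to suppress rarely-seen (noisy) bits,
// then count how many of the sample's active bits survive the threshold.
std::size_t thresholdedOverlap(const SparseSDR &sample,
                               const std::vector<uint32_t> &a, double th) {
  const uint32_t maxCount = *std::max_element(a.begin(), a.end());
  const double cutoff = maxCount * th;
  std::size_t ov = 0;
  for (uint32_t bit : sample)
    if (a[bit] > cutoff)
      ++ov;
  return ov;
}
// Classification: argmax over labels l of thresholdedOverlap(sample, a[l], th).
```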

@Thanh-Binh

@breznak @marty1885 if I remember correctly, your algo looks like what htmresearch does for object classification using sensor and location information. During learning, it uses the union of all SDR representations of the same object. During inference, it calculates the overlap between the current SDR and the learned SDR representation.
I am very interested in your classification results vs. Numenta's SDRClassifier.

@breznak (Member, Author) commented Nov 7, 2019

Seems the same in principle to me.

add d to a[l] (vector addition)

Is d your raw data, or an SDR (sp.compute(d))?
And another detail: as I understood it, your vector a[l] has integer elements, while in my version it's binary, as it's still an SDR. What you're using sounds like sim-hashing (added and described in our new SimHashDocumentEncoder).

By learning, it use union of all SDR representation for the same object. By inference, it calculates the overlap between the current SDR and the learned SDR representation.

Yes, this is the principle. IMHO, it's the only plausible way to do classification with HTM.

@marty1885

Is d your raw data, or an SDR (sp.compute(d))?

Ahh, I forgot that. Yes, d is indeed sp.compute(x)

while in my version it's binary

I tried binary initially, but it turned out to be a bad idea, as noise ended up turning the SDR into a dense DR.

@breznak (Member, Author) commented Nov 7, 2019

I tried binary initially, but it turned out to be a bad idea, as noise ended up turning the SDR into a dense DR.

Interesting, indeed. It would suggest the SP had not learned properly (too small an SDR, not sparse enough, ...), or that the concepts (of "1", etc.) are learned as multiple entities.

Would your code be easily applicable to this codebase, or am I better off writing it from scratch?

@marty1885 commented Nov 8, 2019

Would your code be easily applicable to this codebase, or am I better off writing it from scratch?

It is easy to code from scratch. You can use my code from HTMHelper, although I'm using a dense array instead of an SDR there.

It would suggest the SP had not learned properly

I suspect it is a problem with MNIST itself. The images do not fulfill the properties of an SDR; e.g., 1 and 7 can have many overlapping bits even though they are different digits, so the SP will have a hard time separating them.

@dkeeney mentioned this pull request Nov 30, 2019
@Zbysekz closed this Jun 26, 2020
@Zbysekz deleted the classifier_map_wip branch Jun 26, 2020 06:57
@breznak restored the classifier_map_wip branch Jun 26, 2020 07:04
@breznak reopened this Jun 26, 2020