Re: mnist test result on SP+KNN

50014646142 Wed, 11 Mar 2015 20:37:44 -0700

Hello,

KNN classifier is similar to the sensor. both are like regions, withdifferent functions. In mnist_test, just set up the parameters and useapi to spawn the classifier , train it and do the classification. Ifnot using network api, use the "nupic.algorithms.KNNClassifier". BTW,the knn classifier without any preprocessing can achieve 3%-5% errorrate (http://yann.lecun.com/exdb/mnist/). so I think 95.56% accuracyis not bad for KNN.


An

On Wed, 11 Mar 2015 18:23:39 -0500
 Pulin Agrawal <[email protected]> wrote:

Can you please explain how you are using KNN in this case?

Pulin Agrawal
पुलिन अग्रवाल

On Sun, Mar 8, 2015 at 4:40 AM, <[email protected]> wrote:
Hello,
The mnist_test uses ImageSensor to read the image. It seems theimagesensor binarizes the image and parses the binarized image to SP. Inmycode, I just set up a threshold and use it to get the binary data.Thenparse it to SP. For MNIST dataset, the threshold could be 0, anyvaluebigger than 0 is set to 1. since there is almost no noise in MNISTdataset.But the threshold will still effect the digit shape in the binarizedimage(input of SP). I guess that may be also a reason why my code got alower
accuracy.

An


On Sun, 8 Mar 2015 16:52:50 +0800
 曾國維 <[email protected]> wrote:
Hi An Qi,
I'm interesting in your MNIST test on SP. How do you feed the MNISTimage
to SP? Do you use encoder or just feed the binary input to the SP?
(I mean in MNIST the range of the pixel value is from 0 to 255, butSPinput should be a binary vector. How do you convert 0~255 pixelvalue to
binary input?)

Thanks,

Gordon

2015-01-22 7:05 GMT+08:00 <[email protected]>:

 Hi Subutai:
I just tried the code here(https://github.com/
numenta/nupic.research/tree/
master/image_test), and got 95.56% of accuracy. Here is thestatistics.
As we can see some digits had been always recognized as another one.
(2=>0,
7=>1, 5=>3,8=>3,9=>4,3=>5,8=>5,4=>6,5=>6,2=>7,9=>7,2=>8,3=>8,5=>8,4=>9,
7=>9).
It is better than the result I got. But for some input, the SDRstill
lose
its feature.
well, it used network api. so it is not convenient to see whathappens inlower level, and to add some functions to component. I wonder if icoulduse the ImageSensor directly. Now, I'd like to try the sameparameter
without using network api (more flexible).
"the current version of NuPIC does not support hierarchies well". Doyou
mean the connection between region and region or sensor and region?

An Qi

On Wed, 21 Jan 2015 09:04:43 -0800

 Subutai Ahmad <[email protected]> wrote:

 Hi An,
I think you are right. If you don't get good SDRs out of the SP,then
the
rest of the system won't work well. Evaluating the quality of theSDRs
is
usually one of my first steps in any difficult HTM debuggingprocess.
The
way I test it is:
a) to see that the distribution of winning columns is relativelyeven.
You
don't want a small number columns winning all the time.

b) to see that small changes in the input cause small changes in the
output.
c) you'd like to get unique outputs for very different patterns.Testing
with the KNN on the training set is a decent way to check this. You
should
get close to 100% on the training set. This is similar in spirit toa)
above.
To answer your other question, there's been a fair amount ofdiscussion
on
the list already on the differences between HTM and other networks.We
used
to have hierarchy before but the current version of NuPIC does not
support
hierarchies well so that is a big difference.  As such I do not
recommend
using NuPIC today for image recognition problems. (This is atemporary
issue as we want to rethink the way we implement hierarchies.) Some
advantages of the current implementation of HTM's are in theirability
to
handle streaming data, complex temporal sequences and continuous
learning.
The biggest advantage though is that it is a very good startingpoint
for
understanding computational principles of the cortex. We believethat
HTMs
are on the path to understanding all the other components of
intelligence
such as sensorimotor inference, attention, goal directed behavior,etc.
If
you believe that understanding the cortex deeply is the way to build
intelligent systems, then HTMs are the way to go!

--Subutai


On Mon, Jan 19, 2015 at 11:14 PM, <[email protected]> wrote:

 Hello, Subutai:
Thanks for your reply. It is really helpful. Actually, I tried touse
it
before. But it seemed there was something wrong with the code. afterI
fixed the error, and ran it. I found the classifier just learned 1
pattern.
Now it is working. Great.

The only thing not that good is that, it is really slow because the
program read the images from 60000+10000 files. i think it will bemoreefficient to read images from the datasets directly. Currently ifind
the
SDR i got is not very distributed, because i set a high
localAreaDensity
value in order to get a high accuracy of recognition. Even so, thebest
result i got is about 89%. :(
I have some questions about the SDR and TP. Currently, i am tryingto
test
SP+KNN is because i think the SDR is really important for TP. If the
SDR
from the SP is not good enough (KNN classifier didn't recognize itas a
right one), it would lead TP to another way when train with frame
sequences(same pattern). So, how good should the SDR be, for TP? Andif
we
got a bad SDR (KNN recognize it as another pattern), how much willit
effect TP and CLA classifier? The last question is about HTM model.
Recently, some deep learning models is on fire among academies. Like
CNN
(Convolutional neural network ), RNN-LSTM (recurrent neural network-
long
short term memory). If we compare HTM with them, what kind ofadvantage
and
disadvantage does HTM have?

Thank you.

An Qi


On Mon, 19 Jan 2015 17:54:16 -0800
 Subutai Ahmad <[email protected]> wrote

 Hi An,
Please see [1]. It gets 95.5% accuracy. However, please note this isa
very
simplistic system (just SP+KNN). It does not incorporate hierarchy,
temporal pooling, or any sort of learning of invariances.  (BTW,
anything
less than 99% is not considered very good for MNIST. MNIST is all
about
getting those last few corner cases! :-)

--Subutai

[1] https://github.com/numenta/nupic.research/tree/master/image_test


On Sat, Jan 17, 2015 at 11:00 PM, <[email protected]> wrote:

 Hello.
Sorry for the last email. Thx to the rich formatting :( ... I haveto
type
again.

Recently, I got the result of the test. I followed the source code
and
built the Spatial Pooler + KNN classifier. Then I extracted images
from
MNIST dataset(Train/test : 60000/10000) and parsed them to the
model. I
tried to test with different parameters (using small dataset:
Train/Test
-
6000/1000 ), the best recognition result is about 87.6%. After that,
i
tried the full size MNIST dataset, the result is 89.6%. Currently,
this
is
the best result I got.

Here is the statistics. It shows the error counts for each digits.
the
Row
presents the input digit. the column presents the recognitionresult.
Most
of the "7" are recognized as "9". It seems the SDR from SP is still
not
good enough for the classifier.
I found some interesting things. When I let the "inputDimensions"and
"columnDimensions" be "784" and "1024", the result will be around
68%.
If i
use "(28,28)","(32,32)" and keep others the same, the result will be
around
82%. That 's a lot of difference. It seems the array shape will
effect
SP a
lot.

Did any one get a better result? Does any one have some suggestion
about
the parameters or others?

Thank you.
An Qi
Tokyo University of Agriculture and Technology - Nakagawa Laboratory
2-24-16 Naka-cho, Koganei-shi, Tokyo 184-8588
[email protected]

Re: mnist test result on SP+KNN

Reply via email to