Hello,

I am new to fMRI and I think you have a great tool - unfortunately I'm having a bit of trouble getting the SMLR classifier to work for my dataset. I think the problem is in the format of the input data. When I run the start_easy.py example, modified with my own data, and print out the confusion matrix, I get something like this:
----------.
predictions\targets  0.0   1.0   2.0   3.0   4.0    P'   N'  FP  FN  PPV  NPV  TPR SPC  FDR MCC  AUC
       `------       ----  ----  ----  ----  ----
        0.0           20    15    27    21    18   101    0  81   0  0.2  nan    1   0  0.8   0 0.54
        1.0            0     0     0     0     0     0   35   0  15  nan 0.57    0   1  nan   0 0.49
        2.0            0     0     0     0     0     0   47   0  27  nan 0.43    0   1  nan   0 0.21
        3.0            0     0     0     0     0     0   41   0  21  nan 0.49    0   1  nan   0 0.47
        4.0            0     0     0     0     0     0   38   0  18  nan 0.53    0   1  nan   0  0.7
Per target:          ----  ----  ----  ----  ----
        P             20    15    27    21    18
        N             81    86    74    80    83
        TP            20     0     0     0     0
        TN             0    20    20    20    20
Summary \ Means:     ----  ----  ----  ----  ----  20.2 32.2 16.2 16.2 nan  nan  0.2 0.8  nan   0 0.48
       ACC          0.2
       ACC%         19.8
    # of sets        2

There are 4 stimuli (labels 1-4) and 1 fixation condition (label 0), five labels in all. No matter what I do (e.g. remove the fixation volumes from the dataset), the classifier always classifies everything as the first label it sees. That seems unusual: if the dataset were just noisy, I would expect it to assign labels randomly. Instead it assigns everything to a single label, regardless of which label that is - it's almost as if the feature weights were all zero!
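To convince myself that the predictions are degenerate rather than merely noisy, I compared an "always predict label 0" rule against random guessing. This is just a numpy sketch of my reasoning, not PyMVPA code, and the label counts are made up:

```python
# Quick numpy sketch (not PyMVPA code): compare a degenerate
# "always predict label 0" rule against random guessing over 5 labels.
import numpy as np

rng = np.random.default_rng(0)
targets = rng.integers(0, 5, size=101)   # 101 volumes, labels 0-4 (made up)

constant_acc = np.mean(targets == 0)                          # degenerate rule
random_acc = np.mean(targets == rng.integers(0, 5, size=101)) # random guessing

# Both hover around 1/5 = 0.2, so accuracy alone can't distinguish the
# two failure modes - but the confusion matrix can: a noisy classifier
# fills all rows, a degenerate one fills only a single row, which is
# exactly what I see above.
print(constant_acc, random_acc)
```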

My dataset is about 500 files ending in .nii.gz; I import them like this:
dataArr = ['/mvpa/snafps0.nii.gz', '/mvpa/snafps1.nii.gz', ............]

The attributes file has exactly as many entries as the number of nii files in my dataArr, which makes sense of course.
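In case it matters, here is the sanity check I use for that alignment. The helper name and the toy entries are mine, not part of PyMVPA:

```python
# My own sanity check (not part of PyMVPA): every .nii.gz file needs
# exactly one "label chunk" line in the attributes file.
def attributes_match(nii_files, attr_lines):
    """True when there is one attribute entry per volume file."""
    return len(nii_files) == len(attr_lines)

# Toy stand-in for my real ~500-entry list:
data_arr = ['/mvpa/snafps0.nii.gz', '/mvpa/snafps1.nii.gz']
attrs = ['0 0', '1 0']   # one "label chunk" line per volume

print(attributes_match(data_arr, attrs))   # True for my dataset
```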

To check whether the problem is my dataset being noisy or something wrong with its format, I tried to get the classifier to do something useful on the sample Haxby dataset from the Princeton MVPA toolkit. It consists of 10 hdr/img pairs, each holding 121 volumes. I saved each volume as its own .nii file (10 x 121 = 1210 separate nifti files) and created an attributes.txt file with 1210 entries by mapping the regressors in their dataset into the format PyMVPA wants. When I run the same start_easy.py on this dataset, I see the same pattern/problem in the confusion matrix - which shouldn't happen, since the folks over at Princeton supposedly used this dataset successfully.
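The splitting step itself was just slicing along the time axis; here is a numpy sketch of it. The (64, 64, 25) spatial shape is made up, but each run really does have 121 volumes:

```python
# Numpy sketch of my splitting step; the spatial shape is made up,
# but each run really has 121 volumes along the time (last) axis.
import numpy as np

def split_4d(run_4d):
    """Split a 4D (x, y, z, t) run into a list of 3D volumes."""
    return [run_4d[..., t] for t in range(run_4d.shape[-1])]

run = np.zeros((64, 64, 25, 121))   # stand-in for one hdr/img pair
volumes = split_4d(run)

print(len(volumes))        # 121 volumes per run
print(10 * len(volumes))   # 1210 files across the 10 runs
print(volumes[0].shape)    # (64, 64, 25)
```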

The difference between my dataset (and the Princeton one) and the sample dataset shipped with PyMVPA is that each volume in mine appears to be a scan of the entire brain - when I open one in MRIcroN, the mouse wheel scrolls through what looks like the whole brain. Each volume in the example PyMVPA bold.nii.gz, by contrast, is a single slice of one section of a brain (and the posterior part of the slice at that), not a 3D scan I can scroll through.

Is this why the classifier is getting confused? Does it not know how to handle these whole-brain nii volumes? Any suggestions as to what I might be doing wrong? I don't understand why I can't get the Princeton sample data, which I think might even be from the same Haxby experiment as the PyMVPA sample data, to do something meaningful with the SMLR classifier in the start_easy.py example. When I say I don't understand, I mean I am sure I am doing something wrong; it is just not obvious to me, since I have a computer science background rather than a neuroscience background!

Thanks so much for this great package - hopefully I can make it work! :-)


_______________________________________________
Pkg-ExpPsy-PyMVPA mailing list
[email protected]
http://lists.alioth.debian.org/mailman/listinfo/pkg-exppsy-pymvpa
