Hello,

I wonder if anyone could help me think through the issue of testing classifier results for significance, and how it relates to cross-validation.

We are running a design with 8 chunks, each containing 27 trials divided into 3 classes. Let's say we do an eight-way (leave-one-chunk-out) cross-validation. This results in an accuracy value for each set of 27 tests: 8 x 27 for a total of 216 trials that were predicted correctly or incorrectly.

Is it wrong to use a binomial test for significance on the total number of correct predictions out of 216? Or would that be inappropriate given that the 8 cross-validation steps are not really independent of each other, so that we must test each cross-validation step separately as a binomial with n=27? This latter option raises the issue of how to combine results across the 8 tests.

Alternatively, if we use a Monte Carlo simulation to produce a null distribution, we have the same issue -- we are generating this null distribution for each cross-validation step -- and therefore not taking into account the overall success of the cross-validation routine across all 216 trials. Would it make sense to generate a null distribution by scrambling the regressors and re-running the entire cross-validation procedure on the scrambled regressors? If so, does pymvpa have a routine for doing this?
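I don't know offhand whether pymvpa exposes this directly, but the "scramble labels, re-run the whole cross-validation" idea can be illustrated with scikit-learn's permutation_test_score, which permutes labels (within chunks, via the groups argument) and repeats the full leave-one-chunk-out procedure for each permutation. The data here are random stand-ins with the 8-chunk / 27-trial / 3-class structure described above:

```python
import numpy as np
from sklearn.model_selection import GroupKFold, permutation_test_score
from sklearn.svm import LinearSVC

rng = np.random.RandomState(0)

# Stand-in dataset: 216 trials x 50 features, matching the design
# (8 chunks x 27 trials, 9 trials per class per chunk).
X = rng.randn(216, 50)
y = np.tile(np.repeat([0, 1, 2], 9), 8)      # 3 classes within each chunk
chunks = np.repeat(np.arange(8), 27)         # chunk label for each trial

clf = LinearSVC(max_iter=5000)

# Each permutation shuffles y within chunks and re-runs the full
# leave-one-chunk-out cross-validation, building a null distribution
# of whole-procedure accuracies.
score, perm_scores, pvalue = permutation_test_score(
    clf, X, y,
    groups=chunks,
    cv=GroupKFold(n_splits=8),
    n_permutations=99,
    random_state=0,
)
print(score, pvalue)
```

Since the features here are pure noise, the observed accuracy should sit near chance and the p-value should be large; with real data the same call gives a null distribution over complete cross-validation runs, which is exactly the quantity the pooled binomial test tries to approximate.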

Thanks for any input or corrections to my thinking,


Jonas



P.S. we are using pymvpa for several active projects with much pleasure and will happily send you posters/papers when our work is more complete.


----
Jonas Kaplan, Ph.D.
Research Assistant Professor
Brain & Creativity Institute
University of Southern California







_______________________________________________
Pkg-ExpPsy-PyMVPA mailing list
[email protected]
http://lists.alioth.debian.org/mailman/listinfo/pkg-exppsy-pymvpa