hi bertrand,

here is the reason i was initially confused:

when i used mean_square_error as score_func, it gave me p=.98, when i was
pretty positive i had a significant result. but that's because the lower the
value is in the distribution the better it is. this obviously reversed when
i used explained_variance, where things closer to 1 are better.

do you think stating that score_func should return a float between 0 and 1
would be better or to state that if you have a score_func that ranges from 0
to inf and whose lower bound is a better score, then interpret significance
as 1-p_value?

cheers,

satra


On Mon, Oct 3, 2011 at 6:04 PM, Satrajit Ghosh <[email protected]> wrote:

> thanks bertrand. yes, it does answer my question.
>
> one of the reasons i asked is that the randomise function in fsl does the
> opposite and reports 1-p (it is noted in their docs - but one has to read it
> :) ).
>
> cheers,
>
> satra
>
>
>
> On Mon, Oct 3, 2011 at 5:55 PM, bthirion <[email protected]>wrote:
>
>> **
>> It is the probability of observing what you have by chance.
>> [and generally the integral of the null density at the right of the
>> observed value]
>> p<0.02 is significant
>> p<0.98 is not.
>> Do I answer your question ?
>>
>> B
>>
>>
>> On 10/03/2011 11:38 PM, Satrajit Ghosh wrote:
>>
>> hi,
>>
>>  what does the returned p-value represent? (more specifically what is the
>> difference between p=0.02 and p=0.98)
>>
>> cheers,
>>
>> satra
>>
>>
>> ------------------------------------------------------------------------------
>> All the data continuously generated in your IT infrastructure contains a
>> definitive record of customers, application performance, security
>> threats, fraudulent activity and more. Splunk takes this data and makes
>> sense of it. Business sense. IT sense. Common 
>> sense.http://p.sf.net/sfu/splunk-d2dcopy1
>>
>>
>> _______________________________________________
>> Scikit-learn-general mailing 
>> [email protected]https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>>
>>
>>
>>
>> ------------------------------------------------------------------------------
>> All the data continuously generated in your IT infrastructure contains a
>> definitive record of customers, application performance, security
>> threats, fraudulent activity and more. Splunk takes this data and makes
>> sense of it. Business sense. IT sense. Common sense.
>> http://p.sf.net/sfu/splunk-d2dcopy1
>> _______________________________________________
>> Scikit-learn-general mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>>
>>
>
------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and makes
sense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2dcopy1
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to