Re: [R] How to calculate the area under the curve

Frank E Harrell Jr Thu, 22 Oct 2009 08:09:02 -0700

olivier.abz wrote:

Hi all,
I would like to calculate the area under the ROC curve for my predictive
model. I have managed to plot points giving me the ROC curve. However, I do
not know how to get the value of the area under.
Does anybody know of a function that would give the result I want using an
array of specificity and an array of sensitivity as input?
Thanks,
Olivier


Olivier,

The ROC curves in my view just get in the way. They are mainly useful
in that, almost by accident, the area under the curve equals a nice pure
discrimination index. Go for the direct calculation of the ROC area
based on the Wilcoxon-Mann-Whitney-Somers' Dxy rank correlation
approach, e.g., using the Hmisc package's rcorr.cens function, which
provides Dxy = 2(C - .5) where C = ROC area. It also provides the S.E. of
Dxy and thus of C, and generalizes to censored data. This approach uses
the raw data, not sensitivity and specificity (which are improper
scoring rules). This is assuming you are using an external validation
dataset. If this is not the case you will need to use the bootstrap or
intensive cross-validation, e.g., using the rms package's lrm and
validate functions.
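
A minimal sketch of the direct calculation described above, run on
simulated data. The data, the variable names (y, pred) and the helper
roc_area are assumptions made for illustration and are not part of the
original message. The helper computes C from the Wilcoxon-Mann-Whitney
rank sum in base R; the Hmisc call is only run if that package is
installed, and its output should be checked against the Hmisc
documentation.

```r
# Simulated "external validation" data (hypothetical, for illustration only)
set.seed(1)
n    <- 200
y    <- rbinom(n, 1, 0.4)                    # observed binary outcome
pred <- plogis(-0.5 + 1.2 * y + rnorm(n))    # predicted probabilities

# C (ROC area) from the Wilcoxon-Mann-Whitney rank-sum statistic.
# rank() assigns midranks, so pairs tied on pred count as 1/2.
roc_area <- function(pred, y) {
  r  <- rank(pred)
  n1 <- sum(y == 1)                          # number of events
  n0 <- sum(y == 0)                          # number of non-events
  (sum(r[y == 1]) - n1 * (n1 + 1) / 2) / (n1 * n0)
}

C   <- roc_area(pred, y)
Dxy <- 2 * (C - 0.5)                         # Somers' Dxy, as in the message
print(c(C = C, Dxy = Dxy))

# The same indexes, plus a standard error for Dxy, from Hmisc if available
if (requireNamespace("Hmisc", quietly = TRUE)) {
  print(Hmisc::rcorr.cens(pred, y))
}
```

Note that the inputs are the raw predictions and outcomes, not arrays of
sensitivity and specificity. This covers only the external validation
case; it does not do the bootstrap or cross-validation correction.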

Also note that it is not usually appropriate to compare two ROC areas
for choosing a model as this is too insensitive. It is the same as
taking the difference between two scaled Wilcoxon statistics which is
simply not done.


Frank

--
Frank E Harrell Jr   Professor and Chair           School of Medicine
                     Department of Biostatistics   Vanderbilt University

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
