Hi,

I have done an analysis using 'rpart' to construct a Classification Tree. I
am wanting to retain the output in tree form so that it is easily
interpretable. However, I am wanting to compare the 'accuracy' of the tree
to a Random Forest to estimate how much predictive ability is lost by using
one simple tree. My understanding is that the error automatically displayed
by the two functions is calculated differently so it is therefore incorrect
to use this as a comparison. Instead I have produced a table for both
analyses comparing the observed and predicted response. 

E.g. table(data$dependent,predict(model,type="class"))

I am looking for confirmation that (a) it is incorrect to compare the error
estimates for the two techniques and (b) that comparing the
misclassification rates is an appropriate method for comparing the two
techniques.

Thanks

Amy

 

 

Amelia Koch

University of Tasmania

School of Geography and Environmental Studies

Private Bag 78 Hobart

Tasmania, Australia 7001

Ph: +61 3 6226 7454

[EMAIL PROTECTED]

 


        [[alternative HTML version deleted]]

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to