Re: [R] How do I extract Random Forest Terms and Probabilities?

Lopez, Dan Thu, 05 Dec 2013 08:35:26 -0800

Hi Andy,

I have used predict before and in fact when I do that to the train set I get a 
perfect model (i.e. ROC right angle curve in upper left quadrant) which just 
looks like it overfit the data. 
This is not the case with the test set were I get auc of .77.


I wanted to attempt a couple of calibration techniques I learned from Max 
Kuhn's Applied Predictive Modeling book. He uses a train set to do this. But 
with what I have now with the train set there is nothing to calibrate.

That's why I thought I would use the original probabilities from the 
randomForest model that was used to create fm$predicted (fm is my randomForest 
model).

I am still fairly new at predictive modeling and it could be the case that 
maybe I am not understanding something basic here.

Thanks.
Dan

-----Original Message-----
From: Liaw, Andy [mailto:andy_l...@merck.com] 
Sent: Monday, December 02, 2013 8:40 AM
To: arun; R help; Lopez, Dan
Subject: RE: [R] How do I extract Random Forest Terms and Probabilities?

#2 can be done simply with predict(fmi, type="prob").  See the help page for 
predict.randomForest().

Best,
Andy


-----Original Message-----
From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] On 
Behalf Of arun
Sent: Tuesday, November 26, 2013 6:57 PM
To: R help
Subject: Re: [R] How do I extract Random Forest Terms and Probabilities?



Hi,
For the first part, you could do:

fmi2 <- fmi
attributes(fmi2$terms) <- NULL
capture.output(fmi2$terms)
#[1] "Species ~ Sepal.Length + Sepal.Width + Petal.Length + Petal.Width"

A.k.

On Tuesday, November 26, 2013 3:55 PM, "Lopez, Dan" <lopez...@llnl.gov> wrote:
Hi R Experts,

I need your help with two question regarding randomForest.


1.       When I run a Random Forest model how do I extract the formula I used 
so that I can store it in a character vector in a dataframe?
For example the dataframe might look like this if I am running models using the 
IRIS dataset #ModelID,Type,

#001,RF,Species ~ Sepal.Length + Sepal.Width + Petal.Length + Petal.Width

fmi<-randomForest(Species~.,iris,mtry=3,ntry=500)
#I know one place where the information is in fmi$terms but not sure how to 
extract just the formula info. Or perhaps there is somewhere else in fmi that I 
could get this?


2.       How do I get the probabilities (probability-like values) from the 
model that was run? I know for the test set I can use predict. And I know to 
extract the classifications from the model I use fmi$predicted. But where are 
the probabilities?


Dan
Workforce Analyst
HRIM - Workforce Analytics & Metrics
LLNL


    [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Notice:  This e-mail message, together with any attachme...{{dropped:10}}

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] How do I extract Random Forest Terms and Probabilities?

Reply via email to