[Rd] GAMs and survival data

Kris Jones Tue, 06 Apr 2010 16:12:49 -0700

Hello. I'm trying to analyze data, which is looking at the relationship between 
temperature and survival for fish (from fertilization to emergence). Looking at 
the raw data, there appears to be a bell shaped relationship. Ordinarily for 
survival data, I would run a generalized linear model (because the data has a 
binomial error structure). However, I am thinking that running a generalized 
additive model (which I've never used before), as its my understanding that 
they are better able to deal with non-linear relationships. Hopefully this is a 
correct assumption.


Question 1: Unfortunately, the data I have to work with is not formatted to be 
as #successes or # failures, as R seems to want for other generalized models 
(the survival data I have is percent survival). I'm using 'The R Book' by Mick 
Crawley,and have searched online, but haven't had much luck finding the right 
code that will run with percentage data for GAM (and whether or not its 
appropriate to use percentage data). Is it inappropriate to run the analyses 
like this? With Generalized Linear Models R seems to want #successes and 
failures, but the GAM doesn't (it worked--output below). I'm just wondering 
whether it is alright to run the model as I have done (with percentage data)? 

Question 2: For this type of model (GAM), is there a simple way of constructing 
an equation for the model (e.g., to come up with predicted values). This is 
probably not the best, but I've plotted the predicted values in Excel, fitted a 
polynomial trend line and got the equation from there. I'm just wondering if 
there's a more appropriate way to get it in R? 

Not sure if it would be useful, but I've provided the code and output for the 
model below. Any help you can offer would be much appreciated. Thanks in 
advance for your help--I really appreciate it! 

Kris 




> names(data) 
[1] "Temp" "Survival" 
> str(data) 
'data.frame': 17 obs. of 2 variables: 
$ Temp : num 35.6 38.8 39 39 41 ... 
$ Survival: num 0.14 0.972 0.697 0.938 0.83 0.987 0.989 0.9 0.996 0.87 ... 
> 
> Surv<-gam(Survival~s(Temp), quasibinomial, data=na.omit(data)) 
> summary(Surv) 

Family: quasibinomial 
Link function: logit 

Formula: 
Survival ~ s(Temp) 

Parametric coefficients: 
Estimate Std. Error t value Pr(>|t|) 
(Intercept) 1.9938 0.3067 6.501 8.02e-05 *** 
--- 
Signif. codes: 0 â***â 0.001 â**â 0.01 â*â 0.05 â.â 0.1 â â 
1 

Approximate significance of smooth terms: 
edf Ref.df F p-value 
s(Temp) 6.325 6.325 4.065 0.0257 * 
--- 
Signif. codes: 0 â***â 0.001 â**â 0.01 â*â 0.05 â.â 0.1 â â 
1 

R-sq.(adj) = 0.775 Deviance explained = 81.2% 
GCV score = 0.18885 Scale est. = 0.10748 n = 17 





This email has been processed by SmoothZap - www.smoothwall.net


        [[alternative HTML version deleted]]

______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel

[Rd] GAMs and survival data

Reply via email to