dear list!
i am running an anlysis on proportion data using binomial (quasibinomial
family) error structure. My data comprises of two continuous vars, body
size and range size, as well as of feeding guild, nest placement, nest
type and foragig strata as factors. I hope to model with these variables
the preference of primary forests (#successes) by certain bird species.
My code therefore looks like:
y<-cbind(n_forest,n_trials-n_forest)
model<-glm(y~range+body+nstrata+ntype+forage+feed,family=quasibinomial(link=logit),data=dat)
however plausible the approach may look, overdispersion is prevalent
(dispersion estimated at 6.5). I read up on this and learned that in
case of multiple factors, not all levels may yield good results with
logistic regression (Crawley "The R Book"). I subsequently try to
analyse each feeding guild seperately, but to no avail.overdispersion
remains. Given the number of categorical variables in my study, is there
a convenient way to handle the overdispersion? I was trying tree models
to see the most influential variables but again, to no avail.
BTW: It may well be that the data is just bad...
thanks a lot!
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.