Re: [R] Three-component Negative Binomial Mixture: R code

Achim Zeileis Thu, 10 Nov 2016 12:24:23 -0800

On Thu, 10 Nov 2016, danilo.car...@uniparthenope.it wrote:

Thank you for your hints, now the goodness of fit test provides me goodresults, but surprisingly for me the three-component model turns out to beworse than the two-component one (indeed, I focused on the three-componentmixture because the two-component one exhibits a low p-value).
In addition, I have noticed that for some data the function fails to findgood starting values, as you have mentioned in your previuous answer. Theproblem is that the driver FLXMRnegbin() allows to specify only the thetaparameter (and only one value, even in the event of mixtures of two or morecomponents).
I have read the description of flexmix() function too, but it seems that itdoes not allow to set starting values for the parameters of the model. Am Iright? Or is there a way to do it?

No, I don't think so. You could look at the FLXMRnegbin() driver code andtweak this directly to use better starting values etc. But I think thatthe main issue is that the "theta" parameter often diverges to infinity(i.e., a Poisson distribution) if theta is too large in (at least) one ofthe components.

Given that the negbin distribution is a continuous mixture of Poissondistributions, I'm not sure whether approaching such data with a finitemixture of such continuous mixtures.

What to do with this situation, certainly depends on the data you have andthe questions you have about it. And I concur with Bert's advice ofcontacting a local statistics expert for discussion of such issues.


hth,
Z

Achim Zeileis <achim.zeil...@uibk.ac.at> ha scritto:
On Tue, 8 Nov 2016, danilo.car...@uniparthenope.it wrote:
I tried the function flexmix() with the driver FLXMRnegbin() with twocomponents first, in order to compare its results with those provided bymy function mixnbinom(). In particular, I ran the following code:
fm0 <- flexmix(y ~ 1, data = data.frame(y), k = 2, model = FLXMRnegbin())
where "y" is my vector of counts. The previous function provided me thefollowing parameters:
                Comp.1   Comp.2
coef.(Intercept) 1.2746536 1.788578
theta            0.1418201 5.028766
with priors 0.342874 and 0.657126, respectively. I assume that thecoefficients "Intercept" represent the two means of the model (mu1 andmu2),
No, a log link is employed, i.e., exp(1.2746536) and exp(1.788578) are themeans.
while the "theta" coefficients are the size parameters (size1 and size2).
Yes.
Unfortunately, unlike my function mixnbinom(), the model computed withflexmix() did not provide a good fit to my data (p-value ~0).
Is there something wrong in the process above?
Hard to say without a reproducible example. Using parameter values similarto the ones you cite above, the following seems to do a reasonable job:
## packages
library("countreg")
library("flexmix")

## artificial data from two NB distributions:
## 1/3 is NB(mu = 3.5, theta = 0.2) and
## 2/3 is NB(mu = 6.0, theta = 5.0)
set.seed(1)
y <- c(rnbinom(200, mu = 3.5, size = 0.2), rnbinom(400, mu = 6, size = 5))

## fit 2-component mixture model
set.seed(1)
fm <- flexmix(y ~ 1, k = 2, model = FLXMRnegbin())

## inspect estimated parameters -> look acceptable
parameters(fm)
exp(parameters(fm)[1,])
My experience was that finding good starting values may be a problem forflexmix(). So maybe setting these in some better way would be beneficial.
-------------------------------------------------------------
Danilo Carità

PhD Candidate
University of Naples "Parthenope"
Dipartimento di Studi Aziendali e Quantitativi
via G. Parisi, 13, 80132 Napoli - Italy
-------------------------------------------------------------

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Three-component Negative Binomial Mixture: R code

Reply via email to