On Mon, 4 Oct 2004, Christian Hennig wrote:
Hi Michael,
Secondly, and perhaps more difficult, is a second data set. This, when plotted as a histogram, has two clear peaks, perhaps even three, all of which look as though they are normally distributed. So the theory is that my data set is actually made up of two, possibly three, underlying sub-sets of data which are normally distributed, but with different means and standard deviations. So 1) how do I test for this? And 2) how can I estimate the parameters (mean and SD) for the underlying distributions?
The answer to 2, as pointed out already, is to use EMclust in package mclust. Testing for the presence of a mixture is difficult from a theoretical point of view, and as far as I know, nothing is already implemented in R.
For testing for a mixture of two random variables there is the dip test of Hartigan---see diptest on CRAN
David Scott
_________________________________________________________________ David Scott Department of Statistics, Tamaki Campus The University of Auckland, PB 92019 Auckland NEW ZEALAND Phone: +64 9 373 7599 ext 86830 Fax: +64 9 373 7000 Email: [EMAIL PROTECTED]
Graduate Officer, Department of Statistics
______________________________________________ [EMAIL PROTECTED] mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html