Hello, I am trying to find a method to cluster species based on their quantitative traits and, at the same time, obtain a threshold value for each node in the decision tree. My difficulty is that my dependent variable is the list of species names, with each species appearing on a single line with no repetition. All explanatory variables are quantitative. As far as I understand, classification trees need a dependent variable with repeated levels, as in the iris dataset, in which each species appears several times. All the examples employing classification trees that I found use a dependent variable, but I do not have one except for the species names.

Multivariate regression trees (MRT) use a species-by-location matrix as the dependent variable, and traditional hierarchical cluster analysis does cluster species but does not use quantitative data to that end, nor does it produce threshold values. I can run a non-hierarchical cluster analysis such as k-means, but it does not generate threshold values either. My concern is that without threshold values any classification I produce will be restricted to the studied species and will not be applicable to other species found in the studied region, which would be a strong limitation on the use of such a classification.
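To make the goal concrete, here is a minimal sketch of one possible two-step workaround: cluster the species first, then treat the cluster labels as the repeated-level dependent variable and search for the trait threshold that best separates them. This is only an illustration, not an established method from this post. The species names, trait names, and trait values are invented, the starting centers are fixed by hand, and only a single split (a one-node tree) is computed. It is written in plain Python so that it runs without extra packages. In practice the traits would usually be standardized before clustering.

```python
# Hypothetical trait table (invented values): species -> (leaf_area, wood_density)
traits = {
    "sp_A": (12.0, 0.45), "sp_B": (14.0, 0.50), "sp_C": (11.0, 0.48),
    "sp_D": (55.0, 0.80), "sp_E": (60.0, 0.85), "sp_F": (58.0, 0.78),
}
points = list(traits.values())


def sq_dist(a, b):
    """Squared Euclidean distance between two trait vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, centers, n_iter=20):
    """Plain Lloyd's algorithm; fixed starting centers keep it deterministic."""
    for _ in range(n_iter):
        # Assign every species to its nearest center.
        labels = [min(range(len(centers)), key=lambda k: sq_dist(pt, centers[k]))
                  for pt in points]
        # Recompute each center as the mean of its members.
        new_centers = []
        for k in range(len(centers)):
            members = [pt for pt, lab in zip(points, labels) if lab == k]
            if members:
                new_centers.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centers.append(centers[k])
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers


def gini(labels):
    """Gini impurity of a list of cluster labels."""
    n = len(labels)
    return 1.0 - sum((labels.count(k) / n) ** 2 for k in set(labels))


def best_split(points, labels):
    """Return (trait_index, threshold, weighted_impurity) of the best single split."""
    best = None
    for j in range(len(points[0])):
        values = sorted(set(pt[j] for pt in points))
        # Candidate thresholds are midpoints between consecutive observed values.
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2
            left = [lab for pt, lab in zip(points, labels) if pt[j] <= thr]
            right = [lab for pt, lab in zip(points, labels) if pt[j] > thr]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(points)
            if best is None or score < best[2]:
                best = (j, thr, score)
    return best


def predict(new_traits, split, points, labels):
    """Assign a new species to the majority cluster on its side of the threshold."""
    j, thr, _ = split
    go_left = new_traits[j] <= thr
    side = [lab for pt, lab in zip(points, labels) if (pt[j] <= thr) == go_left]
    return max(set(side), key=side.count)


# Step 1: cluster the species (two clusters, starting centers chosen by hand).
labels, centers = kmeans(points, [points[0], points[3]])
# Step 2: find the trait threshold that best reproduces those clusters.
split = best_split(points, labels)
print("cluster labels:", dict(zip(traits, labels)))
print("best split (trait index, threshold, impurity):", split)
# A species not in the original table can now be classified from its traits alone.
print("new species cluster:", predict((20.0, 0.55), split, points, labels))
```

The thresholds obtained this way describe the clusters rather than any independent grouping, so their usefulness depends entirely on how meaningful the clusters are. A full tree would repeat the split search recursively within each branch.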
Thank you very much in advance for any ideas.

Regards,
Alexandre

--
Dr. Alexandre F. Souza
Professor Associado
Chefe do Departamento de Ecologia
Universidade Federal do Rio Grande do Norte
CB, Departamento de Ecologia
Campus Universitário - Lagoa Nova
59072-970 - Natal, RN - Brasil
lattes: lattes.cnpq.br/7844758818522706
http://www.esferacientifica.com.br
https://www.youtube.com/user/alexfadigas
http://www.docente.ufrn.br/alexsouza
orcid.org/0000-0001-7468-3631

_______________________________________________
R-sig-ecology mailing list
R-sig-ecology@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-ecology