Hello,

I am trying to find a method to cluster species based on their quantitative
traits and at the same time obtain threshold value for each node in the
decision tree. My difficulty is that my dependent variable is the list of
species names, each species appearing as a single line with no repetition.
All explanatory variables are quantitative. As far as I understood,
classification trees need a dependent variable with repeated levels as in
the iris dataset, in which each species appears several times. All the
examples employing classification trees I found use a dependent variable,
but I do not have one except for the species names. MRT uses a species by
location matrix as dependent variable, and traditional hierarchical cluster
analysis do cluster species but do not use quantitative data to that aim,
nor produce threshold values. I can run a non-hierarquical cluster analysis
like kmeans, but these do not generate threshold values. My concern is that
without threshold values any classification I produce will be restricted to
the studied species and will not be applicable to different species that
can be found in the studied region, what would be a strong limitation to
the use of such classification.

Thank you very much in advance for any ideas.

Regards,

Alexandre


-- 
Dr. Alexandre F. Souza
Professor Associado
Chefe do Departamento de Ecologia
Universidade Federal do Rio Grande do Norte
CB, Departamento de Ecologia
Campus Universitário - Lagoa Nova
59072-970 - Natal, RN - Brasil
lattes: lattes.cnpq.br/7844758818522706
http://www.esferacientifica.com.br
https://www.youtube.com/user/alexfadigas
http://www.docente.ufrn.br/alexsouza
orcid.org/0000-0001-7468-3631 <http://www.docente.ufrn.br/alexsouza>

        [[alternative HTML version deleted]]

_______________________________________________
R-sig-ecology mailing list
R-sig-ecology@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-ecology

Reply via email to