Hi all,

Since this is my first post in this list: Hello to everyone reading!

In order to use ecological count data in ordination methods like Principal Component Analysis (PCA) or Redundancy Analysis (RDA), they have to be transformed. Ecologists [1,2] proposed a number of useful transformations for count data, such as chi-square or Hellinger transformation, which are also available in the decostand function of the vegan package.

With the advent of high-throughput sequencing, ordination methods have been applied to metagenomic sequence counts (namely of 16S rDNA amplicons). I have been using Hellinger transformation with such data in combination with RDA for quite a while with satisfactory results.

In the past years, new normalization techniques specifically suited for metagenomic count data, namely cumulative-sum scaling (CSS) [3] were developed. The authors claim that their CSS method is superior to traditional total-sum scaling (TSS). TSS is also the first step in Hellinger transformation (followed by taking the square root).

I am thus wondering if I should switch to CSS transformation of amplicon counts for use in RDA - probably in combination with square-root transformation to lessen the impact of highly abundant taxa.

Any experience with this matter or suggestions from your side?

Thanks in advance for sharing your thoughts with me!






References:

1. Legendre P, Gallagher ED. 2001. Ecologically meaningful transformations for ordination of species data. Oecologia 129:271-280.

2. Borcard D, Gillet F, Legendre P. 2011. Numerical ecology with R. Springer, New York.

3. Paulson JN, Stine OC, Bravo HC, Pop M. 2013. Differential abundance analysis for microbial marker-gene surveys. Nat Methods 10:1200-1202.

_______________________________________________
R-sig-ecology mailing list
R-sig-ecology@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-ecology

Reply via email to