Hello,

I am looking for a function that computes a consensus DNA sequence by plurality. I found "consensus" in bio3d which almost does what I need. However, it would be important for me to include ambiguities and not to omit every position that is less than the threshold set in "consensus".

Is anybody aware of a package with a function that includes ambiguities in a consensus sequence?

Thanks already!

Kim

Here are a couple of sequences to illustrate what I would like:

TGCATACACCGACAACATCCTCGACGACTACACCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
AGCATACACCGACAACATCCTCGATGACTACTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
AGCATACACCGACAACATCCTCGATGACTACTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG
CGCCTACACCAACGATGTCCTGGACGACTTCTGCTACTACG

 GC TACACC AC A  TCCT GA GACT CTGCTACTACG
#This is what I get from "consensus" in bio3d with a threshold of 0.85.

HGCMTACACCRACRAYRTCCTSGAYGACTWCTGCTACTACG
# This is what I would like to get.


___________________________________________
Kim Milferstedt, PhD
Postdoctoral Researcher
University of Illinois at Urbana-Champaign
Department of Microbiology
C207 CLSL
601 S. Goodwin Avenue
Urbana, IL 61801

phone: 001-217-244-0721
email: [EMAIL PROTECTED]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to