Dear all, please could you suggest any R functions or packages (or external programs), that
a. take as input a large number (> 10 000) of short 20-30 nt sequences, and do sequence assembly, to reconstruct larger (extended) 30-50 sequences ? b. take as input a larger number of sequences (100 000 - 1 mil) and cluster these sequences in distinct classes based on the sequence similarity ? thanks a lot, bogdan [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.