Package: wnpp Severity: wishlist X-Debbugs-Cc: cru...@debian.org Subject: ITP: seqan-raptor -- pre-filter for querying very large collections of nucleotide sequences Package: wnpp Owner: Michael R. Crusoe <cru...@debian.org> Severity: wishlist
* Package name : seqan-raptor Version : 2.0.0.0.git.fecfbca+ds Upstream Author : Enrico Seiler * URL : https://github.com/seqan/raptor * License : BSD-3-clause Programming Lang: C Description : pre-filter for querying very large collections of nucleotide sequences Raptor is a system for approximately searching many queries such as next-generation sequencing reads or transcripts in large collections of nucleotide sequences. Raptor uses winnowing minimizers to define a set of representative k-mers, an extension of the interleaved Bloom filters (IBFs) as a set membership data structure and probabilistic thresholding for minimizers. Our approach allows compression and partitioning of the IBF to enable the effective use of secondary memory. We test and show the performance and limitations of the new features using simulated and real datasets. Our data structure can be used to accelerate various core bioinformatics applications. Remark: This package is maintained by Debian Med Packaging Team at https://salsa.debian.org/med-team/seqan-raptor