4 bits per nucleotide will waste too much space. On the other hand, NCBI way
(http://blast.wustl.edu/blast/ncbi20ntfmt.html) is too complex. Now I am
thinking of something like this: we encode in batches of 9 bytes. In each
batch, the 1st byte holds flags for the next 8. If its i-th bit is 1, t
> Date: Tue, 12 Feb 2008 09:56:48 +0100
> From: [EMAIL PROTECTED]
> Just one question: if your alphabet has 5 letters, how are you planning
> to use only 2 bits to encode it? Or maybe the N is never actually
> appearing in data?
Almost 2 bits.
This is how a 2 bit encoding is done in NCBI BLAST d
Ivan Vilata i Balaguer wrote:
> - Forwarded message from PyTables <[EMAIL PROTECTED]> -
>
> #152: creating a custom filter
> -+--
> Reporter: [EMAIL PROTECTED] | Owner: somebody
> Type: enhancemen
- Forwarded message from PyTables <[EMAIL PROTECTED]> -
#152: creating a custom filter
-+--
Reporter: [EMAIL PROTECTED] | Owner: somebody
Type: enhancement | Status: new
Prior