Hi,

When I run wgsim, I see the following output. But I can't find a
document for this format.

[wgsim_core] calculating the total length of the reference sequence...
[wgsim_core] 1 sequences, total length: 10000
chr1    221     C       M       +
chr1    882     A       R       +
chr1    1670    A       G       -
chr1    2526    C       G       -
chr1    3783    A       C       -
chr1    4231    C       M       +
chr1    5227    T       C       -
chr1    6371    T       W       +
chr1    6403    C       G       -
chr1    6621    T       -       +
chr1    8055    T       K       +
chr1    9956    C       A       -

I understand that the 3rd column is IUPAC nucleotide code. Does "-"
mean deletion?

http://www.bioinformatics.org/sms/iupac.html

What does the 4th column mean? Thanks.

-- 
Regards,
Peng

------------------------------------------------------------------------------
_______________________________________________
Samtools-help mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/samtools-help

Reply via email to