Hi Wolfgang,

AFAIK, samtools can only call short variants (SNPs / indels), despite the
fact that you'll see evidence in the raw mpileup format. I think you'd need
to use something like BreakDancer, DELLY, or even SGA's graph-diff tool
(I'm experimenting with this for longer insertions, but don't have any
conclusions yet.

Others probably have better suggestions; this isn't an area I know very
well ...

~Joe





On Tue, Jun 3, 2014 at 3:48 AM, Wolfgang Maier <
[email protected]> wrote:

> I can provide a bit more details to my question. I've
> managed to create bam
> input that gives rise to the following mpileup output:
>
> [snip]
>
> chrX    8982279 g       10      ..........      EJIJJJJC;J
> chrX    8982280 a       10      ..........      DHJJJJJ@HJ
> chrX    8982281 a       10      ..........      DHHJJJJIBI
> chrX    8982282 a       10      ..........      DHHHJJJIEJ
> chrX    8982283 a       10      ..........      DHHGHJJHEJ
> chrX    8982284 a       10      ..........      DHHHHHJHIJ
> chrX    8982285 c       10      ..........      DFHHHHHFIJ
> chrX    8982286 g       10      ..........      DFFHHHGCGJ
> chrX    8982287 c       10      ..........      DFFFHHH:HJ
> chrX    8982288 a       10      ..........      DFFFFHHDBH
> chrX    8982289 g       10      ..........      DFFFFFHFDH
> chrX    8982290 a       10
>
> .-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTT
>
> TCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACC
>
> AGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTT
>
> TGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGG
>
> CCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAG
>
> GAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGAT
>
> GAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGA
>
> GGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATAT
>
> CGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTA
>
> AGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGG
>
> AACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATTTTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG.-
>
> 1241CAACGTAAGATTTAAAACATGACTCGTTGCGAACCTCGATCAATGTCGGCTGCGCGATTTTTTCAAAGCTTATTATTTGATACTCACTATCAGGAAAGCAGTACGCAGGAAAGCAACACATTATGAAAATATTTAGAAAATGAGATTTGTGATTGAACCACATAAAACATTCAATTTCATAAACATAAATTATTTATTAAACTATCAACGTACAACAGCAATAACATTTTTGAAGATTGGAACTATAATTGGCTTTACTTGGATTTTCTCTATCCTTTGTGCTATGCCCTTTGCGATCCATCACCGAGCCGATTACATTATGAAAAGCTGGCCAGGGACAGACAACAGAATACCGGTAAAATAGAGTTTATTGACAATAACTCAGCAAAGTCTTTTAAGGTTAAATCTTCAAAAATGTGCATGATAGCAGTGATGTTTGAACCAAAGCTAGCGTCAACTTTTAAGGTAGTTAATAAATAAAAATTATCTTATACAGGGTGCAAGTTTGTATATTTTTTTCAGATTCTATTTCACTTCTCTGCCATAGCATTCTTTGCACTCCCACTGTTTACAATTGTAATTCTCTATGCAAGAATTGCATGTAAGGTAAGAAAAGTTGAACGGTGAAAAGCAAATTTTAAATTCACATTAAAACATTGATTGTTTTAAAATGAACGTTTTAATATCGTAATGAATAACGGATACACGTAAAACAAAAATTAAACAAAAACTCACCTGTTCTTTCATTTAGGTATCCAGCAACAGAACAATTCAACCAGGCGAACTTGATATCACTGAGGAACTGCAAATGAGAATCAATGCAATTTTATGTGAGTTCGGGATGAGTTGTTGATAAAAAACAGATGTAGGAGGGGTAGAAAATTGGAAAGTGAGGAACCGAATATGGGAGGGATTGGAGCGGAAAACGAATGCGTGAGCAGAGACTAGAAATGGATGTACTTAGATGAAGTACAAACTATCGTATTGTTCCATACAATTTTCAGTGGATATTTTTGTATT
>
> TTTCGGAATTTCGAGAACTGCGGTAGTTTATTGTTTAGTAACAGAGTTTTTAGTAACAAAGAGTAGAAACAAGGTAAAGTGACAATTTATGCAAATTTTGGGTTTTCTGGATGAATACTGTTTTTCACTTTAACTGTTATATTTTGTCACTACAGATAAAATATTGTATGTTCCCAAAAAGTGAGAAAAATATGAGCACATTTTGTCATTGTCTAGTGGCG
> DCFFFFFHHH
> chrX    8982291 c       10      **********      DCCFFFFDDH
> chrX    8982292 a       10      **********      DCCFFFFDDH
> chrX    8982293 a       10      **********      DCCFFFFDDH
> chrX    8982294 c       10      **********      DCCFFFFDDH
> chrX    8982295 g       10      **********      DCCFFFFDDH
> chrX    8982296 t       10      **********      DCCFFFFDDH
> chrX    8982297 a       10      **********      DCCFFFFDDH
> chrX    8982298 a       10      **********      DCCFFFFDDH
> chrX    8982299 g       10      **********      DCCFFFFDDH
> chrX    8982300 a       10      **********      DCCFFFFDDH
> chrX    8982301 t       10      **********      DCCFFFFDDH
> chrX    8982302 t       10      **********      DCCFFFFDDH
> chrX    8982303 t       10      **********      DCCFFFFDDH
> chrX    8982304 a       10      **********      DCCFFFFDDH
> chrX    8982305 a       10      **********      DCCFFFFDDH
> chrX    8982306 a       10      **********      DCCFFFFDDH
> chrX    8982307 a       10      **********      DCCFFFFDDH
> chrX    8982308 c       10      **********      DCCFFFFDDH
> chrX    8982309 a       10      **********      DCCFFFFDDH
> chrX    8982310 t       10      **********      DCCFFFFDDH
> chrX    8982311 g       10      **********      DCCFFFFDDH
> chrX    8982312 a       10      **********      DCCFFFFDDH
> chrX    8982313 c       10      **********      DCCFFFFDDH
> chrX    8982314 t       10      **********      DCCFFFFDDH
> chrX    8982315 c       10      **********      DCCFFFFDDH
> chrX    8982316 g       10      **********      DCCFFFFDDH
> chrX    8982317 t       10      **********      DCCFFFFDDH
> chrX    8982318 t       10      **********      DCCFFFFDDH
> chrX    8982319 g       10      **********      DCCFFFFDDH
> chrX    8982320 c       10      **********      DCCFFFFDDH
> chrX    8982321 g       10      **********      DCCFFFFDDH
> chrX    8982322 a       10      **********      DCCFFFFDDH
> chrX    8982323 a       10      **********      DCCFFFFDDH
> chrX    8982324 c       10      **********      DCCFFFFDDH
> chrX    8982325 c       10      **********      DCCFFFFDDH
> chrX    8982326 t       10      **********      DCCFFFFDDH
> chrX    8982327 c       10      **********      DCCFFFFDDH
> chrX    8982328 g       10      **********      DCCFFFFDDH
> chrX    8982329 a       10      **********      DCCFFFFDDH
> chrX    8982330 t       10      **********      DCCFFFFDDH
> chrX    8982331 c       10      **********      DCCFFFFDDH
> chrX    8982332 a       10      **********      DCCFFFFDDH
>
> [/snip]
>
> indicating a 1241bp deletion on chrX (actually, this is
> C.elegans data).
>
> However, when I do genotype calling on the same bam input
> using
>
> samtools mpileup -r chrX:8900000-9000000 -Dgu -f
> WS220.64_chr.fa input.bam |
> bcftools view -gv - > output.vcf
>
> then that deletion is totally ignored (I tried already -
> with no effect -
> disabling BAQ calculation and -Q 0 just to make sure those
> aren't the problem).
>
> I am just looking for an explanation for this behavior and,
> ideally, a
> solution that allows me to keep using samtools for aligned
> input like this.
>
> The thought behind this is that I would like to stick with
> one variant
> caller for everything (SNVs, indels, deletions) instead of
> invoking
> different ones for every variant type.
>
> Any thoughts?
>
> Wolfgang
>
>
> Wolfgang Maier <wolfgang.maier <at>
> biologie.uni-freiburg.de> writes:
>
> >
> > I'm currently experimenting with split-reads alignment to
> detect
> > deletions in WGS data and I'm wondering whether samtools
> mpileup
> > combined with bcftools is able to call larger deletions
> (typically
> > several hundred bases) as opposed to just small indels
> when presented
> > with appropriately aligned reads.
> >  From a first test with samtools 0.1.19 and engineered
> bam data
> > simulating a 1.4 kb deletion the answer seems to be no,
> but I'd be
> > grateful for any help or explanation why it can't work.
> >
> > Thanks a lot,
> > Wolfgang
> >
>
>
>
> ------------------------------------------------------------------------------
> Learn Graph Databases - Download FREE O'Reilly Book
> "Graph Databases" is the definitive new guide to graph databases and their
> applications. Written by three acclaimed leaders in the field,
> this first edition is now available. Download your free book today!
> http://p.sf.net/sfu/NeoTech
> _______________________________________________
> Samtools-help mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/samtools-help
>



-- 
Joseph Fass
Lead Data Analyst
UC Davis Genome Center - Bioinformatics Core
http://bioinformatics.ucdavis.edu/
[email protected]
phone ~ 530.752.2698
------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their 
applications. Written by three acclaimed leaders in the field, 
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/NeoTech
_______________________________________________
Samtools-help mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/samtools-help

Reply via email to