Bug#692747: Implement -raw for pdftohtml

2012-11-08 Thread Pino Toscano
severity 692747 wishlist thanks Hi, Alle giovedì 8 novembre 2012, Mathieu Malaterre ha scritto: > I am trying to convert the following two columns PDF document to > HTML: > > $ wget http://www.hpca.ual.es/~vruiz/papers/ORTIZ04b.pdf > > pdftohtml completely messes up the output since it is readi

Bug#692747: Implement -raw for pdftohtml

2012-11-08 Thread Mathieu Malaterre
Package: poppler-utils Version: 0.12.4-1.2 Severity: important I am trying to convert the following two columns PDF document to HTML: $ wget http://www.hpca.ual.es/~vruiz/papers/ORTIZ04b.pdf pdftohtml completely messes up the output since it is reading in 'layout' mode, using line1 from col1, t