Your message dated Fri, 19 Mar 2010 13:10:14 +0100
with message-id <[email protected]>
and subject line Re: Bug#572522: ocrodjvu: new problem with cuneiform engine
has caused the Debian Bug report #572522,
regarding ocrodjvu: crashes with ValueError on malformed hOCR
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)
--
572522: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=572522
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: ocrodjvu
Version: 0.4.2-1
Severity: normal
On Mon, 01 Mar 2010 [email protected] wrote:
> The input file is temporarily available at
> http://fleksem.klf.uw.edu.pl/~jsbien/tmp/in.djvu.
Now I get:
----------------------------------------------------------------------------------------------
ocrodjvu --render all --engine cuneiform --language pol --clear-text -o
out.djvu in.djvu
Processing 'in.djvu':
- Page #1
- Page #2
Exception in thread Thread-2:
Traceback (most recent call last):
File "/usr/lib/python2.5/threading.py", line 486, in __bootstrap_inner
self.run()
File "/usr/lib/python2.5/threading.py", line 446, in run
self.__target(*self.__args, **self.__kwargs)
File "/usr/share/ocrodjvu/lib/_ocrodjvu.py", line 443, in page_thread
result = self.process_page(page)
File "/usr/share/ocrodjvu/lib/_ocrodjvu.py", line 423, in process_page
page_size=size
File "/usr/share/ocrodjvu/lib/hocr.py", line 457, in extract_text
scan_result = scan(doc.find('/body'), settings)
File "/usr/share/ocrodjvu/lib/hocr.py", line 419, in scan
_scan(node, buffer, BBox(), settings)
File "/usr/share/ocrodjvu/lib/hocr.py", line 394, in _scan
look_down(result, bbox)
File "/usr/share/ocrodjvu/lib/hocr.py", line 342, in look_down
_scan(child, buffer, parent_bbox, settings)
File "/usr/share/ocrodjvu/lib/hocr.py", line 407, in _scan
result[:] = _replace_cuneiform08_paragraph(result[:], settings)
File "/usr/share/ocrodjvu/lib/hocr.py", line 234, in
_replace_cuneiform08_paragraph
raise ValueError
ValueError
----------------------------------------------------------------------------------------------
JSB
-- System Information:
Debian Release: squeeze/sid
APT prefers testing
APT policy: (500, 'testing')
Architecture: i386 (i686)
Kernel: Linux 2.6.32-trunk-486
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages ocrodjvu depends on:
ii djvulibre-bin 3.5.22-8 Utilities for the DjVu image forma
ii python 2.5.4-9 An interactive high-level object-o
ii python-argparse 1.0.1-1 optparse-inspired command-line par
ii python-djvu 0.1.17-1 Python support for the DjVu image
ii python-lxml 2.2.4-1+b1 pythonic binding for the libxml2 a
ii python-support 1.0.6 automated rebuilding support for P
Versions of packages ocrodjvu recommends:
ii ocropus 0.3.1-2 document analysis and OCR system
ii python-pyicu 0.9-2 Python extension wrapping the ICU
ii tesseract-ocr 2.04-2 Command line OCR tool
Versions of packages ocrodjvu suggests:
ii cuneiform 0.7.0+dfsg-5 multi-language OCR system
-- no debconf information
--- End Message ---
--- Begin Message ---
Version: 0.4.3-1
* Jakub Wilk <[email protected]>, 2010-03-04, 19:57:
I cannot do much about this, except making the error message more
helpful.
Error message should be a bit more meaningful in ocrodjvu 0.4.3.
--
Jakub Wilk
signature.asc
Description: Digital signature
--- End Message ---