Bug#672489: should recommend python-html5lib, depend on lxml

May 11th, 2012 - 10:10 am ET by Thomas Koch | Report spam
Package: ocrodjvu
Version: 0.7.9-1
Severity: normal

Hash: SHA256

Hi,

the library package python-html5lib is needed for the --html5 option. I'd also
recommend to move the lxml dependency from recommends to depends. Is there any
use case for ocrodjvu that does not rely on lxml?

Regards,

Thomas Koch

Debian Release: wheezy/sid
APT prefers unstable
APT policy: (500, 'unstable'), (500, 'testing')
Architecture: amd64 (x86_64)

Kernel: Linux 3.2.0-2-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages ocrodjvu depends on:
ii djvulibre-bin 3.5.25.2-4
ii python 2.7.2-10
ii python-argparse 1.2.1-2
ii python-djvu 0.3.9-1
ii python2.7 [python-argparse] 2.7.3~rc2-2.1

Versions of packages ocrodjvu recommends:
ii ocropus <none>
ii python-lxml 2.3.2-1
ii python-pyicu 1.3-1
ii tesseract-ocr 3.02.01-4

Versions of packages ocrodjvu suggests:
pn cuneiform <none>
pn gocr <none>
pn ocrad <none>





To UNSUBSCRIBE, email to debian-bugs-dist-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
email Follow the discussionReplies 1 replyReplies Make a reply

Replies

#1 Jakub Wilk
May 11th, 2012 - 10:40 am ET | Report spam
* Thomas Koch , 2012-05-11, 16:02:
Is there any use case for ocrodjvu that does not rely on lxml?



ocrodjvu imports lxml only if it has to parse hOCR.
Some supported OCR engines (Ocrad, GOCR, Tesseract 2.X) don't use hOCR
as output format.

Jakub Wilk



To UNSUBSCRIBE, email to
with a subject of "unsubscribe". Trouble? Contact

Similar topics