Ewan Mellor created TIKA-2613:
---------------------------------

             Summary: Tesseract 4.0 has removed -psm, so Tika must update
                 Key: TIKA-2613
                 URL: https://issues.apache.org/jira/browse/TIKA-2613
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.17
            Reporter: Ewan Mellor


Tesseract 4.0 (currently in beta-1) has removed the {-psm} flag, in favor of 
{--psm} (with two dashes).

The {--psm} variant was introduced in Nov 2016, so it should be safe to simply 
switch Tika to use the two-dash variant, even for people still using Tesseract 
3.05.

For reference, the Tesseract cset is:

{code}
commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb
Author: Stefan Weil <s...@weilnetz.de>
Date: Sun Mar 25 17:28:33 2018 +0200

Remove deprecated support for -psm argument (#1419)

It was replaced by --psm and deprecated in commit 92d981b93.

Signed-off-by: Stefan Weil <s...@weilnetz.de>
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to