[ https://issues.apache.org/jira/browse/TIKA-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16419752#comment-16419752 ]
Ewan Mellor commented on TIKA-2613: ----------------------------------- Build failures were not this change; they were from TIKA-2621 which went through at the same time. > Tesseract 4.0 has removed -psm, so Tika must update > --------------------------------------------------- > > Key: TIKA-2613 > URL: https://issues.apache.org/jira/browse/TIKA-2613 > Project: Tika > Issue Type: Improvement > Components: parser > Affects Versions: 1.17 > Reporter: Ewan Mellor > Priority: Major > Fix For: 1.18, 2.0.0 > > > Tesseract 4.0 (currently in beta-1) has removed the {{\-psm}} flag, in favor > of {{\-\-psm}} (with two dashes). > The {{\-\-psm}} variant was introduced in Nov 2016, so it should be safe to > simply switch Tika to use the two-dash variant, even for people still using > Tesseract 3.05. > For reference, the Tesseract cset is: > {code} > commit ee201e1f4fa277a4b2ecd751a45d3bf1eba6dfdb > Author: Stefan Weil <s...@weilnetz.de> > Date: Sun Mar 25 17:28:33 2018 +0200 > Remove deprecated support for -psm argument (#1419) > It was replaced by --psm and deprecated in commit 92d981b93. > Signed-off-by: Stefan Weil <s...@weilnetz.de> > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)