[ https://issues.apache.org/jira/browse/TIKA-4043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison resolved TIKA-4043. ------------------------------- Fix Version/s: 2.8.1 Resolution: Fixed > Fix build for variations in tesseract and timezone info in RTFs > --------------------------------------------------------------- > > Key: TIKA-4043 > URL: https://issues.apache.org/jira/browse/TIKA-4043 > Project: Tika > Issue Type: Task > Reporter: Tim Allison > Priority: Major > Fix For: 2.8.1 > > > From [~grossws]: > > * OCR (tesseract) multipage test is still the same, it extracts "Page?2" > > instead of "Page 2" on my laptop; > > * RTFParserTest testMetaDataCounts fails because of different time zone > > since RTF format itself has only local date/time in meta and I fall into > > different size of midnight with my local time (known issue, requires some > > changes in metadata to handle correctly). When building with TZ=UTC works > > fine. > We should fix these. -- This message was sent by Atlassian Jira (v8.20.10#820010)