[ https://issues.apache.org/jira/browse/TIKA-3848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17604948#comment-17604948 ]
Hudson commented on TIKA-3848:
------------------------------

UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #794 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/794/])
TIKA-3848 -- avoid throwing a runtime exception for a header problem. (tallison: [https://github.com/apache/tika/commit/a30c7e337f593cb35a3363156f469dfc1588ce8d])
* (edit) tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/src/main/java/org/apache/tika/parser/dbf/DBFColumnHeader.java

> IllegalArgumentException in DBFColumnHeader.setType()
> -----------------------------------------------------
>
>                 Key: TIKA-3848
>                 URL: https://issues.apache.org/jira/browse/TIKA-3848
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>   Affects Versions: 2.4.1
>            Reporter: Tilman Hausherr
>            Priority: Major
>             Fix For: 2.5.0
>
>
> {noformat}
> "commoncrawl/CC-MAIN-2021-31/2c/80/2c80ca18e8a34133b8defc9813f31bf0058341ab119959ed5f3a2d75affd919f",1,False,"1450","java.lang.IllegalArgumentException: Unrecognized column type for column: 0� *. I regret I don't recognize: H
>     at org.apache.tika.parser.dbf.DBFColumnHeader.setType(DBFColumnHeader.java:55)
>     at org.apache.tika.parser.dbf.DBFFileHeader.readCol(DBFFileHeader.java:111)
>     at org.apache.tika.parser.dbf.DBFFileHeader.parse(DBFFileHeader.java:77)
>     at org.apache.tika.parser.dbf.DBFReader.<init>(DBFReader.java:59)
>     at org.apache.tika.parser.dbf.DBFReader.open(DBFReader.java:65)
>     at org.apache.tika.parser.dbf.DBFParser.parse(DBFParser.java:70)
> {noformat}
> Possible solution: in {{DBFFileHeader.readCol()}} catch the
> {{IllegalArgumentException}} thrown by {{col.setType(colType);}} and throw a
> better exception.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
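
The "possible solution" in the issue description (catch the IllegalArgumentException around the setType call in readCol and rethrow something more descriptive) could look roughly like the sketch below. This is not the code from the commit referenced above. The class DbfHeaderSketch, the exception HeaderFormatException, the reduced ColType enum, and the method signatures are all hypothetical stand-ins chosen so the example compiles on its own.

```java
public class DbfHeaderSketch {

    /** Hypothetical checked exception; stands in for whatever parser-level exception is appropriate. */
    public static class HeaderFormatException extends Exception {
        public HeaderFormatException(String message, Throwable cause) {
            super(message, cause);
        }
    }

    /** Hypothetical, reduced set of column types; not the real Tika enum. */
    public enum ColType { C, N, D, L }

    /** Mimics a setter that rejects an unknown type code with a runtime exception. */
    public static ColType setType(int code) {
        switch (code) {
            case 'C': return ColType.C;
            case 'N': return ColType.N;
            case 'D': return ColType.D;
            case 'L': return ColType.L;
            default:
                throw new IllegalArgumentException(
                        "Unrecognized column type: " + (char) code);
        }
    }

    /**
     * Sketch of the suggested fix: the caller translates the runtime
     * exception into a checked one and keeps the original as the cause.
     */
    public static ColType readCol(String columnName, int typeCode)
            throws HeaderFormatException {
        try {
            return setType(typeCode);
        } catch (IllegalArgumentException e) {
            throw new HeaderFormatException(
                    "Bad DBF header for column '" + columnName + "'", e);
        }
    }

    public static void main(String[] args) {
        try {
            readCol("EXAMPLE", 'H'); // 'H' is the unrecognized type from the report
        } catch (HeaderFormatException e) {
            System.out.println(e.getMessage() + " (cause: " + e.getCause().getMessage() + ")");
        }
    }
}
```

Keeping the original exception as the cause means the stack trace still shows where the unrecognized type was detected, while callers only need to handle the checked exception.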