[ 
https://issues.apache.org/jira/browse/TIKA-3848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17604948#comment-17604948
 ] 

Hudson commented on TIKA-3848:
------------------------------

UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #794 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/794/])
TIKA-3848 -- avoid throwing a runtime exception for a header problem. 
(tallison: 
[https://github.com/apache/tika/commit/a30c7e337f593cb35a3363156f469dfc1588ce8d])
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-miscoffice-module/src/main/java/org/apache/tika/parser/dbf/DBFColumnHeader.java


> IllegalArgumentException in DBFColumnHeader.setType()
> -----------------------------------------------------
>
>                 Key: TIKA-3848
>                 URL: https://issues.apache.org/jira/browse/TIKA-3848
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 2.4.1
>            Reporter: Tilman Hausherr
>            Priority: Major
>             Fix For: 2.5.0
>
>
> {noformat}
> "commoncrawl/CC-MAIN-2021-31/2c/80/2c80ca18e8a34133b8defc9813f31bf0058341ab119959ed5f3a2d75affd919f",1,False,"1450","java.lang.IllegalArgumentException:
>  Unrecognized column type for column: 0�     *. I regret I don't recognize: H
>       at 
> org.apache.tika.parser.dbf.DBFColumnHeader.setType(DBFColumnHeader.java:55)
>       at 
> org.apache.tika.parser.dbf.DBFFileHeader.readCol(DBFFileHeader.java:111)
>       at org.apache.tika.parser.dbf.DBFFileHeader.parse(DBFFileHeader.java:77)
>       at org.apache.tika.parser.dbf.DBFReader.<init>(DBFReader.java:59)
>       at org.apache.tika.parser.dbf.DBFReader.open(DBFReader.java:65)
>       at org.apache.tika.parser.dbf.DBFParser.parse(DBFParser.java:70)
> {noformat}
> Possible solution: in  {{DBFFileHeader.readCol()}} catch the 
> {{IllegalArgumentException}} thrown by {{col.setType(colType);}} and throw a 
> better exception.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to