[ 
https://issues.apache.org/jira/browse/TIKA-4029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17721447#comment-17721447
 ] 

Hudson commented on TIKA-4029:
------------------------------

FAILURE: Integrated in Jenkins build Tika » tika-main-jdk8 #1079 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/1079/])
TIKA-4029 -- change catch logic based on change in commons-csv 1.10.0 
(tallison: 
[https://github.com/apache/tika/commit/de89f69c7fc4e8390ba9a9bea3af1ba608002177])
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-text-module/src/main/java/org/apache/tika/parser/csv/TextAndCSVParser.java


> commons-csv 1.10 changed IllegalStateException to IOException
> -------------------------------------------------------------
>
>                 Key: TIKA-4029
>                 URL: https://issues.apache.org/jira/browse/TIKA-4029
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 2.8.0
>
>
> In the CSVParser, we had a catch statement in case the csv file was 
> malformed.  In the catch statement we read the rest of the stream and wrote 
> it as one big paragraph.  We then threw the exception.
>  
> In commons-csv 1.10.0, it looks like they changed the IllegalStateException 
> to an UncheckedIOException.  So, we're now missing a bunch of text that we 
> used to extract from csv files.
>  
> This issue was discovered during our regression tests in preparation for the 
> release of 2.8.0-rc1
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to