[ https://issues.apache.org/jira/browse/TIKA-1786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14989875#comment-14989875 ]
Hudson commented on TIKA-1786: ------------------------------ SUCCESS: Integrated in tika-trunk-jdk1.7 #881 (See [https://builds.apache.org/job/tika-trunk-jdk1.7/881/]) TIKA-1786 -- clean up logging in tika-batch (tallison: [http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1712572]) * trunk/tika-batch/src/main/java/org/apache/tika/batch/FileResourceConsumer.java * trunk/tika-batch/src/test/java/org/apache/tika/batch/fs/BatchProcessTest.java * trunk/tika-batch/src/test/java/org/apache/tika/batch/fs/FSBatchTestBase.java * trunk/tika-batch/src/test/resources/log4j-on.properties * trunk/tika-batch/src/test/resources/test-input/illegal_xml_chars_in_exception * trunk/tika-batch/src/test/resources/test-input/illegal_xml_chars_in_exception/test0_bad_chars.xml * trunk/tika-batch/src/test/resources/test-input/illegal_xml_chars_in_exception/test1_ok.xml * trunk/tika-core/src/test/java/org/apache/tika/parser/mock/MockParser.java > Downgrade logging severity in FileResourceConsumer and fix handling of > illegal xml characters > --------------------------------------------------------------------------------------------- > > Key: TIKA-1786 > URL: https://issues.apache.org/jira/browse/TIKA-1786 > Project: Tika > Issue Type: Improvement > Components: tika-batch > Reporter: Tim Allison > Assignee: Tim Allison > Priority: Trivial > Fix For: 1.12 > > > FileResourceConsumer logs an xmlified snippet to record problems encountered > during parsing. If a parser includes illegal xml characters in the > ParseException, this exception is caught by the xmlification code and then > logged as an error. > The xmlification code should be robust against illegal characters and we > should downgrade logging severity from error to warnings when there wasn't an > actual error thrown by a parser. -- This message was sent by Atlassian JIRA (v6.3.4#6332)