[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15129480#comment-15129480 ] Tim Allison commented on TIKA-1848: --- Thank you for running DRAT! I think we're ok with CharsetDetector and related classes according to this [thread|http://lucene.472066.n3.nabble.com/Licensing-Question-td4194289.html]. For the test files, I'd be concerned that adding the license will change the test, but I'll take a look tomorrow. > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Bug >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney >Priority: Blocker > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15129484#comment-15129484 ] Lewis John McGibbney commented on TIKA-1848: [~talli...@mitre.org] ACK I've not VOTE'd so by no means is this a blocker IMHO. Would be good to get some kind of clarification though! > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Task >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130325#comment-15130325 ] Tim Allison commented on TIKA-1848: --- So, um, I'll try to fix these in trunk. Do we need an rc2 where these are fixed? If so, will that be cut from trunk or should I make the changes somewhere else? > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Task >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130376#comment-15130376 ] Nick Burch commented on TIKA-1848: -- I'm not sure if our test files should have license headers in them, especially not if it'll break the things we're using to test for! Since we're not adding license metadata to our PNGs, our Ogg files or a Office documents (for just a few examples), I don't see why we should be monkeying with the HTML ones only? The Charset stuff doesn't have our standard header, as it's third party (suitably licensed) code that we've incorporated + re-packaged + bugfixed Is it worth getting DRAT to pull in the excludes we've put into the POMs that normal RAT uses? > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Task >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130400#comment-15130400 ] Tim Allison commented on TIKA-1848: --- I tested adding headers, and they don't break our tests with the exception of test-tika-327 (where I had to put the license under the entity, and then it did work). I'd prefer not to include license headers in our test files if we don't have to. Happy to patch trunk if necessary, but would prefer to leave as is if possible. > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Task >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130582#comment-15130582 ] Lewis John McGibbney commented on TIKA-1848: Hi Folks, I am +1 to this being closed then as will not fix. I agree with the points made. I waned to log then anyways such that we were aware of them and could discuss. I'll gt back and provide my +1 on the VOTE thread 😄 -- *Lewis* > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Task >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130666#comment-15130666 ] Ken Krugler commented on TIKA-1848: --- Unless I'm not understanding the issues properly, I agree with the above - test files don't need license headers, and the character set detector code shouldn't get their existing (3rd party) license headers stomped on by us. Don't know the relative value of DRAT vs RAT, and thus the value of figuring out how to leverage exclusions we've got so DRAT runs clean. Maybe modify this issue to be something like "Exclude test & 3rd party source files from DRAT analysis", lower the priority, and call it good for now? > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Task >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1848) Address issues with Tika 1.12rc#1
[ https://issues.apache.org/jira/browse/TIKA-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130667#comment-15130667 ] Lewis John McGibbney commented on TIKA-1848: Ack Ken -- *Lewis* > Address issues with Tika 1.12rc#1 > - > > Key: TIKA-1848 > URL: https://issues.apache.org/jira/browse/TIKA-1848 > Project: Tika > Issue Type: Task >Affects Versions: 1.12 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney > Fix For: 1.12 > > > The following files for the 1.12rc#1 have unsuitable license headers > {code} > /usr/local/drat/deploy/data/jobs/rat/1454458514778/input/testJAVA.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetDetector.java > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetMatch.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_2022.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_UTF8.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_Unicode.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_mbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecog_sbcs.java > > /usr/local/drat/deploy/data/jobs/rat/1454458508087/input/CharsetRecognizer.java > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/big-preamble.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate-whitespace.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/boilerplate.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/resume.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test-tika-327.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/test.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_1.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_2.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_3.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testHTMLNoisyMetaEncoding_4.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testJsonMultipleInts.html > > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/testlargerbuffer.html > /usr/local/drat/deploy/data/jobs/rat/1454458515805/input/tika434.html > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)