[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426490#comment-13426490 ] Jukka Zitting commented on TIKA-965: I'm not too big a fan of the {{Charset}} classes in

[jira] [Comment Edited] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426490#comment-13426490 ] Jukka Zitting edited comment on TIKA-965 at 8/1/12 10:12 AM: - I'

[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426525#comment-13426525 ] Ray Gauss II commented on TIKA-965: --- Are we likely to run into similar issues with other e

[jira] [Updated] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting updated TIKA-965: --- Attachment: 0001-TIKA-965-Text-Detection-Fails-on-Mostly-Non-ASCII-UT.patch The attached patch implemen

[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426541#comment-13426541 ] Ray Gauss II commented on TIKA-965: --- I have a test file that I've gotten permission to inc

[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426550#comment-13426550 ] Jukka Zitting commented on TIKA-965: I see where you're going, but it's a really tricky

[jira] [Resolved] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files

2012-08-01 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II resolved TIKA-965. --- Resolution: Fixed Fix Version/s: 1.3 Assignee: Ray Gauss II Fair enough. I've incorpor

Build failed in Jenkins: Tika-trunk #906

2012-08-01 Thread Apache Jenkins Server
See Changes: [rgauss] TIKA-965: Text Detection Fails on Mostly Non-ASCII UTF-8 Files - Added looksLikeUTF8 method to TextStatistics - Added check to TextDetector.detect for looksLikeUTF8 - Added testTextNonASCIIUTF8 to AutoDetectPars

Re: Build failed in Jenkins: Tika-trunk #906

2012-08-01 Thread Ray Gauss II
Anyone have ideas on this one? Is it really something I did? On Aug 1, 2012, at 3:17 PM, Apache Jenkins Server wrote: > See > > Changes: > > [rgauss] TIKA-965: Text Detection Fails on Mostly Non-ASCII UTF-8 Files > - Added looksLikeU

Re: Build failed in Jenkins: Tika-trunk #906

2012-08-01 Thread Jukka Zitting
Hi, On Wed, Aug 1, 2012 at 4:22 PM, Ray Gauss II wrote: > Anyone have ideas on this one? Is it really something I did? Looks like a Jenkins problem. The Jenkins setup at Apache has been quite unstable over the last few months. BR, Jukka Zitting

[jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426678#comment-13426678 ] Jukka Zitting commented on TIKA-966: bq. No where in this code path are the dynamically

[jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar

2012-08-01 Thread Gary Karasiuk (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426740#comment-13426740 ] Gary Karasiuk commented on TIKA-966: >> That's as it should be. I did START the bundle,

[jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar

2012-08-01 Thread Gary Karasiuk (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426746#comment-13426746 ] Gary Karasiuk commented on TIKA-966: Essentially running this code: Tika tika = new Tik

[jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426748#comment-13426748 ] Jukka Zitting commented on TIKA-966: You're looking at the wrong place. The dynamic pars

[jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426752#comment-13426752 ] Jukka Zitting commented on TIKA-966: See also the [BundleIT|http://svn.apache.org/repos

[jira] [Commented] (TIKA-885) Possible ConcurrentModificationException while accessing Metadata produced by ParsingReader

2012-08-01 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426842#comment-13426842 ] Jukka Zitting commented on TIKA-885: What I had in mind was something like a {{Metadata.