Re: Build failed in Jenkins: Tika-trunk #507

2011-04-06 Thread Maxim Valyanskiy
Hello! Can anyone restart the Hudson build? Best wishes, Max. On 05.04.2011 16:00, Apache Hudson Server wrote: See -- Started by an SCM change Building remotely on ubuntu2 hudson.util.IOException2: remote fi

[jira] [Created] (TIKA-634) Command Line Parser for Metadata Extraction

2011-04-06 Thread Nick Burch (JIRA)
Command Line Parser for Metadata Extraction --- Key: TIKA-634 URL: https://issues.apache.org/jira/browse/TIKA-634 Project: Tika Issue Type: Improvement Components: parser Affects Versions

Invisible text displayed for headings in doc files

2011-04-06 Thread Julien Nioche
Hi guys, We are currently getting duplicated text for the heading from .doc files e.g. *29. No Partnership or Agency XE "29. No Partnership or Agency" * XE seems to be a flag in MS Word http://taxonomist.tripod.com/indexing/wordflags.html but I don't think it should be displayed. Have I missed

[jira] [Commented] (TIKA-93) OCR support

2011-04-06 Thread Mike (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-93?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13016417#comment-13016417 ] Mike commented on TIKA-93: -- It's been a while since this bug has been visited. I have an upstream is

Jenkins build is back to normal : Tika-trunk » Apache Tika parsers #508

2011-04-06 Thread Apache Hudson Server
See

Jenkins build is back to normal : Tika-trunk #508

2011-04-06 Thread Apache Hudson Server
See

[jira] [Commented] (TIKA-634) Command Line Parser for Metadata Extraction

2011-04-06 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13016459#comment-13016459 ] Nick Burch commented on TIKA-634: - I've done some work on this. We can now use XML files to

Re: Command Line Parser for Metadata Extraction

2011-04-06 Thread Nick Burch
On Tue, 5 Apr 2011, Mattmann, Chris A (388J) wrote: Check out our CmdLineMetExtractor class [2], and this guide [3] on some of our baked in MetExtractors. I think it would be awesome if we could support a similar interface in Tika (I'd love to push those details upstream of OODT). I think you