Subscribe

2014-06-24 Thread Hong-Thai Nguyen
-- Hong-Thai

Build failed

2014-06-24 Thread Hong-Thai Nguyen
Hi all, Sorry about the last wrong mail. I'm unable to build the latest snapshot on my Windows machine. Any ideas? Thanks Tests in error: initializationError(org.apache.tika.bundle.BundleIT): Problem starting test container. Tests run: 1, Failures: 0, Errors: 1, Skipped: 0

[jira] [Created] (TIKA-1354) ForkParser doesn't work in OSGI container

2014-06-24 Thread Michal Hlavac (JIRA)
Michal Hlavac created TIKA-1354: --- Summary: ForkParser doesn't work in OSGI container Key: TIKA-1354 URL: https://issues.apache.org/jira/browse/TIKA-1354 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-1354) ForkParser doesn't work in OSGI container

2014-06-24 Thread Michal Hlavac (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042099#comment-14042099 ] Michal Hlavac commented on TIKA-1354: - I created a simple patch to run ForkParser in

[GitHub] tika pull request: [TIKA-1354] Register ForkParser service in Acti...

2014-06-24 Thread hlavki
GitHub user hlavki opened a pull request: https://github.com/apache/tika/pull/13 [TIKA-1354] Register ForkParser service in Activator and add simple test There may be another way, but I didn't find it. It would be good if somebody with more OSGI knowledge also looked at it. You

[jira] [Commented] (TIKA-1354) ForkParser doesn't work in OSGI container

2014-06-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042129#comment-14042129 ] ASF GitHub Bot commented on TIKA-1354: -- GitHub user hlavki opened a pull request:

[jira] [Commented] (TIKA-1350) OutlookPSTParser: Unknown message type: IPM.Note

2014-06-24 Thread Jonathan Evans (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042171#comment-14042171 ] Jonathan Evans commented on TIKA-1350: -- 0.8.1 is deployed to Maven Central now. Are

[jira] [Resolved] (TIKA-1353) OpenDocumentParser doesn't correctly process metadata

2014-06-24 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-1353. -- Resolution: Fixed Fix Version/s: 1.6 I've fixed those TODOs in r1605124. Now, if a

Re: Patch: self-contained HTML using Data URI

2014-06-24 Thread Andrew Skiba
Hello again. I created an issue https://issues.apache.org/jira/browse/TIKA-1344 for this patch and was advised to implement this in a content handler. So I learned the idea behind RecursiveMetadata and started to look at how to move my change into a handler according to what Nick advised me. I

[GitHub] tika pull request: Bumps libpst version to fix TIKA-1350

2014-06-24 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/tika/pull/12 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[jira] [Commented] (TIKA-1350) OutlookPSTParser: Unknown message type: IPM.Note

2014-06-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042339#comment-14042339 ] ASF GitHub Bot commented on TIKA-1350: -- Github user asfgit closed the pull request at:

Re: Patch: self-contained HTML using Data URI

2014-06-24 Thread Nick Burch
On Tue, 24 Jun 2014, Andrew Skiba wrote: I started with org.apache.tika.parser.microsoft.WordExtractor and immediately saw that it already makes a recursive call to the org.apache.tika.parser.image.ImageParser. But ImageParser currently only enriches metadata, and does not create img element

[jira] [Resolved] (TIKA-1350) OutlookPSTParser: Unknown message type: IPM.Note

2014-06-24 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-1350. -- Resolution: Fixed Fix Version/s: (was: 1.7) 1.6 Dependency bumped in

[jira] [Commented] (TIKA-1353) OpenDocumentParser doesn't correctly process metadata

2014-06-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042352#comment-14042352 ] Hudson commented on TIKA-1353: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #64 (See

[jira] [Commented] (TIKA-1350) OutlookPSTParser: Unknown message type: IPM.Note

2014-06-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042375#comment-14042375 ] Hudson commented on TIKA-1350: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #64 (See

[jira] [Commented] (TIKA-1353) OpenDocumentParser doesn't correctly process metadata

2014-06-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042376#comment-14042376 ] Hudson commented on TIKA-1353: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #64 (See

[jira] [Commented] (TIKA-758) Address TODOs when we upgrade to next PDFBox release

2014-06-24 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042398#comment-14042398 ] Tyler Palsulich commented on TIKA-758: -- Thanks [~talli...@apache.org]. Happy to help!

[jira] [Commented] (TIKA-758) Address TODOs when we upgrade to next PDFBox release

2014-06-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042428#comment-14042428 ] Tim Allison commented on TIKA-758: -- Y, my grand plan after TIKA-1302 is in place would be

[jira] [Commented] (TIKA-1350) OutlookPSTParser: Unknown message type: IPM.Note

2014-06-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14042438#comment-14042438 ] Hudson commented on TIKA-1350: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #65 (See

Re: Build failed

2014-06-24 Thread Tyler Palsulich
Hi Hong-Thai, What version of Java do you have? It seems like this isn't a Tika issue, but one of its dependencies. Can you try deleting your Maven cache and running `mvn clean install`? Tyler On Tue, Jun 24, 2014 at 3:48 AM, Hong-Thai Nguyen thaicha...@gmail.com wrote: Hi all, Sorry about

Re: Review Request 22892: New parser for ENVI header files

2014-06-24 Thread Ann Burgess
On June 24, 2014, 12:28 p.m., Nick Burch wrote: trunk/tika-parsers/src/main/java/org/apache/tika/parser/envi/EnviHeaderParser.java, lines 75-82 https://reviews.apache.org/r/22892/diff/3/?file=615266#file615266line75 This might be better using something like a BufferedReader, so

Re: Review Request 22892: New parser for ENVI header files

2014-06-24 Thread Tyler Palsulich
On June 24, 2014, 12:28 p.m., Nick Burch wrote: Looking into this more, AutoDetectReader is already a subclass of BufferedReader. Should we, as discussed here [1], be reading chunk by chunk, as this code (and TXTParser) is doing manually? If so, we should really just use the built in