[jira] [Resolved] (TIKA-1447) CHM parser: wrong directory list

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen resolved TIKA-1447. Resolution: Fixed CHM parser: wrong directory list

[jira] [Resolved] (TIKA-1446) CHM parser : wrong decompression of aligned blocks

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen resolved TIKA-1446. Resolution: Fixed CHM parser : wrong decompression of aligned blocks

[jira] [Updated] (TIKA-1447) CHM parser: wrong directory list

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen updated TIKA-1447: --- Fix Version/s: 1.7 CHM parser: wrong directory list

[jira] [Resolved] (TIKA-1448) CHM parser : defect in file extraction

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen resolved TIKA-1448. Resolution: Fixed CHM parser : defect in file extraction

[jira] [Resolved] (TIKA-1430) CHM parser gets faulty text (fix found)

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen resolved TIKA-1430. Resolution: Fixed CHM parser gets faulty text (fix found)

[jira] [Updated] (TIKA-1430) CHM parser gets faulty text (fix found)

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen updated TIKA-1430: --- Fix Version/s: 1.7 CHM parser gets faulty text (fix found)

[jira] [Updated] (TIKA-1446) CHM parser : wrong decompression of aligned blocks

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen updated TIKA-1446: --- Fix Version/s: 1.7 CHM parser : wrong decompression of aligned blocks

[jira] [Updated] (TIKA-1448) CHM parser : defect in file extraction

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen updated TIKA-1448: --- Fix Version/s: 1.7 CHM parser : defect in file extraction

[jira] [Updated] (TIKA-672) Proper error handling in the CHM parser

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen updated TIKA-672: -- Fix Version/s: 1.7 Proper error handling in the CHM parser

[jira] [Resolved] (TIKA-672) Proper error handling in the CHM parser

2014-11-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen resolved TIKA-672. --- Resolution: Fixed Check no more System.err/System.out inside CHM parser Proper error handling

Re: Subsets of tika parsers redux

2014-11-24 Thread Sergey Beryozkin
Hi Nick Was good talking to you and thanks for initiating this thread. It is an interesting idea, one that can lead to introducing finer-grained bundles but also providing a mechanism for the (auto-)generation of the import metadata required by each of the parser modules. Besides,

[jira] [Commented] (TIKA-1473) Apache Tika is not working for .docx documents

2014-11-24 Thread Milan Zivkovic (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222927#comment-14222927 ] Milan Zivkovic commented on TIKA-1473: -- Hi, Indeed I was using the FileInputStream,

[jira] [Comment Edited] (TIKA-1473) Apache Tika is not working for .docx documents

2014-11-24 Thread Milan Zivkovic (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222927#comment-14222927 ] Milan Zivkovic edited comment on TIKA-1473 at 11/24/14 11:56 AM:

[jira] [Commented] (TIKA-1332) Create eval code

2014-11-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222947#comment-14222947 ] Tim Allison commented on TIKA-1332: --- In a personal communication, I asked

Re: Subsets of tika parsers redux

2014-11-24 Thread Mattmann, Chris A (3980)
Hey Nick, This sounds like a great plan to me, good job to you and Sergey. As for helping I¹ll try my best, but I¹m not an OSGI guru :) Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data

[jira] [Comment Edited] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223430#comment-14223430 ] Andreas Lehmkühler edited comment on TIKA-1442 at 11/24/14 8:02 PM:

[jira] [Comment Edited] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223430#comment-14223430 ] Andreas Lehmkühler edited comment on TIKA-1442 at 11/24/14 8:09 PM:

[jira] [Commented] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223460#comment-14223460 ] Tim Allison commented on TIKA-1442: --- Thank you for PDFBOX-2520! I'd put my $ on Jenkins.

[jira] [Comment Edited] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223460#comment-14223460 ] Tim Allison edited comment on TIKA-1442 at 11/24/14 8:34 PM: -

[jira] [Commented] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223469#comment-14223469 ] Andreas Lehmkühler commented on TIKA-1442: -- Yes, build 145 should include the

[jira] [Comment Edited] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223460#comment-14223460 ] Tim Allison edited comment on TIKA-1442 at 11/24/14 8:39 PM: -

[jira] [Comment Edited] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223469#comment-14223469 ] Andreas Lehmkühler edited comment on TIKA-1442 at 11/24/14 9:03 PM:

[jira] [Comment Edited] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-11-24 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14223469#comment-14223469 ] Andreas Lehmkühler edited comment on TIKA-1442 at 11/24/14 9:04 PM:

[jira] [Commented] (TIKA-1481) TikaJAXRS get metadata calls give different results

2014-11-24 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14224167#comment-14224167 ] Darya Arbuzova commented on TIKA-1481: -- Thank you, Sergey! I was trying to find a