[jira] [Commented] (TIKA-1373) AutoDetectParser extracts no text when SourceCodeParser is selected

2014-07-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073042#comment-14073042 ] Hong-Thai Nguyen commented on TIKA-1373: HtmlParser skips tags generated by

[jira] [Resolved] (TIKA-1373) AutoDetectParser extracts no text when SourceCodeParser is selected

2014-07-24 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong-Thai Nguyen resolved TIKA-1373. Resolution: Fixed AutoDetectParser extracts no text when SourceCodeParser is selected

[jira] [Commented] (TIKA-1373) AutoDetectParser extracts no text when SourceCodeParser is selected

2014-07-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073052#comment-14073052 ] Hudson commented on TIKA-1373: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #112 (See

[jira] [Commented] (TIKA-1373) AutoDetectParser extracts no text when SourceCodeParser is selected

2014-07-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073077#comment-14073077 ] Hudson commented on TIKA-1373: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #114 (See

[jira] [Commented] (TIKA-1269) Self-hosted documentation for the JAX-RS Server

2014-07-24 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073164#comment-14073164 ] Nick Burch commented on TIKA-1269: -- It's a bit hard to be sure on Miredot when most (all?)

[jira] [Created] (TIKA-1374) Need to add code to look for OS-specific keys for embedded files within PDFs

2014-07-24 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1374: - Summary: Need to add code to look for OS-specific keys for embedded files within PDFs Key: TIKA-1374 URL: https://issues.apache.org/jira/browse/TIKA-1374 Project: Tika

[jira] [Updated] (TIKA-1374) Need to add code to look for OS-specific keys for embedded files within PDFs

2014-07-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1374: -- Description: Embedded files in PDFs can be found by the general all purpose key we currently use via

[jira] [Created] (TIKA-1375) Decrease memory consumption when extracting images from PDFs

2014-07-24 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1375: - Summary: Decrease memory consumption when extracting images from PDFs Key: TIKA-1375 URL: https://issues.apache.org/jira/browse/TIKA-1375 Project: Tika Issue

Re: How should video files with audio be handled by parsers?

2014-07-24 Thread Nick Burch
On Wed, 23 Jul 2014, Ray Gauss wrote: 2) There are are several PBCore instantiation properties that apply to the entire file like duration and tracks that we'd want prefixed with pbcore so I think it would be odd to see:   pbcore:instantiationDuration=00:00:05.20  

[jira] [Created] (TIKA-1376) Improve embedded file name extraction in PDFParser

2014-07-24 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1376: - Summary: Improve embedded file name extraction in PDFParser Key: TIKA-1376 URL: https://issues.apache.org/jira/browse/TIKA-1376 Project: Tika Issue Type:

[GitHub] tika pull request: TIKA-1361: MP4Parser Update

2014-07-24 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/tika/pull/14 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[jira] [Commented] (TIKA-1361) Update MP4Parser to 1.0.2

2014-07-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073543#comment-14073543 ] ASF GitHub Bot commented on TIKA-1361: -- Github user asfgit closed the pull request at:

[jira] [Resolved] (TIKA-1361) Update MP4Parser to 1.0.2

2014-07-24 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-1361. -- Resolution: Fixed Fix Version/s: 1.6 Update MP4Parser to 1.0.2 -

[jira] [Commented] (TIKA-1373) AutoDetectParser extracts no text when SourceCodeParser is selected

2014-07-24 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073557#comment-14073557 ] Tyler Palsulich commented on TIKA-1373: --- bq. HtmlParser skips tags generated by

[jira] [Commented] (TIKA-1269) Self-hosted documentation for the JAX-RS Server

2014-07-24 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073562#comment-14073562 ] Lewis John McGibbney commented on TIKA-1269: Yep I am on it right now. Patch

[jira] [Commented] (TIKA-1361) Update MP4Parser to 1.0.2

2014-07-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073569#comment-14073569 ] Hudson commented on TIKA-1361: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #116 (See

[jira] [Commented] (TIKA-1361) Update MP4Parser to 1.0.2

2014-07-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14073596#comment-14073596 ] Hudson commented on TIKA-1361: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #114 (See