[jira] [Updated] (TIKA-980) MicrodataContentHandler for Apache Tika

2012-08-27 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated TIKA-980: --- Attachment: TIKA-980-1.3-2.patch - improved itemprop attribute handling - moved package to tika-parsers

[jira] [Created] (TIKA-981) Text isn't extracted from PDF pop-up annotations

2012-08-27 Thread Michael McCandless (JIRA)
Michael McCandless created TIKA-981: --- Summary: Text isn't extracted from PDF pop-up annotations Key: TIKA-981 URL: https://issues.apache.org/jira/browse/TIKA-981 Project: Tika Issue Type: B

[jira] [Updated] (TIKA-981) Text isn't extracted from PDF pop-up annotations

2012-08-27 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated TIKA-981: Attachment: TIKA-981.patch Patch with test case and fix. I removed the check for only PDAnno

[jira] [Created] (TIKA-982) RTF document embedded into Word (.doc) document is extracted as .unknown

2012-08-27 Thread Michael McCandless (JIRA)
Michael McCandless created TIKA-982: --- Summary: RTF document embedded into Word (.doc) document is extracted as .unknown Key: TIKA-982 URL: https://issues.apache.org/jira/browse/TIKA-982 Project: Tik

[jira] [Updated] (TIKA-982) RTF document embedded into Word (.doc) document is extracted as .unknown

2012-08-27 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated TIKA-982: Attachment: TIKA-982.patch Patch with test case and fix, adding another case to POIFSContaine

[jira] [Commented] (TIKA-980) MicrodataContentHandler for Apache Tika

2012-08-27 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13442416#comment-13442416 ] Ken Krugler commented on TIKA-980: -- Hi Markus - looks interesting. I'll try to review soon.

[jira] [Created] (TIKA-983) HTML parser should add Open Graph meta tag data to Metadata returned by parser

2012-08-27 Thread Ken Krugler (JIRA)
Ken Krugler created TIKA-983: Summary: HTML parser should add Open Graph meta tag data to Metadata returned by parser Key: TIKA-983 URL: https://issues.apache.org/jira/browse/TIKA-983 Project: Tika

[jira] [Updated] (TIKA-983) HTML parser should add Open Graph meta tag data to Metadata returned by parser

2012-08-27 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler updated TIKA-983: - Attachment: TIKA-983.patch > HTML parser should add Open Graph meta tag data to Metadata returned by pa

[jira] [Resolved] (TIKA-983) HTML parser should add Open Graph meta tag data to Metadata returned by parser

2012-08-27 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler resolved TIKA-983. -- Resolution: Fixed r1377890 > HTML parser should add Open Graph meta tag data to Metadata