[jira] [Commented] (TIKA-818) Allow PDFBox to be used with RandomAccessFile vs RandomAccessBuffer to allow for a memory vs performance tradeoff

2012-01-23 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13190965#comment-13190965 ] Nick Burch commented on TIKA-818: Tika does already handle its own temporary files, via or

Re: ExifTool Parser Conventions

2012-01-23 Thread Nick Burch
On Tue, 17 Jan 2012, Ray Gauss II wrote:
> Is there any convention to be followed yet in terms of naming of such a project or where it should live?
I'm not sure we do, though I think Jukka in particular does have some ideas on a longer-term plan for that sort of thing. One thing that may affect

[jira] [Created] (TIKA-849) Identify and parse the Apple iBooks format

2012-01-23 Thread Andrew Jackson (Created) (JIRA)
Identify and parse the Apple iBooks format
Key: TIKA-849
URL: https://issues.apache.org/jira/browse/TIKA-849
Project: Tika
Issue Type: New Feature
Components: mime, parser
Affects Vers

[jira] [Issue Comment Edited] (TIKA-849) Identify and parse the Apple iBooks format

2012-01-23 Thread Andrew Jackson (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191148#comment-13191148 ] Andrew Jackson edited comment on TIKA-849 at 1/23/12 1:35 PM:

[jira] [Updated] (TIKA-849) Identify and parse the Apple iBooks format

2012-01-23 Thread Andrew Jackson (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Jackson updated TIKA-849: Attachment: ibooks-support.patch This patch identifies *.ibooks files, and passes them through the eP

[jira] [Commented] (TIKA-846) Ability to Parse RDF Bag Elements in XML

2012-01-23 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191208#comment-13191208 ] Nick Burch commented on TIKA-846: Longer term, using Any23 sounds good to me, and hopefully

[jira] [Commented] (TIKA-844) Ability to Define an Internal Text Bag Property

2012-01-23 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191214#comment-13191214 ] Nick Burch commented on TIKA-844: Thanks, patch applied in r1234861. > Abi

[jira] [Resolved] (TIKA-844) Ability to Define an Internal Text Bag Property

2012-01-23 Thread Nick Burch (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-844. Resolution: Fixed > Ability to Define an Internal Text Bag Property

[jira] [Resolved] (TIKA-843) Support for Date without a Time Component

2012-01-23 Thread Nick Burch (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-843. Resolution: Fixed > Support for Date without a Time Component

[jira] [Resolved] (TIKA-846) Ability to Parse RDF Bag Elements in XML

2012-01-23 Thread Nick Burch (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-846. Resolution: Fixed > Ability to Parse RDF Bag Elements in XML

[jira] [Commented] (TIKA-845) Check for Existing Value in Multi-Value Fields in XML Metadata Handler

2012-01-23 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191228#comment-13191228 ] Nick Burch commented on TIKA-845: I think the current logic isn't quite correct. Rather tha

[jira] [Resolved] (TIKA-845) Check for Existing Value in Multi-Value Fields in XML Metadata Handler

2012-01-23 Thread Nick Burch (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-845. Resolution: Fixed > Check for Existing Value in Multi-Value Fields in XML Metadata Handler

[jira] [Commented] (TIKA-849) Identify and parse the Apple iBooks format

2012-01-23 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191242#comment-13191242 ] Nick Burch commented on TIKA-849: Sample file committed in r1234886, along with a unit test

[jira] [Commented] (TIKA-848) NullPointerException in SecurityHandler.addDictionaryAndSubDictionary(SecurityHandler.java:185)

2012-01-23 Thread Timo Boehme (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191244#comment-13191244 ] Timo Boehme commented on TIKA-848: Now fixed in PDFBox-1212. > NullPointe

[jira] [Commented] (TIKA-849) Identify and parse the Apple iBooks format

2012-01-23 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191259#comment-13191259 ] Nick Burch commented on TIKA-849: Test and parser change committed in r1234904, thanks. It

[jira] [Commented] (TIKA-845) Check for Existing Value in Multi-Value Fields in XML Metadata Handler

2012-01-23 Thread Ray Gauss II (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191805#comment-13191805 ] Ray Gauss II commented on TIKA-845: I was following precedent there and actually not eve

[jira] [Updated] (TIKA-818) Allow PDFBox to be used with RandomAccessFile vs RandomAccessBuffer to allow for a memory vs performance tradeoff

2012-01-23 Thread Paul Pearcy (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Pearcy updated TIKA-818: Attachment: choose_inmemory_vs_temp_file_pdf.patch Here is a patch based off the trunk. Please let me know

[jira] [Updated] (TIKA-818) Allow PDFBox to be used with RandomAccessFile vs RandomAccessBuffer to allow for a memory vs performance tradeoff

2012-01-23 Thread Paul Pearcy (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Pearcy updated TIKA-818: Attachment: choose_inmemory_vs_temp_file_pdf_passes_tests.patch Here's a version that should pass all tests