[GitHub] tika pull request: Tika 1886

2016-03-03 Thread nandan-pc
GitHub user nandan-pc opened a pull request: https://github.com/apache/tika/pull/88 Tika 1886 Fix for issue : Tika -1886 provided by Nandan Padar Chandrashekar. Summary : 1. Added .hfa mime type to mime-type.xml 2. Added related test case and resource file.

[jira] [Updated] (TIKA-1892) Mime Magic for application/x-mobipocket-ebook and application/x-shapefile

2016-03-03 Thread Suman Kashyap (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suman Kashyap updated TIKA-1892: Description: Our FHT analysis for mobipocket-ebook and shapefiles shows high corelation of initial

[jira] [Created] (TIKA-1892) Mime Magic for application/x-mobipocket-ebook and application/x-shapefile

2016-03-03 Thread Suman Kashyap (JIRA)
Suman Kashyap created TIKA-1892: --- Summary: Mime Magic for application/x-mobipocket-ebook and application/x-shapefile Key: TIKA-1892 URL: https://issues.apache.org/jira/browse/TIKA-1892 Project: Tika

[jira] [Updated] (TIKA-1889) Update mimetype for *.qt and *.mov files detection

2016-03-03 Thread Ajay Kumar Loganathan Ravichandran (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ajay Kumar Loganathan Ravichandran updated TIKA-1889: - Description: Updating tika-mimetype.xml to identify

[jira] [Created] (TIKA-1891) Update mimetype for mime-type image/fits

2016-03-03 Thread Ajay Kumar Loganathan Ravichandran (JIRA)
Ajay Kumar Loganathan Ravichandran created TIKA-1891: Summary: Update mimetype for mime-type image/fits Key: TIKA-1891 URL: https://issues.apache.org/jira/browse/TIKA-1891 Project:

[jira] [Created] (TIKA-1890) Update mimetype for application/vnd.ms-cab-compressed

2016-03-03 Thread Ajay Kumar Loganathan Ravichandran (JIRA)
Ajay Kumar Loganathan Ravichandran created TIKA-1890: Summary: Update mimetype for application/vnd.ms-cab-compressed Key: TIKA-1890 URL: https://issues.apache.org/jira/browse/TIKA-1890

[jira] [Created] (TIKA-1888) Update mimetype for application/x-netcdf

2016-03-03 Thread Ajay Kumar Loganathan Ravichandran (JIRA)
Ajay Kumar Loganathan Ravichandran created TIKA-1888: Summary: Update mimetype for application/x-netcdf Key: TIKA-1888 URL: https://issues.apache.org/jira/browse/TIKA-1888 Project:

[jira] [Created] (TIKA-1889) Update mimetype for video/quicktime

2016-03-03 Thread Ajay Kumar Loganathan Ravichandran (JIRA)
Ajay Kumar Loganathan Ravichandran created TIKA-1889: Summary: Update mimetype for video/quicktime Key: TIKA-1889 URL: https://issues.apache.org/jira/browse/TIKA-1889 Project: Tika

[jira] [Created] (TIKA-1887) Add new mimetype for file extensions .po

2016-03-03 Thread Manali Shah (JIRA)
Manali Shah created TIKA-1887: - Summary: Add new mimetype for file extensions .po Key: TIKA-1887 URL: https://issues.apache.org/jira/browse/TIKA-1887 Project: Tika Issue Type: Improvement

[jira] [Commented] (TIKA-1883) Identification of Mime Type for Empty Files

2016-03-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15178469#comment-15178469 ] ASF GitHub Bot commented on TIKA-1883: -- GitHub user adityardesai opened a pull request:

[GitHub] tika pull request: Fix for TIKA-1883 and 1884

2016-03-03 Thread adityardesai
GitHub user adityardesai opened a pull request: https://github.com/apache/tika/pull/87 Fix for TIKA-1883 and 1884 TIKA 1883 Identification of Mime types for empty files, updating TIKA 1.12 source code to fix this issue. The Tika Detector and Parsers have been modified

[jira] [Commented] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata

2016-03-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15178059#comment-15178059 ] Tim Allison commented on TIKA-1607: --- FWIW, I extracted ~300k XMPs and XFAs from some of our corpus. The

[jira] [Comment Edited] (TIKA-1663) Add a DigestingParser to add MD5/SHA-X hashes as fields in Metadata

2016-03-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177851#comment-15177851 ] Tim Allison edited comment on TIKA-1663 at 3/3/16 2:19 PM: --- Thank you, Nick. I

[jira] [Commented] (TIKA-1663) Add a DigestingParser to add MD5/SHA-X hashes as fields in Metadata

2016-03-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177851#comment-15177851 ] Tim Allison commented on TIKA-1663: --- Thank you, Nick. I somewhat prefer the first option (once we add

[jira] [Commented] (TIKA-1841) Different XML output structure for PPT and PPTX

2016-03-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177637#comment-15177637 ] ASF GitHub Bot commented on TIKA-1841: -- GitHub user zetisam opened a pull request:

[GitHub] tika pull request: fix for TIKA-1841 contributed by zetisam

2016-03-03 Thread zetisam
GitHub user zetisam opened a pull request: https://github.com/apache/tika/pull/86 fix for TIKA-1841 contributed by zetisam You can merge this pull request into a Git repository by running: $ git pull https://github.com/zetisam/tika TIKA-1841 Alternatively you can review and

[GitHub] tika pull request: updated mime magic for cab, quicktime, fits and...

2016-03-03 Thread nithinkrishna
GitHub user nithinkrishna opened a pull request: https://github.com/apache/tika/pull/85 updated mime magic for cab, quicktime, fits and netcdf Identified magic bytes based on FHT analysis on Polar-Data in class #CSCI599 You can merge this pull request into a Git repository by

[jira] [Commented] (TIKA-1782) XHTMLContentHandler doesn't pass attributes of html element

2016-03-03 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177572#comment-15177572 ] Markus Jelsma commented on TIKA-1782: - Yes i, unfortunately, agree. The unit test i supplied, similar

[jira] [Commented] (TIKA-1885) Updated tika-mimestype.xml and a detector to identify new types of files based on analysis

2016-03-03 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177566#comment-15177566 ] Nick Burch commented on TIKA-1885: -- Did you mean to close this? Is there a matching pull request or patch