[jira] [Commented] (TIKA-1276) Missing embedded dependencies in tika-bundle

2015-03-19 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14370809#comment-14370809 ] Shai Erera commented on TIKA-1276: -- bq. `com.uwyn:jhighlight:1.0` is not embedded Just FY

[jira] [Updated] (TIKA-1580) ISA-Tab parsers

2015-03-19 Thread Giuseppe Totaro (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Giuseppe Totaro updated TIKA-1580: -- Attachment: TIKA-1580.patch > ISA-Tab parsers > --- > > Key: TIKA-158

[jira] [Updated] (TIKA-1580) ISA-Tab parsers

2015-03-19 Thread Giuseppe Totaro (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Giuseppe Totaro updated TIKA-1580: -- Summary: ISA-Tab parsers (was: ISA-Tab) > ISA-Tab parsers > --- > >

[jira] [Created] (TIKA-1580) ISA-Tab

2015-03-19 Thread Giuseppe Totaro (JIRA)
Giuseppe Totaro created TIKA-1580: - Summary: ISA-Tab Key: TIKA-1580 URL: https://issues.apache.org/jira/browse/TIKA-1580 Project: Tika Issue Type: New Feature Components: parser

Re: work with you

2015-03-19 Thread Mattmann, Chris A (3980)
One thing you may consider as well is helping out the effort in Tika to support machine translation. We are working on a Joshua and Moses based translator and we have plug-ins to the Google API and to the Bing API and to Lingo24. I’m sure the Tika community would be happy to help with this - could

[jira] [Commented] (TIKA-1577) NetCDF Data Extraction

2015-03-19 Thread Ann Burgess (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14370177#comment-14370177 ] Ann Burgess commented on TIKA-1577: --- [~riverma] this is a good place to start: http://ww

[jira] [Updated] (TIKA-1579) Add file type to NetCDFParser

2015-03-19 Thread Ann Burgess (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ann Burgess updated TIKA-1579: -- Attachment: TIKA-1579.abburgess.190315.patch.txt > Add file type to NetCDFParser > --

[jira] [Commented] (TIKA-1579) Add file type to NetCDFParser

2015-03-19 Thread Ann Burgess (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14370137#comment-14370137 ] Ann Burgess commented on TIKA-1579: --- https://reviews.apache.org/r/32260/ > Add file type

Review Request 32260: Add file type description to NetCDF parser

2015-03-19 Thread Ann Burgess
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/32260/ --- Review request for tika. Repository: tika Description --- Outputs filety

[jira] [Commented] (TIKA-1578) Add file type description to HDFParsers

2015-03-19 Thread Ann Burgess (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14369982#comment-14369982 ] Ann Burgess commented on TIKA-1578: --- https://reviews.apache.org/r/32255/ > Add file type

Review Request 32255: File type description to HDFParser

2015-03-19 Thread Ann Burgess
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/32255/ --- Review request for tika. Repository: tika Description --- Added a file t

[jira] [Updated] (TIKA-1578) Add file type description to HDFParsers

2015-03-19 Thread Ann Burgess (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ann Burgess updated TIKA-1578: -- Attachment: TIKA-1578.abburgess.150319.patch.txt File type added to HDFParser > Add file type descriptio

[jira] [Commented] (TIKA-1154) Tika hangs on format detection of malformed HTML file.

2015-03-19 Thread Andrew Jackson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14368858#comment-14368858 ] Andrew Jackson commented on TIKA-1154: -- Yes, thanks - that's the behaviour I'd hoped f

[jira] [Issue Comment Deleted] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available

2015-03-19 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated TIKA-1575: -- Comment: was deleted (was: With the pure ExtractText, all is identical. Could you attach the file

[jira] [Commented] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available

2015-03-19 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14368687#comment-14368687 ] Tilman Hausherr commented on TIKA-1575: --- With the pure ExtractText, all is identical.

[jira] [Commented] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available

2015-03-19 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14368686#comment-14368686 ] Tilman Hausherr commented on TIKA-1575: --- With the pure ExtractText, all is identical.

[jira] [Updated] (TIKA-1088) Unsupported AutoCAD drawing version: AC1009

2015-03-19 Thread Hardik Upadhyay (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hardik Upadhyay updated TIKA-1088: -- Attachment: (was: 227051.dwg) > Unsupported AutoCAD drawing version: AC1009 > ---