[jira] [Assigned] (TIKA-1327) New parser for Matlab .mat files

2014-06-09 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned TIKA-1327: --- Assignee: Chris A. Mattmann > New parser for Matlab .mat files > -

Re: Review Request 22246: New parser for Matlab .mat files

2014-06-09 Thread Ann Burgess
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22246/ --- (Updated June 10, 2014, 3:21 a.m.) Review request for tika and Chris Mattmann.

Re: Review Request 22402: Tika OCR

2014-06-09 Thread Tyler Palsulich
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22402/#review45159 --- Need to add a license to the top of the new files. - Tyler Palsulic

[jira] [Updated] (TIKA-93) OCR support

2014-06-09 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-93?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-93: Attachment: TesseractOCR_Tyler_v2.patch Minor updates to the patch: Moved the OCRParser to tika-parser

Review Request 22402: Tika OCR

2014-06-09 Thread Tyler Palsulich
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22402/ --- Review request for tika and Chris Mattmann. Repository: tika Description

Re: Review Request 22246: New parser for Matlab .mat files

2014-06-09 Thread Chris Mattmann
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22246/#review45156 --- Annie, this looks great. I will just take out those 2 parts about th

Re: Review Request 22246: New parser for Matlab .mat files

2014-06-09 Thread Chris Mattmann
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22246/#review45154 --- Ship it! Ship It! trunk/tika-parsers/pom.xml

[jira] [Created] (TIKA-1328) Translate Metadata and Content

2014-06-09 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1328: - Summary: Translate Metadata and Content Key: TIKA-1328 URL: https://issues.apache.org/jira/browse/TIKA-1328 Project: Tika Issue Type: New Feature

Re: Review Request 22246: New parser for Matlab .mat files

2014-06-09 Thread Tyler Palsulich
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22246/#review45126 --- trunk/tika-parsers/pom.xml

Re: Review Request 22246: New parser for Matlab .mat files

2014-06-09 Thread Ann Burgess
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22246/ --- (Updated June 9, 2014, 8:11 p.m.) Review request for tika and Chris Mattmann.

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025462#comment-14025462 ] Hudson commented on TIKA-1325: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #36 (See [https://bu

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025434#comment-14025434 ] Hudson commented on TIKA-1325: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #36 (See [https://bu

Re: Timezone issue with TTF parser?

2014-06-09 Thread Tyler Palsulich
> Ok, should work as of r1601444. Yep! All passing now. Thanks.

RE: Timezone issue with TTF parser?

2014-06-09 Thread Allison, Timothy B.
Ok, should work as of r1601444. Thank you, Nick, for working through this issue with me. -Original Message- From: Nick Burch [mailto:apa...@gagravarr.org] Sent: Monday, June 09, 2014 10:45 AM To: dev@tika.apache.org Subject: Re: Timezone issue with TTF parser? On Mon, 9 Jun 2014, Ken K

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025375#comment-14025375 ] Tim Allison commented on TIKA-1325: --- Default timezone is now set in the test case as of r

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025367#comment-14025367 ] Hudson commented on TIKA-1258: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #35 (See [https://bu

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025355#comment-14025355 ] Nick Burch commented on TIKA-1258: -- The perils of not doing a clean first... It was workin

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025347#comment-14025347 ] Hudson commented on TIKA-1258: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #35 (See [https://bu

[jira] [Commented] (TIKA-1323) Improve exception reporting in JAX-RS server

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025345#comment-14025345 ] Tim Allison commented on TIKA-1323: --- Great. Thank you, Sergey! I'll require that the opt

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025324#comment-14025324 ] Tilman Hausherr commented on TIKA-1325: --- PDFBOX-2122 has been fixed. > Move the font

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025319#comment-14025319 ] Ken Krugler commented on TIKA-1303: --- I'd tried setting it to 1.6 when I fixed it, but Jir

[jira] [Updated] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler updated TIKA-1303: -- Fix Version/s: (was: 1.7) 1.6 > Parsing Html page (not well formed) containing tw

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025316#comment-14025316 ] Tim Allison commented on TIKA-1325: --- Sorry, didn't mean to pile on with Hudson! Let me k

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025310#comment-14025310 ] Hudson commented on TIKA-1325: -- UNSTABLE: Integrated in tika-trunk-jdk1.6 #34 (See [https://b

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025309#comment-14025309 ] Tim Allison commented on TIKA-1325: --- Happy to do so. I can't get a clean build, though b

Re: Timezone issue with TTF parser?

2014-06-09 Thread Lewis John Mcgibbney
+1 Can reproduce On Mon, Jun 9, 2014 at 11:41 AM, wrote: > > Subject: Re: Timezone issue with TTF parser? > +1 Having the same issue. That test passed for me before the update. I'm on > Pacific time, for what it's worth. > >

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025305#comment-14025305 ] Hudson commented on TIKA-1325: -- UNSTABLE: Integrated in tika-trunk-jdk1.7 #34 (See [https://b

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025306#comment-14025306 ] Hudson commented on TIKA-1258: -- UNSTABLE: Integrated in tika-trunk-jdk1.7 #34 (See [https://b

[jira] [Comment Edited] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025287#comment-14025287 ] Tim Allison edited comment on TIKA-1325 at 6/9/14 3:39 PM: --- Doh!

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025293#comment-14025293 ] Nick Burch commented on TIKA-1325: -- Setting the Timezone in the test for now would work fo

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025287#comment-14025287 ] Tim Allison commented on TIKA-1325: --- Doh! Same issue for those of us in non-standard lan

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-09 Thread Michal Hlavac (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025283#comment-14025283 ] Michal Hlavac commented on TIKA-1258: - Hi, I have also some test. I'll send it after so

Re: Timezone issue with TTF parser?

2014-06-09 Thread Tyler Palsulich
+1 Having the same issue. That test passed for me before the update. I'm on Pacific time, for what it's worth. Tyler

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025267#comment-14025267 ] Nick Burch commented on TIKA-1258: -- Thanks, I've committed it in r1601402. I've also had

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025264#comment-14025264 ] Hudson commented on TIKA-1258: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #33 (See [https://bu

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025262#comment-14025262 ] Hudson commented on TIKA-1325: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #33 (See [https://bu

[jira] [Commented] (TIKA-1276) Missing embedded dependencies in tika-bundle

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025263#comment-14025263 ] Hudson commented on TIKA-1276: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #33 (See [https://bu

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025261#comment-14025261 ] Hudson commented on TIKA-1303: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #33 (See [https://bu

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025254#comment-14025254 ] Lewis John McGibbney commented on TIKA-1303: The code is integrated into trunk

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025251#comment-14025251 ] Hudson commented on TIKA-1325: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #33 (See [https://bu

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Hassan Akram (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025253#comment-14025253 ] Hassan Akram commented on TIKA-1303: :) Thanks Guys - Do I just close this issue now?

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025250#comment-14025250 ] Hudson commented on TIKA-1303: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #33 (See [https://bu

[jira] [Commented] (TIKA-1276) Missing embedded dependencies in tika-bundle

2014-06-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025252#comment-14025252 ] Hudson commented on TIKA-1276: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #33 (See [https://bu

[jira] [Assigned] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler reassigned TIKA-1303: - Assignee: Ken Krugler > Parsing Html page (not well formed) containing two title tags results in

[jira] [Resolved] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler resolved TIKA-1303. --- Resolution: Fixed Fix Version/s: 1.7 r1601397 - thanks Hassan! > Parsing Html page (not well f

Re: Timezone issue with TTF parser?

2014-06-09 Thread Nick Burch
On Mon, 9 Jun 2014, Ken Krugler wrote: I just did an svn up from trunk, and mvn clean install is failing with: Failed tests: testTTFParsing(org.apache.tika.parser.font.FontParsersTest): expected:<1904-01-01T0[0]:00:00Z> but was:<1904-01-01T0[8]:00:00Z> See TIKA-1325. Pesky to discover it's

Timezone issue with TTF parser?

2014-06-09 Thread Ken Krugler
Hi all, I just did an svn up from trunk, and mvn clean install is failing with: Failed tests: testTTFParsing(org.apache.tika.parser.font.FontParsersTest): expected:<1904-01-01T0[0]:00:00Z> but was:<1904-01-01T0[8]:00:00Z> Which is this line in the test: assertEquals("1904-01-01T00:0
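A minimal sketch of one way an eight-hour shift like the one reported above can arise. It assumes the 1904-01-01 epoch is built at local midnight in some zone and then formatted in UTC. The thread does not confirm that this is what the font parser actually does. The class name TimezoneSketch and the method epoch1904AsUtc are hypothetical, and a fixed-offset zone (GMT-08:00) stands in for Pacific time.

```java
import java.text.SimpleDateFormat;
import java.util.Calendar;
import java.util.GregorianCalendar;
import java.util.Locale;
import java.util.TimeZone;

// Hypothetical illustration only; not code from Tika or FontBox.
class TimezoneSketch {

    // Builds 1904-01-01 00:00:00 as wall-clock time in the given zone,
    // then renders that instant in UTC with a literal 'Z' suffix.
    static String epoch1904AsUtc(TimeZone zone) {
        Calendar cal = new GregorianCalendar(zone);
        cal.clear(); // reset all fields, including milliseconds
        cal.set(1904, Calendar.JANUARY, 1, 0, 0, 0);

        SimpleDateFormat fmt =
                new SimpleDateFormat("yyyy-MM-dd'T'HH:mm:ss'Z'", Locale.ROOT);
        fmt.setTimeZone(TimeZone.getTimeZone("UTC"));
        return fmt.format(cal.getTime());
    }

    public static void main(String[] args) {
        // Epoch built in UTC: matches the value the test expects.
        System.out.println(epoch1904AsUtc(TimeZone.getTimeZone("UTC")));
        // Epoch built eight hours west of UTC: matches the reported failure.
        System.out.println(epoch1904AsUtc(TimeZone.getTimeZone("GMT-08:00")));
    }
}
```

Under these assumptions the first call yields 1904-01-01T00:00:00Z and the second 1904-01-01T08:00:00Z. A test that depends on the JVM default zone can be made deterministic by pinning the zone with TimeZone.setDefault(...) before the assertions and restoring the previous default afterwards.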

[jira] [Comment Edited] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025196#comment-14025196 ] Tim Allison edited comment on TIKA-1325 at 6/9/14 2:37 PM: --- Agree

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025205#comment-14025205 ] Tilman Hausherr commented on TIKA-1325: --- It is PDFBOX-2122, and I can do the change d

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025202#comment-14025202 ] Nick Burch commented on TIKA-1325: -- I hope that the change in r1601385 has solved it - I'v

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025203#comment-14025203 ] Lewis John McGibbney commented on TIKA-1303: +1 LGTM > Parsing Html page (not

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025196#comment-14025196 ] Tim Allison commented on TIKA-1325: --- Agreed and agreed. Thank you. It looks like FontBo

[jira] [Commented] (TIKA-1325) Move the font metadata definitions to properties

2014-06-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025187#comment-14025187 ] Nick Burch commented on TIKA-1325: -- I couldn't help but think there ought to be a better w

[jira] [Comment Edited] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Hassan Akram (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14023067#comment-14023067 ] Hassan Akram edited comment on TIKA-1303 at 6/9/14 12:03 PM: - H

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Hassan Akram (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14023067#comment-14023067 ] Hassan Akram commented on TIKA-1303: Hi Lewis, Thanks for your help. I have created the

[jira] [Updated] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Hassan Akram (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hassan Akram updated TIKA-1303: --- Attachment: TIKA-1303.patch > Parsing Html page (not well formed) containing two title tags results in

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14021878#comment-14021878 ] Lewis John McGibbney commented on TIKA-1303: Yeah sure either * check out the

[jira] [Commented] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Hassan Akram (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14021750#comment-14021750 ] Hassan Akram commented on TIKA-1303: Hi Lewis, I am just picking up your comments on th

[jira] [Issue Comment Deleted] (TIKA-1303) Parsing Html page (not well formed) containing two title tags results in metadata (title) to be overwritten

2014-06-09 Thread Ashish Sood (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashish Sood updated TIKA-1303: -- Comment: was deleted (was: I am currently out of the office, returning on Monday 9 June 2014. If your e