[VOTE] Apache Tika 1.7 Release

2015-01-05 Thread Tyler Palsulich
Hi All, A candidate for the Tika 1.7 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/ The SHA1 checksum of the archive is

[jira] [Commented] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264597#comment-14264597 ] Tim Allison commented on TIKA-1506: --- Looks like PSTFile relies on finalize() to close the

[jira] [Created] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1506: - Summary: OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser Key: TIKA-1506 URL: https://issues.apache.org/jira/browse/TIKA-1506

RE: 1.7 release? | potential blocker?

2015-01-05 Thread Allison, Timothy B.
All, I think I may have found a problem with the interaction of OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7. If you use the AutoDetectParser instead of the OutlookPSTParser() in OutlookPSTParserTest: // OutlookPSTParser pstParser = new OutlookPSTParser();

[jira] [Updated] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1506: -- Description: On Windows, PSTFile can leave its InputStream open, which can prevent AutoDetectParser

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
Works for me. I got stalled midway through the process of getting RC#1 out (authentication issues). But, going to try to finish it right now (best way to upload to dist.apache.org? http://www.apache.org/dev/release.html#upload-scp each file?). I won't send a VOTE for RC#1, though -- I'll wait for

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Nick Burch
On Mon, 5 Jan 2015, Tyler Palsulich wrote: Works for me. I got stalled midway through the process of getting RC#1 out (authentication issues). But, going to try to finish it right now (best way to upload to dist.apache.org? That's a svn checkout For the RC, assuming it's the same process as

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
Thanks, Nick! You were right. OK -- Technically, RC#1 is up at https://dist.apache.org/repos/dist/dev/tika/. Should I also patch the rc1 branch or will you re-branch from trunk? I'll re-branch. Tyler On Mon, Jan 5, 2015 at 12:03 PM, Allison, Timothy B. talli...@mitre.org wrote: I'll patch

RE: 1.7 release? | potential blocker?

2015-01-05 Thread Allison, Timothy B.
I'll patch trunk tonight (with null check, of course :)). Should I also patch the rc1 branch or will you re-branch from trunk? -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Monday, January 05, 2015 11:38 AM To: dev@tika.apache.org Subject: Re: 1.7

[jira] [Commented] (TIKA-1505) chmparser breaks down when extracting from file of CHM format v3

2015-01-05 Thread Hong-Thai Nguyen (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264786#comment-14264786 ] Hong-Thai Nguyen commented on TIKA-1505: Can you provide also problem files and

level of interest in database file parsing?

2015-01-05 Thread Allison, Timothy B.
All, Thanks to Nick for adding mime info for db files, we can now identify several common db files. What is the community's level of interest in adding parsers for databases that store data in one file, such as .mdb, .dbf, .sqlite, .hsqldb ... (others?)? Most of the jdbc drivers are not

[jira] [Commented] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1426#comment-1426 ] Hudson commented on TIKA-1506: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #392 (See

[jira] [Resolved] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1506. --- Resolution: Fixed r1649710. [~tpalsulich], let's roll! OutlookPSTParser not closing PSTFile's

[jira] [Commented] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265591#comment-14265591 ] Hudson commented on TIKA-1506: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #407 (See

[jira] [Updated] (TIKA-1505) chmparser breaks down when extracting from file of CHM format v3

2015-01-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1505: -- Fix Version/s: (was: 1.7) 1.8 chmparser breaks down when extracting from

[jira] [Commented] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265053#comment-14265053 ] Tim Allison commented on TIKA-1506: --- [~lfcnassif], ah, good to hear I'm not alone. Any

[jira] [Commented] (TIKA-1506) OutlookPSTParser not closing PSTFile's InputStream, causing exception when called by AutoDetectParser

2015-01-05 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14264705#comment-14264705 ] Luis Filipe Nassif commented on TIKA-1506: -- Ok. I have applied the same patch