RE: 1.7 release? | potential blocker?

2015-01-05 Thread Allison, Timothy B.
All,

I think I may have found a problem with the interaction of OutlookPSTParser 
with AutoDetectParser that I'd want to fix before 1.7.

If you use the AutoDetectParser instead of the OutlookPSTParser() in 
OutlookPSTParserTest:

//   OutlookPSTParser pstParser = new OutlookPSTParser();
Parser pstParser = new AutoDetectParser();

I'm seeing this exception:

org.apache.tika.exception.TikaException: Failed to close temporary resources
at 
org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)

Are others seeing this?

I'll try to dig into this today, might not get to it until tomorrow.

Best,

Tim



-Original Message-
From: Tyler Palsulich [mailto:tpalsul...@gmail.com] 
Sent: Monday, December 22, 2014 1:58 PM
To: dev@tika.apache.org
Subject: Re: 1.7 release?

Hi All,

Nick added the temporary fix for TIKA-1445 and made the POI updates for
TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
:)

I'll start the process this weekend or a couple days into the new year.

Cheers,
Tyler
On Dec 18, 2014 9:45 PM, Mattmann, Chris A (3980) 
chris.a.mattm...@jpl.nasa.gov wrote:

 +1

 ++
 Chris Mattmann, Ph.D.
 Chief Architect
 Instrument Software and Science Data Systems Section (398)
 NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
 Office: 168-519, Mailstop: 168-527
 Email: chris.a.mattm...@nasa.gov
 WWW:  http://sunset.usc.edu/~mattmann/
 ++
 Adjunct Associate Professor, Computer Science Department
 University of Southern California, Los Angeles, CA 90089 USA
 ++






 -Original Message-
 From: Tyler Palsulich tpalsul...@gmail.com
 Reply-To: dev@tika.apache.org dev@tika.apache.org
 Date: Thursday, December 18, 2014 at 9:15 PM
 To: dev@tika.apache.org dev@tika.apache.org
 Subject: Re: 1.7 release?

 I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
 Nick
 just recommended, I'll try adding metadata extraction to Tesseract soon,
 then adding the extensible solution in 1.8.
 
 Tyler
 
 On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) 
 chris.a.mattm...@jpl.nasa.gov wrote:
 
  I haven’t tried my hand at it - been super busy. tyler if you have a
  chance go for it, I think that’s the remaining blocker.
 
  ++
  Chris Mattmann, Ph.D.
  Chief Architect
  Instrument Software and Science Data Systems Section (398)
  NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
  Office: 168-519, Mailstop: 168-527
  Email: chris.a.mattm...@nasa.gov
  WWW:  http://sunset.usc.edu/~mattmann/
  ++
  Adjunct Associate Professor, Computer Science Department
  University of Southern California, Los Angeles, CA 90089 USA
  ++
 
 
 
 
 
 
  -Original Message-
  From: Tyler Palsulich tpalsul...@gmail.com
  Reply-To: dev@tika.apache.org dev@tika.apache.org
  Date: Thursday, December 18, 2014 at 12:54 PM
  To: dev@tika.apache.org dev@tika.apache.org
  Subject: Re: 1.7 release?
 
  Hi All,
  
  It's been a few months, so I just want to follow up on this thread.
 We've
  resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as
  1.7
  (TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
 TIKA-1445?
  Has anyone tried their hand at the suggested (significant) fix?
  
  Are there any other issues someone would like to fit in?
  
  Cheers,
  Tyler
  
  [0] -
  
 
 
 https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select
 e
  dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
  
  On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) 
  chris.a.mattm...@jpl.nasa.gov wrote:
  
   Thanks Tim saw your patch and am looking now.
  
   ++
   Chris Mattmann, Ph.D.
   Chief Architect
   Instrument Software and Science Data Systems Section (398)
   NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
   Office: 168-519, Mailstop: 168-527
   Email: chris.a.mattm...@nasa.gov
   WWW:  http://sunset.usc.edu/~mattmann/
   ++
   Adjunct Associate Professor, Computer Science Department
   University of Southern California, Los Angeles, CA 90089 USA
   ++
  
  
  
  
  
  
   -Original Message-
   From: Allison, Timothy B. talli...@mitre.org
   Reply-To: dev@tika.apache.org dev@tika.apache.org
   Date: Monday, October 27, 2014 at 12:30 PM
   To: dev@tika.apache.org dev@tika.apache.org
   Subject: RE: 1.7 release?
  
   

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
Works for me. I got stalled midway through the process of getting RC#1 out
(authentication issues). But, going to try to finish it right now (best way
to upload to dist.apache.org?
http://www.apache.org/dev/release.html#upload-scp each file?). I won't send
a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2.

Sound good?

Tyler

On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. talli...@mitre.org
wrote:

 All,

 I think I may have found a problem with the interaction of
 OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7.

 If you use the AutoDetectParser instead of the OutlookPSTParser() in
 OutlookPSTParserTest:

 //   OutlookPSTParser pstParser = new OutlookPSTParser();
 Parser pstParser = new AutoDetectParser();

 I'm seeing this exception:

 org.apache.tika.exception.TikaException: Failed to close temporary
 resources
 at
 org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
 at
 org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)

 Are others seeing this?

 I'll try to dig into this today, might not get to it until tomorrow.

 Best,

 Tim



 -Original Message-
 From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
 Sent: Monday, December 22, 2014 1:58 PM
 To: dev@tika.apache.org
 Subject: Re: 1.7 release?

 Hi All,

 Nick added the temporary fix for TIKA-1445 and made the POI updates for
 TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
 :)

 I'll start the process this weekend or a couple days into the new year.

 Cheers,
 Tyler
 On Dec 18, 2014 9:45 PM, Mattmann, Chris A (3980) 
 chris.a.mattm...@jpl.nasa.gov wrote:

  +1
 
  ++
  Chris Mattmann, Ph.D.
  Chief Architect
  Instrument Software and Science Data Systems Section (398)
  NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
  Office: 168-519, Mailstop: 168-527
  Email: chris.a.mattm...@nasa.gov
  WWW:  http://sunset.usc.edu/~mattmann/
  ++
  Adjunct Associate Professor, Computer Science Department
  University of Southern California, Los Angeles, CA 90089 USA
  ++
 
 
 
 
 
 
  -Original Message-
  From: Tyler Palsulich tpalsul...@gmail.com
  Reply-To: dev@tika.apache.org dev@tika.apache.org
  Date: Thursday, December 18, 2014 at 9:15 PM
  To: dev@tika.apache.org dev@tika.apache.org
  Subject: Re: 1.7 release?
 
  I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
  Nick
  just recommended, I'll try adding metadata extraction to Tesseract soon,
  then adding the extensible solution in 1.8.
  
  Tyler
  
  On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) 
  chris.a.mattm...@jpl.nasa.gov wrote:
  
   I haven’t tried my hand at it - been super busy. tyler if you have a
   chance go for it, I think that’s the remaining blocker.
  
   ++
   Chris Mattmann, Ph.D.
   Chief Architect
   Instrument Software and Science Data Systems Section (398)
   NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
   Office: 168-519, Mailstop: 168-527
   Email: chris.a.mattm...@nasa.gov
   WWW:  http://sunset.usc.edu/~mattmann/
   ++
   Adjunct Associate Professor, Computer Science Department
   University of Southern California, Los Angeles, CA 90089 USA
   ++
  
  
  
  
  
  
   -Original Message-
   From: Tyler Palsulich tpalsul...@gmail.com
   Reply-To: dev@tika.apache.org dev@tika.apache.org
   Date: Thursday, December 18, 2014 at 12:54 PM
   To: dev@tika.apache.org dev@tika.apache.org
   Subject: Re: 1.7 release?
  
   Hi All,
   
   It's been a few months, so I just want to follow up on this thread.
  We've
   resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked
 as
   1.7
   (TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
  TIKA-1445?
   Has anyone tried their hand at the suggested (significant) fix?
   
   Are there any other issues someone would like to fit in?
   
   Cheers,
   Tyler
   
   [0] -
   
  
  
 
 https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select
  e
   dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
   
   On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) 
   chris.a.mattm...@jpl.nasa.gov wrote:
   
Thanks Tim saw your patch and am looking now.
   
++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Nick Burch

On Mon, 5 Jan 2015, Tyler Palsulich wrote:

Works for me. I got stalled midway through the process of getting RC#1 out
(authentication issues). But, going to try to finish it right now (best way
to upload to dist.apache.org?


That's a svn checkout

For the RC, assuming it's the same process as for Apache POI, you checkout 
https://dist.apache.org/repos/dist/dev/tika and put the files there


Then, if the vote passes, you svn mv them to 
https://dist.apache.org/repos/dist/release/tika/ + upload things to maven 
central


Nick


Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
Thanks, Nick! You were right. OK -- Technically, RC#1 is up at
https://dist.apache.org/repos/dist/dev/tika/.

 Should I also patch the rc1 branch or will you re-branch from trunk?
I'll re-branch.

Tyler

On Mon, Jan 5, 2015 at 12:03 PM, Allison, Timothy B. talli...@mitre.org
wrote:

 I'll patch trunk tonight (with null check, of course :)).  Should I also
 patch the rc1 branch or will you re-branch from trunk?

 -Original Message-
 From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
 Sent: Monday, January 05, 2015 11:38 AM
 To: dev@tika.apache.org
 Subject: Re: 1.7 release? | potential blocker?

 Works for me. I got stalled midway through the process of getting RC#1 out
 (authentication issues). But, going to try to finish it right now (best way
 to upload to dist.apache.org?
 http://www.apache.org/dev/release.html#upload-scp each file?). I won't
 send
 a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2.

 Sound good?

 Tyler

 On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. talli...@mitre.org
 wrote:

  All,
 
  I think I may have found a problem with the interaction of
  OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7.
 
  If you use the AutoDetectParser instead of the OutlookPSTParser() in
  OutlookPSTParserTest:
 
  //   OutlookPSTParser pstParser = new OutlookPSTParser();
  Parser pstParser = new AutoDetectParser();
 
  I'm seeing this exception:
 
  org.apache.tika.exception.TikaException: Failed to close temporary
  resources
  at
 
 org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
  at
  org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)
 
  Are others seeing this?
 
  I'll try to dig into this today, might not get to it until tomorrow.
 
  Best,
 
  Tim
 
 
 
  -Original Message-
  From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
  Sent: Monday, December 22, 2014 1:58 PM
  To: dev@tika.apache.org
  Subject: Re: 1.7 release?
 
  Hi All,
 
  Nick added the temporary fix for TIKA-1445 and made the POI updates for
  TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for
 1.7!
  :)
 
  I'll start the process this weekend or a couple days into the new year.
 
  Cheers,
  Tyler
  On Dec 18, 2014 9:45 PM, Mattmann, Chris A (3980) 
  chris.a.mattm...@jpl.nasa.gov wrote:
 
   +1
  
   ++
   Chris Mattmann, Ph.D.
   Chief Architect
   Instrument Software and Science Data Systems Section (398)
   NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
   Office: 168-519, Mailstop: 168-527
   Email: chris.a.mattm...@nasa.gov
   WWW:  http://sunset.usc.edu/~mattmann/
   ++
   Adjunct Associate Professor, Computer Science Department
   University of Southern California, Los Angeles, CA 90089 USA
   ++
  
  
  
  
  
  
   -Original Message-
   From: Tyler Palsulich tpalsul...@gmail.com
   Reply-To: dev@tika.apache.org dev@tika.apache.org
   Date: Thursday, December 18, 2014 at 9:15 PM
   To: dev@tika.apache.org dev@tika.apache.org
   Subject: Re: 1.7 release?
  
   I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
   Nick
   just recommended, I'll try adding metadata extraction to Tesseract
 soon,
   then adding the extensible solution in 1.8.
   
   Tyler
   
   On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) 
   chris.a.mattm...@jpl.nasa.gov wrote:
   
I haven’t tried my hand at it - been super busy. tyler if you have a
chance go for it, I think that’s the remaining blocker.
   
++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++
   
   
   
   
   
   
-Original Message-
From: Tyler Palsulich tpalsul...@gmail.com
Reply-To: dev@tika.apache.org dev@tika.apache.org
Date: Thursday, December 18, 2014 at 12:54 PM
To: dev@tika.apache.org dev@tika.apache.org
Subject: Re: 1.7 release?
   
Hi All,

It's been a few months, so I just want to follow up on this thread.
   We've
resolved/closed 51 issues for v1.7 [0]. There are two on JIRA
 marked
  as
1.7
(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
   TIKA-1445?
Has anyone tried their hand at the suggested (significant) fix

RE: 1.7 release? | potential blocker?

2015-01-05 Thread Allison, Timothy B.
I'll patch trunk tonight (with null check, of course :)).  Should I also patch 
the rc1 branch or will you re-branch from trunk?

-Original Message-
From: Tyler Palsulich [mailto:tpalsul...@gmail.com] 
Sent: Monday, January 05, 2015 11:38 AM
To: dev@tika.apache.org
Subject: Re: 1.7 release? | potential blocker?

Works for me. I got stalled midway through the process of getting RC#1 out
(authentication issues). But, going to try to finish it right now (best way
to upload to dist.apache.org?
http://www.apache.org/dev/release.html#upload-scp each file?). I won't send
a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2.

Sound good?

Tyler

On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. talli...@mitre.org
wrote:

 All,

 I think I may have found a problem with the interaction of
 OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7.

 If you use the AutoDetectParser instead of the OutlookPSTParser() in
 OutlookPSTParserTest:

 //   OutlookPSTParser pstParser = new OutlookPSTParser();
 Parser pstParser = new AutoDetectParser();

 I'm seeing this exception:

 org.apache.tika.exception.TikaException: Failed to close temporary
 resources
 at
 org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
 at
 org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)

 Are others seeing this?

 I'll try to dig into this today, might not get to it until tomorrow.

 Best,

 Tim



 -Original Message-
 From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
 Sent: Monday, December 22, 2014 1:58 PM
 To: dev@tika.apache.org
 Subject: Re: 1.7 release?

 Hi All,

 Nick added the temporary fix for TIKA-1445 and made the POI updates for
 TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
 :)

 I'll start the process this weekend or a couple days into the new year.

 Cheers,
 Tyler
 On Dec 18, 2014 9:45 PM, Mattmann, Chris A (3980) 
 chris.a.mattm...@jpl.nasa.gov wrote:

  +1
 
  ++
  Chris Mattmann, Ph.D.
  Chief Architect
  Instrument Software and Science Data Systems Section (398)
  NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
  Office: 168-519, Mailstop: 168-527
  Email: chris.a.mattm...@nasa.gov
  WWW:  http://sunset.usc.edu/~mattmann/
  ++
  Adjunct Associate Professor, Computer Science Department
  University of Southern California, Los Angeles, CA 90089 USA
  ++
 
 
 
 
 
 
  -Original Message-
  From: Tyler Palsulich tpalsul...@gmail.com
  Reply-To: dev@tika.apache.org dev@tika.apache.org
  Date: Thursday, December 18, 2014 at 9:15 PM
  To: dev@tika.apache.org dev@tika.apache.org
  Subject: Re: 1.7 release?
 
  I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
  Nick
  just recommended, I'll try adding metadata extraction to Tesseract soon,
  then adding the extensible solution in 1.8.
  
  Tyler
  
  On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) 
  chris.a.mattm...@jpl.nasa.gov wrote:
  
   I haven’t tried my hand at it - been super busy. tyler if you have a
   chance go for it, I think that’s the remaining blocker.
  
   ++
   Chris Mattmann, Ph.D.
   Chief Architect
   Instrument Software and Science Data Systems Section (398)
   NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
   Office: 168-519, Mailstop: 168-527
   Email: chris.a.mattm...@nasa.gov
   WWW:  http://sunset.usc.edu/~mattmann/
   ++
   Adjunct Associate Professor, Computer Science Department
   University of Southern California, Los Angeles, CA 90089 USA
   ++
  
  
  
  
  
  
   -Original Message-
   From: Tyler Palsulich tpalsul...@gmail.com
   Reply-To: dev@tika.apache.org dev@tika.apache.org
   Date: Thursday, December 18, 2014 at 12:54 PM
   To: dev@tika.apache.org dev@tika.apache.org
   Subject: Re: 1.7 release?
  
   Hi All,
   
   It's been a few months, so I just want to follow up on this thread.
  We've
   resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked
 as
   1.7
   (TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
  TIKA-1445?
   Has anyone tried their hand at the suggested (significant) fix?
   
   Are there any other issues someone would like to fit in?
   
   Cheers,
   Tyler
   
   [0] -
   
  
  
 
 https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select
  e
   dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
   
   On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) 
   chris.a.mattm...@jpl.nasa.gov wrote:
   
Thanks Tim saw your patch and am looking now