Re: [VOTE] Apache Tika 1.7 Release

2015-01-17 Thread Mattmann, Chris A (3980)
woot ;) Knew you would

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Tyler Palsulich 
Reply-To: "dev@tika.apache.org" 
Date: Thursday, January 15, 2015 at 1:36 PM
To: "dev@tika.apache.org" 
Subject: Re: [VOTE] Apache Tika 1.7 Release

>Found it:
>https://github.com/chrismattmann/apachestuff/blob/master/extract-tika-cont
>ribs
>:)
>
>Thanks!
>Tyler
>
>On Thu, Jan 15, 2015 at 8:57 AM, Tyler Palsulich 
>wrote:
>
>> Thanks, Chris! That sounds useful. Let me know when you get a chance to
>> upload it somewhere.
>>
>> Tyler
>>
>> On Wed, Jan 14, 2015 at 11:22 PM, Mattmann, Chris A (3980) <
>> chris.a.mattm...@jpl.nasa.gov> wrote:
>>
>>> yeah good idea Nick. Also I had a script that would partially
>>> auto-generate
>>> the contributors and so forth - let me see if I can find it (it used
>>> Tika!)
>>> Hah, how’s THAT for eating our own dogfood?
>>>
>>> ++
>>> Chris Mattmann, Ph.D.
>>> Chief Architect
>>> Instrument Software and Science Data Systems Section (398)
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 168-519, Mailstop: 168-527
>>> Email: chris.a.mattm...@nasa.gov
>>> WWW:  http://sunset.usc.edu/~mattmann/
>>> ++
>>> Adjunct Associate Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++
>>>
>>>
>>>
>>>
>>>
>>>
>>> -Original Message-
>>> From: Nick Burch 
>>> Reply-To: "dev@tika.apache.org" 
>>> Date: Wednesday, January 14, 2015 at 11:49 AM
>>> To: "dev@tika.apache.org" 
>>> Subject: Re: [VOTE] Apache Tika 1.7 Release
>>>
>>> >On Wed, 14 Jan 2015, Tyler Palsulich wrote:
>>> >> Nick, thanks for building the site! We still need to rebuild the
>>>index,
>>> >> right?
>>> >
>>> >You'll need to build the 1.7 index page (based on the changelog), then
>>> >update the download page + homepage + menu, and finally rebuild the
>>>site
>>> >
>>> >(All I did was finish off the formats page, which we update as
>>> >development
>>> >progresses, copy over + tweak the remaining pages, and start a formats
>>> >page for the new version. Maybe it's worth adding all these steps to
>>>the
>>> >release process docs?)
>>> >
>>> >Nick
>>>
>>>
>>



Re: [VOTE] Apache Tika 1.7 Release

2015-01-15 Thread Tyler Palsulich
Found it:
https://github.com/chrismattmann/apachestuff/blob/master/extract-tika-contribs
:)

Thanks!
Tyler

On Thu, Jan 15, 2015 at 8:57 AM, Tyler Palsulich 
wrote:

> Thanks, Chris! That sounds useful. Let me know when you get a chance to
> upload it somewhere.
>
> Tyler
>
> On Wed, Jan 14, 2015 at 11:22 PM, Mattmann, Chris A (3980) <
> chris.a.mattm...@jpl.nasa.gov> wrote:
>
>> yeah good idea Nick. Also I had a script that would partially
>> auto-generate
>> the contributors and so forth - let me see if I can find it (it used
>> Tika!)
>> Hah, how’s THAT for eating our own dogfood?
>>
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++
>>
>>
>>
>>
>>
>>
>> -Original Message-
>> From: Nick Burch 
>> Reply-To: "dev@tika.apache.org" 
>> Date: Wednesday, January 14, 2015 at 11:49 AM
>> To: "dev@tika.apache.org" 
>> Subject: Re: [VOTE] Apache Tika 1.7 Release
>>
>> >On Wed, 14 Jan 2015, Tyler Palsulich wrote:
>> >> Nick, thanks for building the site! We still need to rebuild the index,
>> >> right?
>> >
>> >You'll need to build the 1.7 index page (based on the changelog), then
>> >update the download page + homepage + menu, and finally rebuild the site
>> >
>> >(All I did was finish off the formats page, which we update as
>> >development
>> >progresses, copy over + tweak the remaining pages, and start a formats
>> >page for the new version. Maybe it's worth adding all these steps to the
>> >release process docs?)
>> >
>> >Nick
>>
>>
>


[RESULT] [VOTE] Apache Tika 1.7 Release Candidate #3

2015-01-15 Thread Tyler Palsulich
Hi All,

The VOTE for releasing Apache Tika 1.7 RC#3 finished with the following
tally:

+1:
Chris Mattmann
David Meikle
Hong-Thai Nguyen
Nick Burch
Tim Allison
Tyler Palsulich

+0:
[None]

-1:
[None]

Thank you everyone for voting! I will move forward with the release.

Have a good day,
Tyler


Re: [VOTE] Apache Tika 1.7 Release

2015-01-15 Thread Tyler Palsulich
Thanks, Chris! That sounds useful. Let me know when you get a chance to
upload it somewhere.

Tyler

On Wed, Jan 14, 2015 at 11:22 PM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> yeah good idea Nick. Also I had a script that would partially auto-generate
> the contributors and so forth - let me see if I can find it (it used Tika!)
> Hah, how’s THAT for eating our own dogfood?
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Nick Burch 
> Reply-To: "dev@tika.apache.org" 
> Date: Wednesday, January 14, 2015 at 11:49 AM
> To: "dev@tika.apache.org" 
> Subject: Re: [VOTE] Apache Tika 1.7 Release
>
> >On Wed, 14 Jan 2015, Tyler Palsulich wrote:
> >> Nick, thanks for building the site! We still need to rebuild the index,
> >> right?
> >
> >You'll need to build the 1.7 index page (based on the changelog), then
> >update the download page + homepage + menu, and finally rebuild the site
> >
> >(All I did was finish off the formats page, which we update as
> >development
> >progresses, copy over + tweak the remaining pages, and start a formats
> >page for the new version. Maybe it's worth adding all these steps to the
> >release process docs?)
> >
> >Nick
>
>


Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Mattmann, Chris A (3980)
yeah good idea Nick. Also I had a script that would partially auto-generate
the contributors and so forth - let me see if I can find it (it used Tika!)
Hah, how’s THAT for eating our own dogfood?

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Nick Burch 
Reply-To: "dev@tika.apache.org" 
Date: Wednesday, January 14, 2015 at 11:49 AM
To: "dev@tika.apache.org" 
Subject: Re: [VOTE] Apache Tika 1.7 Release

>On Wed, 14 Jan 2015, Tyler Palsulich wrote:
>> Nick, thanks for building the site! We still need to rebuild the index,
>> right?
>
>You'll need to build the 1.7 index page (based on the changelog), then
>update the download page + homepage + menu, and finally rebuild the site
>
>(All I did was finish off the formats page, which we update as
>development 
>progresses, copy over + tweak the remaining pages, and start a formats
>page for the new version. Maybe it's worth adding all these steps to the
>release process docs?)
>
>Nick



Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread David Meikle
Hi Tyler,

> On 9 Jan 2015, at 22:02, Tyler Palsulich  wrote:
> 
> A candidate for the Tika 1.7 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/ 
> <https://dist.apache.org/repos/dist/dev/tika/>
> 
> The release candidate is a zip archive of the sources in:
> http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/ 
> <http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/>
+1 from me.

Cheers,
Dave

Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Nick Burch

On Wed, 14 Jan 2015, Tyler Palsulich wrote:
Nick, thanks for building the site! We still need to rebuild the index, 
right?


You'll need to build the 1.7 index page (based on the changelog), then 
update the download page + homepage + menu, and finally rebuild the site


(All I did was finish off the formats page, which we update as development 
progresses, copy over + tweak the remaining pages, and start a formats 
page for the new version. Maybe it's worth adding all these steps to the 
release process docs?)


Nick


Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Tyler Palsulich
Thanks everyone! I'll close off this VOTE and roll the release tomorrow
morning.

Nick, thanks for building the site! We still need to rebuild the index,
right?

Tyler

On Wed, Jan 14, 2015 at 8:37 AM, Allison, Timothy B. 
wrote:

> +1
>
> Built successfully on both Windows 7 and RHEL 6.5 for me...no Tesseract
> installed.  Relying on post rc2 release eval for TIKA 1445 against trunk
> for no new regressions.  Manually confirmed image metadata is being
> extracted.
>
> Thank you, Tyler!
>
> Best,
>
>  Tim
>
> -Original Message-
> From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov]
> Sent: Wednesday, January 14, 2015 10:54 AM
> To: u...@tika.apache.org; dev@tika.apache.org
> Subject: Re: [VOTE] Apache Tika 1.7 Release
>
> +1 to release
>
> GPG sigs and Checksums good (after import of tika.asc)
>
> Great work Tyler and team!
>
> Cheers,
> Chris
>
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/stage_apache_rc
> tika 1.7-src https://dist.apache.org/repos/dist/dev/tika/
>   % Total% Received % Xferd  Average Speed   TimeTime Time
> Current
>  Dload  Upload   Total   SpentLeft
> Speed
> 100 58.8M  100 58.8M0 0   839k  0  0:01:11  0:01:11 --:--:--
> 1137k
>   % Total% Received % Xferd  Average Speed   TimeTime Time
> Current
>  Dload  Upload   Total   SpentLeft
> Speed
> 100   473  100   4730 0901  0 --:--:-- --:--:-- --:--:--
> 900
>   % Total% Received % Xferd  Average Speed   TimeTime Time
> Current
>  Dload  Upload   Total   SpentLeft
> Speed
> 10033  100330 0 58  0 --:--:-- --:--:-- --:--:--
>  58
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ls
> tika-1.7-src.zip  tika-1.7-src.zip.asc  tika-1.7-src.zip.md5
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs
> Verifying Signature for file tika-1.7-src.zip.asc
> gpg: Signature made Fri Jan  9 16:36:49 2015 EST using RSA key ID D4F10117
> gpg: Can't check signature: public key not found
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% curl -O
> https://people.apache.org/keys/group/tika.asc
>   % Total% Received % Xferd  Average Speed   TimeTime Time
> Current
>  Dload  Upload   Total   SpentLeft
> Speed
> 100  153k  100  153k0 0   149k  0  0:00:01  0:00:01 --:--:--
> 149k
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika
> tika: No such file or directory.
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika.asc
> gpg: key B876884A: "Chris Mattmann (CODE SIGNING KEY)
> " not changed
> gpg: key A355A63E: "Jukka Zitting " not changed
> gpg: key 8A26D9A6: "Jukka Zitting " not changed
> gpg: key 42CFAE07: "Jukka Zitting (CODE SIGNING KEY) "
> not changed
> gpg: key 0EB30B07: "David Meikle (CODE SIGNING KEY) "
> 2 new signatures
> gpg: key D84E41AE: "Nick Burch " 16 new signatures
> gpg: key 6E68DA61: "Michael McCandless (CODE SIGNING KEY)
> " not changed
> gpg: key 95D21F2E: "Ray Gauss II (CODE SIGNING KEY) "
> not changed
> gpg: key DEDEAB92: "Sergey Beryozkin (Release Management)
> " not changed
> gpg: key 97EDDE66: "tallison (apache_distro_keys) "
> not changed
> gpg: key D4F10117: public key "Tyler Palsulich "
> imported
> gpg: key 48BAEBF6: "Lewis John McGibbney (CODE SIGNING KEY)
> " not changed
> gpg: key 0890B1AB: public key "Konstantin Gribov (gross)
> " imported
> gpg: Total number processed: 13
> gpg:   imported: 2  (RSA: 2)
> gpg:  unchanged: 9
> gpg: new signatures: 18
> gpg: 3 marginal(s) needed, 1 complete(s) needed, PGP trust model
> gpg: depth: 0  valid:   3  signed:   0  trust: 0-, 0q, 0n, 0m, 0f, 3u
> gpg: next trustdb check due at 2015-08-18
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs
> Verifying Signature for file tika-1.7-src.zip.asc
> gpg: Signature made Fri Jan  9 16:36:49 2015 EST using RSA key ID D4F10117
> gpg: Good signature from "Tyler Palsulich "
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg:  There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 1D32 9CC2 D69C 821B FBE4  183E 8810 BB19 D4F1 0117
> Verifying Signature for file tika.asc
> gpg: verify signatures failed: unexpected data
> [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann%
> $HOME/bin/verify_md5_

RE: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Allison, Timothy B.
+1

Built successfully on both Windows 7 and RHEL 6.5 for me...no Tesseract 
installed.  Relying on post rc2 release eval for TIKA 1445 against trunk for no 
new regressions.  Manually confirmed image metadata is being extracted.

Thank you, Tyler!

Best,

 Tim

-Original Message-
From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] 
Sent: Wednesday, January 14, 2015 10:54 AM
To: u...@tika.apache.org; dev@tika.apache.org
Subject: Re: [VOTE] Apache Tika 1.7 Release

+1 to release

GPG sigs and Checksums good (after import of tika.asc)

Great work Tyler and team!

Cheers,
Chris

[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/stage_apache_rc
tika 1.7-src https://dist.apache.org/repos/dist/dev/tika/
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
100 58.8M  100 58.8M0 0   839k  0  0:01:11  0:01:11 --:--:--
1137k
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
100   473  100   4730 0901  0 --:--:-- --:--:-- --:--:--
900
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
10033  100330 0 58  0 --:--:-- --:--:-- --:--:--
 58
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ls
tika-1.7-src.zip  tika-1.7-src.zip.asc  tika-1.7-src.zip.md5
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs
Verifying Signature for file tika-1.7-src.zip.asc
gpg: Signature made Fri Jan  9 16:36:49 2015 EST using RSA key ID D4F10117
gpg: Can't check signature: public key not found
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% curl -O
https://people.apache.org/keys/group/tika.asc
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
100  153k  100  153k0 0   149k  0  0:00:01  0:00:01 --:--:--
149k
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika
tika: No such file or directory.
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika.asc
gpg: key B876884A: "Chris Mattmann (CODE SIGNING KEY)
" not changed
gpg: key A355A63E: "Jukka Zitting " not changed
gpg: key 8A26D9A6: "Jukka Zitting " not changed
gpg: key 42CFAE07: "Jukka Zitting (CODE SIGNING KEY) "
not changed
gpg: key 0EB30B07: "David Meikle (CODE SIGNING KEY) "
2 new signatures
gpg: key D84E41AE: "Nick Burch " 16 new signatures
gpg: key 6E68DA61: "Michael McCandless (CODE SIGNING KEY)
" not changed
gpg: key 95D21F2E: "Ray Gauss II (CODE SIGNING KEY) "
not changed
gpg: key DEDEAB92: "Sergey Beryozkin (Release Management)
" not changed
gpg: key 97EDDE66: "tallison (apache_distro_keys) "
not changed
gpg: key D4F10117: public key "Tyler Palsulich "
imported
gpg: key 48BAEBF6: "Lewis John McGibbney (CODE SIGNING KEY)
" not changed
gpg: key 0890B1AB: public key "Konstantin Gribov (gross)
" imported
gpg: Total number processed: 13
gpg:   imported: 2  (RSA: 2)
gpg:  unchanged: 9
gpg: new signatures: 18
gpg: 3 marginal(s) needed, 1 complete(s) needed, PGP trust model
gpg: depth: 0  valid:   3  signed:   0  trust: 0-, 0q, 0n, 0m, 0f, 3u
gpg: next trustdb check due at 2015-08-18
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs
Verifying Signature for file tika-1.7-src.zip.asc
gpg: Signature made Fri Jan  9 16:36:49 2015 EST using RSA key ID D4F10117
gpg: Good signature from "Tyler Palsulich "
gpg: WARNING: This key is not certified with a trusted signature!
gpg:  There is no indication that the signature belongs to the
owner.
Primary key fingerprint: 1D32 9CC2 D69C 821B FBE4  183E 8810 BB19 D4F1 0117
Verifying Signature for file tika.asc
gpg: verify signatures failed: unexpected data
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann%
$HOME/bin/verify_md5_checksums
md5sum: stat '*.tar.gz': No such file or directory
md5sum: stat '*.bz2': No such file or directory
md5sum: stat '*.tgz': No such file or directory
tika-1.7-src.zip: OK
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann%



++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 

Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Mattmann, Chris A (3980)
+1 to release

GPG sigs and Checksums good (after import of tika.asc)

Great work Tyler and team!

Cheers,
Chris

[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/stage_apache_rc
tika 1.7-src https://dist.apache.org/repos/dist/dev/tika/
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
100 58.8M  100 58.8M0 0   839k  0  0:01:11  0:01:11 --:--:--
1137k
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
100   473  100   4730 0901  0 --:--:-- --:--:-- --:--:--
900
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
10033  100330 0 58  0 --:--:-- --:--:-- --:--:--
 58
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ls
tika-1.7-src.zip  tika-1.7-src.zip.asc  tika-1.7-src.zip.md5
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs
Verifying Signature for file tika-1.7-src.zip.asc
gpg: Signature made Fri Jan  9 16:36:49 2015 EST using RSA key ID D4F10117
gpg: Can't check signature: public key not found
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% curl -O
https://people.apache.org/keys/group/tika.asc
  % Total% Received % Xferd  Average Speed   TimeTime Time
Current
 Dload  Upload   Total   SpentLeft
Speed
100  153k  100  153k0 0   149k  0  0:00:01  0:00:01 --:--:--
149k
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika
tika: No such file or directory.
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika.asc
gpg: key B876884A: "Chris Mattmann (CODE SIGNING KEY)
" not changed
gpg: key A355A63E: "Jukka Zitting " not changed
gpg: key 8A26D9A6: "Jukka Zitting " not changed
gpg: key 42CFAE07: "Jukka Zitting (CODE SIGNING KEY) "
not changed
gpg: key 0EB30B07: "David Meikle (CODE SIGNING KEY) "
2 new signatures
gpg: key D84E41AE: "Nick Burch " 16 new signatures
gpg: key 6E68DA61: "Michael McCandless (CODE SIGNING KEY)
" not changed
gpg: key 95D21F2E: "Ray Gauss II (CODE SIGNING KEY) "
not changed
gpg: key DEDEAB92: "Sergey Beryozkin (Release Management)
" not changed
gpg: key 97EDDE66: "tallison (apache_distro_keys) "
not changed
gpg: key D4F10117: public key "Tyler Palsulich "
imported
gpg: key 48BAEBF6: "Lewis John McGibbney (CODE SIGNING KEY)
" not changed
gpg: key 0890B1AB: public key "Konstantin Gribov (gross)
" imported
gpg: Total number processed: 13
gpg:   imported: 2  (RSA: 2)
gpg:  unchanged: 9
gpg: new signatures: 18
gpg: 3 marginal(s) needed, 1 complete(s) needed, PGP trust model
gpg: depth: 0  valid:   3  signed:   0  trust: 0-, 0q, 0n, 0m, 0f, 3u
gpg: next trustdb check due at 2015-08-18
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs
Verifying Signature for file tika-1.7-src.zip.asc
gpg: Signature made Fri Jan  9 16:36:49 2015 EST using RSA key ID D4F10117
gpg: Good signature from "Tyler Palsulich "
gpg: WARNING: This key is not certified with a trusted signature!
gpg:  There is no indication that the signature belongs to the
owner.
Primary key fingerprint: 1D32 9CC2 D69C 821B FBE4  183E 8810 BB19 D4F1 0117
Verifying Signature for file tika.asc
gpg: verify signatures failed: unexpected data
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann%
$HOME/bin/verify_md5_checksums
md5sum: stat '*.tar.gz': No such file or directory
md5sum: stat '*.bz2': No such file or directory
md5sum: stat '*.tgz': No such file or directory
tika-1.7-src.zip: OK
[chipotle:~/tmp/apache-tika-1.7-rc3] mattmann%



++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++






-Original Message-
From: Tyler Palsulich 
Reply-To: 
Date: Friday, January 9, 2015 at 5:02 PM
To: , "u...@tika.apache.org" 
Subject: [VOTE] Apache Tika 1.7 Release

>Hi All,
>
>A candidate for the Tika 1.7 release is available at:
>https://dist.apache.org/repos/dist/dev/tika/
>
>
>The release candidate is a zip archive of the sources in:
>http://svn.apache.org/repos/asf/tika/ta

Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Hong-Thai Nguyen
I've checked again some regression tests. Seem fine for me too. So +1

Great job Tyler !

On Fri, Jan 9, 2015 at 11:02 PM, Tyler Palsulich 
wrote:

> Hi All,
>
> A candidate for the Tika 1.7 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/
>
> The SHA1 checksum of the archive is
> b2190c267433e62c08560576ab7197e506bfdc11
>
> In addition, a staged maven repository is available here:
>
>
> https://repository.apache.org/content/repositories/orgapachetika-1007/org/apache/tika/
>
> Please vote on releasing this package as Apache Tika 1.7.
>
> The vote is open for the next 72 hours and passes if a majority of at least
> three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.7
> [ ] -1 Do not release this package because...
>
> Thanks!
> Tyler
>
> P.S. Here is my +1!
>



-- 
--
Hong-Thai


Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Nick Burch

On Fri, 9 Jan 2015, Tyler Palsulich wrote:

A candidate for the Tika 1.7 release is available at:
   https://dist.apache.org/repos/dist/dev/tika/

The release candidate is a zip archive of the sources in:
   http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/


All looks good to me (signatures, hashes, metadata etc), so I'm +1

Nick


Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Thomas Ledoux
+1, works for me

2015-01-13 9:23 GMT+01:00 Tyler Palsulich :

> Hi Folks,
>
> Let's mark this RC#2 as failed and shift the vote to the updated RC#3 (
> http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract
> metadata
> fixes and David's test fix.
>
> Thanks,
> Tyler
>
> On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer 
> wrote:
>
> > +1.
> >
> > Worked great once I manually
> > edited
> >
> tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties
> > and set useNonSequentialParser to true
> >
> > Peter
> >
>


Re: [VOTE] Apache Tika 1.7 Release

2015-01-13 Thread Lewis John Mcgibbney
+1

On Tue, Jan 13, 2015 at 3:23 AM, Tyler Palsulich 
wrote:

> Hi Folks,
>
> Let's mark this RC#2 as failed and shift the vote to the updated RC#3 (
> http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract
> metadata fixes and David's test fix.
>
> Thanks,
> Tyler
>
> On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer 
> wrote:
>
>> +1.
>>
>> Worked great once I manually
>> edited
>> tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties
>> and set useNonSequentialParser to true
>>
>> Peter
>>
>
>


-- 
*Lewis*


Re: [VOTE] Apache Tika 1.7 Release

2015-01-13 Thread Tyler Palsulich
Hi Folks,

Let's mark this RC#2 as failed and shift the vote to the updated RC#3 (
http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract metadata
fixes and David's test fix.

Thanks,
Tyler

On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer 
wrote:

> +1.
>
> Worked great once I manually
> edited
> tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties
> and set useNonSequentialParser to true
>
> Peter
>


[VOTE] Apache Tika 1.7 Release

2015-01-09 Thread Tyler Palsulich
Hi All,

A candidate for the Tika 1.7 release is available at:
https://dist.apache.org/repos/dist/dev/tika/

The release candidate is a zip archive of the sources in:
http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/

The SHA1 checksum of the archive is
b2190c267433e62c08560576ab7197e506bfdc11

In addition, a staged maven repository is available here:

https://repository.apache.org/content/repositories/orgapachetika-1007/org/apache/tika/

Please vote on releasing this package as Apache Tika 1.7.

The vote is open for the next 72 hours and passes if a majority of at least
three +1 Tika PMC votes are cast.

[ ] +1 Release this package as Apache Tika 1.7
[ ] -1 Do not release this package because...

Thanks!
Tyler

P.S. Here is my +1!


Re: [VOTE] Apache Tika 1.7 Release

2015-01-08 Thread Peter Bowyer
+1.

Worked great once I manually
edited 
tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties
and set useNonSequentialParser to true

Peter


Re: [VOTE] Apache Tika 1.7 Release

2015-01-08 Thread Hong-Thai Nguyen
Seems fine for me: +1

No big regression on our corpus test of 23K docs:

15-01-07 18:19:27 INFO  (DocumentConversionErrorPlugin.java : 116)
[pool-3-thread-1] Summary of document conversion errors:
- pdf (4)
* (2) org.apache.tika.exception.TikaException: TIKA-198: Illegal
IOException from org.apache.tika.parser.ParserDecorator$1@4b0b2006
* (1) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@4b0b2006
* (1) org.apache.tika.exception.TikaException: Unable to extract PDF content
- ps (3)
* (3) org.apache.tika.exception.TikaException: Unable to unpack document
stream
- pptx (10)
* (9) org.apache.tika.exception.TikaException: Error creating OOXML
extractor
* (1) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@45df8db8
- doc (6)
* (6) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@58797499
- ppt (14)
* (13) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@58797499
* (1) org.apache.tika.exception.TikaException: TIKA-198: Illegal
IOException from org.apache.tika.parser.ParserDecorator$1@58797499
- xls (9)
* (9) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@58797499
- vsd (3)
* (3) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@58797499
- odp (2)
* (2) org.apache.tika.exception.TikaException: TIKA-198: Illegal
IOException from org.apache.tika.parser.ParserDecorator$1@753ce4d8
- chm (1)
* (1) org.apache.tika.exception.TikaException: CHM file extract error:
extracted Length is wrong.
- dwg (4)
* (4) org.apache.tika.exception.TikaException: Unsupported AutoCAD drawing
version: AC1014
- pps (2)
* (2) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@58797499
- chw (1)
* (1) org.apache.tika.exception.TikaException: Unexpected RuntimeException
from org.apache.tika.parser.ParserDecorator$1@a0b8fca

Thank Tyler,

On Tue, Jan 6, 2015 at 7:59 AM, Tyler Palsulich 
wrote:

> Hi All,
>
> A candidate for the Tika 1.7 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/
>
> The SHA1 checksum of the archive is
> 0307a8367ae6f8b1103824fd11337fd89e24e6a4.
>
> In addition, a staged maven repository is available here:
>
>
> https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/
>
> Please vote on releasing this package as Apache Tika 1.7.
>
> The vote is open for the next 72 hours and passes if a majority of at least
> three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.7
> [ ] -1 Do not release this package because...
>
> Thanks!
> Tyler
>
> P.S. Count this as my +1!
>



-- 
--
Hong-Thai


Re: [VOTE] Apache Tika 1.7 Release

2015-01-07 Thread David Meikle
-1 on this for me too as there is a small unit test failure from ODFParser
on Windows from TIKA-1412.

I have added the tweak to fix this on trunk.

(I have also tested the latest changes added by Tim and Tyler in TIKA-1445
on Windows, Mac and Ubuntu with a decent batch of files, and everything is
working nicely at this end.)

On 7 January 2015 at 01:11, Allison, Timothy B.  wrote:

> -1
>
> I'm sorry that I haven't had a chance to kick the tires on the recent
> changes to the metadata extraction from images until now, but it looks like
> 1.7-rc2 and trunk are not pulling metadata from embedded images.
>
> I've posted a test file from govdocs1 to TIKA-1445.  I may have time
> tomorrow to see what's going on.  I should also have time tomorrow to
> finish the analysis of the comparison between 1.6 and 1.7 on govdocs1.
>
> Sorry for my delay, all!  And even greater apologies if user error is at
> fault and metadata is successfully being extracted from embedded images. :)
>
> Thank you, Tyler, for running this release!
>
>
> -Original Message-
> From: Nick Burch [mailto:apa...@gagravarr.org]
> Sent: Tuesday, January 06, 2015 11:36 AM
> To: dev@tika.apache.org
> Subject: Re: [VOTE] Apache Tika 1.7 Release
>
> On Tue, 6 Jan 2015, Tyler Palsulich wrote:
> > A candidate for the Tika 1.7 release is available at:
> >https://dist.apache.org/repos/dist/dev/tika/
> >
> > The release candidate is a zip archive of the sources in:
> >http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/
> >
> > The SHA1 checksum of the archive is
> >0307a8367ae6f8b1103824fd11337fd89e24e6a4.
> >
> > In addition, a staged maven repository is available here:
> >
> >
> https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/
>
> Looks good to me, I'm +1
>
> Nick
>


RE: [VOTE] Apache Tika 1.7 Release

2015-01-06 Thread Allison, Timothy B.
-1

I'm sorry that I haven't had a chance to kick the tires on the recent changes 
to the metadata extraction from images until now, but it looks like 1.7-rc2 and 
trunk are not pulling metadata from embedded images.

I've posted a test file from govdocs1 to TIKA-1445.  I may have time tomorrow 
to see what's going on.  I should also have time tomorrow to finish the 
analysis of the comparison between 1.6 and 1.7 on govdocs1.

Sorry for my delay, all!  And even greater apologies if user error is at fault 
and metadata is successfully being extracted from embedded images. :)

Thank you, Tyler, for running this release!


-Original Message-
From: Nick Burch [mailto:apa...@gagravarr.org] 
Sent: Tuesday, January 06, 2015 11:36 AM
To: dev@tika.apache.org
Subject: Re: [VOTE] Apache Tika 1.7 Release

On Tue, 6 Jan 2015, Tyler Palsulich wrote:
> A candidate for the Tika 1.7 release is available at:
>https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
>http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/
>
> The SHA1 checksum of the archive is
>0307a8367ae6f8b1103824fd11337fd89e24e6a4.
>
> In addition, a staged maven repository is available here:
>
> https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/

Looks good to me, I'm +1

Nick


Re: [VOTE] Apache Tika 1.7 Release

2015-01-06 Thread Nick Burch

On Tue, 6 Jan 2015, Tyler Palsulich wrote:

A candidate for the Tika 1.7 release is available at:
   https://dist.apache.org/repos/dist/dev/tika/

The release candidate is a zip archive of the sources in:
   http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/

The SHA1 checksum of the archive is
   0307a8367ae6f8b1103824fd11337fd89e24e6a4.

In addition, a staged maven repository is available here:

https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/


Looks good to me, I'm +1

Nick


[VOTE] Apache Tika 1.7 Release

2015-01-05 Thread Tyler Palsulich
Hi All,

A candidate for the Tika 1.7 release is available at:
https://dist.apache.org/repos/dist/dev/tika/

The release candidate is a zip archive of the sources in:
http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/

The SHA1 checksum of the archive is
0307a8367ae6f8b1103824fd11337fd89e24e6a4.

In addition, a staged maven repository is available here:

https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/

Please vote on releasing this package as Apache Tika 1.7.

The vote is open for the next 72 hours and passes if a majority of at least
three +1 Tika PMC votes are cast.

[ ] +1 Release this package as Apache Tika 1.7
[ ] -1 Do not release this package because...

Thanks!
Tyler

P.S. Count this as my +1!


Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
Thanks, Nick! You were right. OK -- Technically, RC#1 is up at
https://dist.apache.org/repos/dist/dev/tika/.

> Should I also patch the rc1 branch or will you re-branch from trunk?
I'll re-branch.

Tyler

On Mon, Jan 5, 2015 at 12:03 PM, Allison, Timothy B. 
wrote:

> I'll patch trunk tonight (with null check, of course :)).  Should I also
> patch the rc1 branch or will you re-branch from trunk?
>
> -Original Message-
> From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
> Sent: Monday, January 05, 2015 11:38 AM
> To: dev@tika.apache.org
> Subject: Re: 1.7 release? | potential blocker?
>
> Works for me. I got stalled midway through the process of getting RC#1 out
> (authentication issues). But, going to try to finish it right now (best way
> to upload to dist.apache.org?
> http://www.apache.org/dev/release.html#upload-scp each file?). I won't
> send
> a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2.
>
> Sound good?
>
> Tyler
>
> On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. 
> wrote:
>
> > All,
> >
> > I think I may have found a problem with the interaction of
> > OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7.
> >
> > If you use the AutoDetectParser instead of the OutlookPSTParser() in
> > OutlookPSTParserTest:
> >
> > //   OutlookPSTParser pstParser = new OutlookPSTParser();
> > Parser pstParser = new AutoDetectParser();
> >
> > I'm seeing this exception:
> >
> > org.apache.tika.exception.TikaException: Failed to close temporary
> > resources
> > at
> >
> org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
> > at
> > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)
> >
> > Are others seeing this?
> >
> > I'll try to dig into this today, might not get to it until tomorrow.
> >
> > Best,
> >
> > Tim
> >
> >
> >
> > -Original Message-
> > From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
> > Sent: Monday, December 22, 2014 1:58 PM
> > To: dev@tika.apache.org
> > Subject: Re: 1.7 release?
> >
> > Hi All,
> >
> > Nick added the temporary fix for TIKA-1445 and made the POI updates for
> > TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for
> 1.7!
> > :)
> >
> > I'll start the process this weekend or a couple days into the new year.
> >
> > Cheers,
> > Tyler
> > On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" <
> > chris.a.mattm...@jpl.nasa.gov> wrote:
> >
> > > +1
> > >
> > > ++
> > > Chris Mattmann, Ph.D.
> > > Chief Architect
> > > Instrument Software and Science Data Systems Section (398)
> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > > Office: 168-519, Mailstop: 168-527
> > > Email: chris.a.mattm...@nasa.gov
> > > WWW:  http://sunset.usc.edu/~mattmann/
> > > ++
> > > Adjunct Associate Professor, Computer Science Department
> > > University of Southern California, Los Angeles, CA 90089 USA
> > > ++
> > >
> > >
> > >
> > >
> > >
> > >
> > > -Original Message-
> > > From: Tyler Palsulich 
> > > Reply-To: "dev@tika.apache.org" 
> > > Date: Thursday, December 18, 2014 at 9:15 PM
> > > To: "dev@tika.apache.org" 
> > > Subject: Re: 1.7 release?
> > >
> > > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
> > > >Nick
> > > >just recommended, I'll try adding metadata extraction to Tesseract
> soon,
> > > >then adding the extensible solution in 1.8.
> > > >
> > > >Tyler
> > > >
> > > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
> > > >chris.a.mattm...@jpl.nasa.gov> wrote:
> > > >>
> > > >> I haven’t tried my hand at it - been super busy. tyler if you have a
> > > >> chance go for it, I think that’s the remaining blocker.
> > > >>
> > > >> ++++++++++
> > > >> Chris Mattmann, Ph.D.
> > > >> Chief Architect
> > > &g

RE: 1.7 release? | potential blocker?

2015-01-05 Thread Allison, Timothy B.
I'll patch trunk tonight (with null check, of course :)).  Should I also patch 
the rc1 branch or will you re-branch from trunk?

-Original Message-
From: Tyler Palsulich [mailto:tpalsul...@gmail.com] 
Sent: Monday, January 05, 2015 11:38 AM
To: dev@tika.apache.org
Subject: Re: 1.7 release? | potential blocker?

Works for me. I got stalled midway through the process of getting RC#1 out
(authentication issues). But, going to try to finish it right now (best way
to upload to dist.apache.org?
http://www.apache.org/dev/release.html#upload-scp each file?). I won't send
a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2.

Sound good?

Tyler

On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. 
wrote:

> All,
>
> I think I may have found a problem with the interaction of
> OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7.
>
> If you use the AutoDetectParser instead of the OutlookPSTParser() in
> OutlookPSTParserTest:
>
> //   OutlookPSTParser pstParser = new OutlookPSTParser();
> Parser pstParser = new AutoDetectParser();
>
> I'm seeing this exception:
>
> org.apache.tika.exception.TikaException: Failed to close temporary
> resources
> at
> org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
> at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)
>
> Are others seeing this?
>
> I'll try to dig into this today, might not get to it until tomorrow.
>
> Best,
>
> Tim
>
>
>
> -Original Message-
> From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
> Sent: Monday, December 22, 2014 1:58 PM
> To: dev@tika.apache.org
> Subject: Re: 1.7 release?
>
> Hi All,
>
> Nick added the temporary fix for TIKA-1445 and made the POI updates for
> TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
> :)
>
> I'll start the process this weekend or a couple days into the new year.
>
> Cheers,
> Tyler
> On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" <
> chris.a.mattm...@jpl.nasa.gov> wrote:
>
> > +1
> >
> > ++
> > Chris Mattmann, Ph.D.
> > Chief Architect
> > Instrument Software and Science Data Systems Section (398)
> > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > Office: 168-519, Mailstop: 168-527
> > Email: chris.a.mattm...@nasa.gov
> > WWW:  http://sunset.usc.edu/~mattmann/
> > ++
> > Adjunct Associate Professor, Computer Science Department
> > University of Southern California, Los Angeles, CA 90089 USA
> > ++
> >
> >
> >
> >
> >
> >
> > -Original Message-
> > From: Tyler Palsulich 
> > Reply-To: "dev@tika.apache.org" 
> > Date: Thursday, December 18, 2014 at 9:15 PM
> > To: "dev@tika.apache.org" 
> > Subject: Re: 1.7 release?
> >
> > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
> > >Nick
> > >just recommended, I'll try adding metadata extraction to Tesseract soon,
> > >then adding the extensible solution in 1.8.
> > >
> > >Tyler
> > >
> > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
> > >chris.a.mattm...@jpl.nasa.gov> wrote:
> > >>
> > >> I haven’t tried my hand at it - been super busy. tyler if you have a
> > >> chance go for it, I think that’s the remaining blocker.
> > >>
> > >> ++
> > >> Chris Mattmann, Ph.D.
> > >> Chief Architect
> > >> Instrument Software and Science Data Systems Section (398)
> > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > >> Office: 168-519, Mailstop: 168-527
> > >> Email: chris.a.mattm...@nasa.gov
> > >> WWW:  http://sunset.usc.edu/~mattmann/
> > >> ++
> > >> Adjunct Associate Professor, Computer Science Department
> > >> University of Southern California, Los Angeles, CA 90089 USA
> > >> ++
> > >>
> > >>
> > >>
> > >>
> > >>
> > >>
> > >> -Original Message-
> > >> From: Tyler Palsulich 
> > >>

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Nick Burch

On Mon, 5 Jan 2015, Tyler Palsulich wrote:

Works for me. I got stalled midway through the process of getting RC#1 out
(authentication issues). But, going to try to finish it right now (best way
to upload to dist.apache.org?


That's a svn checkout

For the RC, assuming it's the same process as for Apache POI, you checkout 
https://dist.apache.org/repos/dist/dev/tika and put the files there


Then, if the vote passes, you svn mv them to 
https://dist.apache.org/repos/dist/release/tika/ + upload things to maven 
central


Nick


Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
Works for me. I got stalled midway through the process of getting RC#1 out
(authentication issues). But, going to try to finish it right now (best way
to upload to dist.apache.org?
http://www.apache.org/dev/release.html#upload-scp each file?). I won't send
a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2.

Sound good?

Tyler

On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. 
wrote:

> All,
>
> I think I may have found a problem with the interaction of
> OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7.
>
> If you use the AutoDetectParser instead of the OutlookPSTParser() in
> OutlookPSTParserTest:
>
> //   OutlookPSTParser pstParser = new OutlookPSTParser();
> Parser pstParser = new AutoDetectParser();
>
> I'm seeing this exception:
>
> org.apache.tika.exception.TikaException: Failed to close temporary
> resources
> at
> org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
> at
> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)
>
> Are others seeing this?
>
> I'll try to dig into this today, might not get to it until tomorrow.
>
> Best,
>
> Tim
>
>
>
> -Original Message-
> From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
> Sent: Monday, December 22, 2014 1:58 PM
> To: dev@tika.apache.org
> Subject: Re: 1.7 release?
>
> Hi All,
>
> Nick added the temporary fix for TIKA-1445 and made the POI updates for
> TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
> :)
>
> I'll start the process this weekend or a couple days into the new year.
>
> Cheers,
> Tyler
> On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" <
> chris.a.mattm...@jpl.nasa.gov> wrote:
>
> > +1
> >
> > ++
> > Chris Mattmann, Ph.D.
> > Chief Architect
> > Instrument Software and Science Data Systems Section (398)
> > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > Office: 168-519, Mailstop: 168-527
> > Email: chris.a.mattm...@nasa.gov
> > WWW:  http://sunset.usc.edu/~mattmann/
> > ++
> > Adjunct Associate Professor, Computer Science Department
> > University of Southern California, Los Angeles, CA 90089 USA
> > ++++++
> >
> >
> >
> >
> >
> >
> > -Original Message-
> > From: Tyler Palsulich 
> > Reply-To: "dev@tika.apache.org" 
> > Date: Thursday, December 18, 2014 at 9:15 PM
> > To: "dev@tika.apache.org" 
> > Subject: Re: 1.7 release?
> >
> > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
> > >Nick
> > >just recommended, I'll try adding metadata extraction to Tesseract soon,
> > >then adding the extensible solution in 1.8.
> > >
> > >Tyler
> > >
> > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
> > >chris.a.mattm...@jpl.nasa.gov> wrote:
> > >>
> > >> I haven’t tried my hand at it - been super busy. tyler if you have a
> > >> chance go for it, I think that’s the remaining blocker.
> > >>
> > >> ++
> > >> Chris Mattmann, Ph.D.
> > >> Chief Architect
> > >> Instrument Software and Science Data Systems Section (398)
> > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > >> Office: 168-519, Mailstop: 168-527
> > >> Email: chris.a.mattm...@nasa.gov
> > >> WWW:  http://sunset.usc.edu/~mattmann/
> > >> ++
> > >> Adjunct Associate Professor, Computer Science Department
> > >> University of Southern California, Los Angeles, CA 90089 USA
> > >> ++
> > >>
> > >>
> > >>
> > >>
> > >>
> > >>
> > >> -Original Message-
> > >> From: Tyler Palsulich 
> > >> Reply-To: "dev@tika.apache.org" 
> > >> Date: Thursday, December 18, 2014 at 12:54 PM
> > >> To: "dev@tika.apache.org" 
> > >> Subject: Re: 1.7 release?
> > >>
> > >> >Hi

RE: 1.7 release? | potential blocker?

2015-01-05 Thread Allison, Timothy B.
All,

I think I may have found a problem with the interaction of OutlookPSTParser 
with AutoDetectParser that I'd want to fix before 1.7.

If you use the AutoDetectParser instead of the OutlookPSTParser() in 
OutlookPSTParserTest:

//   OutlookPSTParser pstParser = new OutlookPSTParser();
Parser pstParser = new AutoDetectParser();

I'm seeing this exception:

org.apache.tika.exception.TikaException: Failed to close temporary resources
at 
org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
at 
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)

Are others seeing this?

I'll try to dig into this today, might not get to it until tomorrow.

Best,

Tim



-Original Message-
From: Tyler Palsulich [mailto:tpalsul...@gmail.com] 
Sent: Monday, December 22, 2014 1:58 PM
To: dev@tika.apache.org
Subject: Re: 1.7 release?

Hi All,

Nick added the temporary fix for TIKA-1445 and made the POI updates for
TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
:)

I'll start the process this weekend or a couple days into the new year.

Cheers,
Tyler
On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" <
chris.a.mattm...@jpl.nasa.gov> wrote:

> +1
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Tyler Palsulich 
> Reply-To: "dev@tika.apache.org" 
> Date: Thursday, December 18, 2014 at 9:15 PM
> To: "dev@tika.apache.org" 
> Subject: Re: 1.7 release?
>
> >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
> >Nick
> >just recommended, I'll try adding metadata extraction to Tesseract soon,
> >then adding the extensible solution in 1.8.
> >
> >Tyler
> >
> >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >>
> >> I haven’t tried my hand at it - been super busy. tyler if you have a
> >> chance go for it, I think that’s the remaining blocker.
> >>
> >> ++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-527
> >> Email: chris.a.mattm...@nasa.gov
> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> ++
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++
> >>
> >>
> >>
> >>
> >>
> >>
> >> -Original Message-
> >> From: Tyler Palsulich 
> >> Reply-To: "dev@tika.apache.org" 
> >> Date: Thursday, December 18, 2014 at 12:54 PM
> >> To: "dev@tika.apache.org" 
> >> Subject: Re: 1.7 release?
> >>
> >> >Hi All,
> >> >
> >> >It's been a few months, so I just want to follow up on this thread.
> >>We've
> >> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as
> >> >1.7
> >> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
> >>TIKA-1445?
> >> >Has anyone tried their hand at the suggested (significant) fix?
> >> >
> >> >Are there any other issues someone would like to fit in?
> >> >
> >> >Cheers,
> >> >Tyler
> >> >
> >> >[0] -
> >> >
> >>
> >>
> https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select
> >>e
> >> >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
> >> >
> >> >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) <
> >> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >> >>
> >>

Re: 1.7 release?

2015-01-02 Thread David Meikle

> On 22 Dec 2014, at 18:57, Tyler Palsulich  wrote:
> 
> Hi All,
> 
> Nick added the temporary fix for TIKA-1445 and made the POI updates for
> TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
> :)
> 
> I'll start the process this weekend or a couple days into the new year.

Nice one Tyler!

Cheers,
Dave

Re: 1.7 release?

2014-12-22 Thread Thomas Ledoux
+1 for going.
Many thanks to Tyler and to Nick to take the POI upgrade.

So many christmas gifts in advance or just after :-)

Merry christmas to all

2014-12-22 19:59 GMT+01:00 Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov>:

> WOOO HOO! Go Tyler go! :0) Merry Christmas bud.
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Tyler Palsulich 
> Reply-To: "dev@tika.apache.org" 
> Date: Monday, December 22, 2014 at 10:57 AM
> To: "dev@tika.apache.org" 
> Subject: Re: 1.7 release?
>
> >Hi All,
> >
> >Nick added the temporary fix for TIKA-1445 and made the POI updates for
> >TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for
> >1.7!
> >:)
> >
> >I'll start the process this weekend or a couple days into the new year.
> >
> >Cheers,
> >Tyler
> >On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" <
> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >
> >> +1
> >>
> >> ++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-527
> >> Email: chris.a.mattm...@nasa.gov
> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> ++
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++
> >>
> >>
> >>
> >>
> >>
> >>
> >> -Original Message-
> >> From: Tyler Palsulich 
> >> Reply-To: "dev@tika.apache.org" 
> >> Date: Thursday, December 18, 2014 at 9:15 PM
> >> To: "dev@tika.apache.org" 
> >> Subject: Re: 1.7 release?
> >>
> >> >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
> >> >Nick
> >> >just recommended, I'll try adding metadata extraction to Tesseract
> >>soon,
> >> >then adding the extensible solution in 1.8.
> >> >
> >> >Tyler
> >> >
> >> >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
> >> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >> >>
> >> >> I haven’t tried my hand at it - been super busy. tyler if you have a
> >> >> chance go for it, I think that’s the remaining blocker.
> >> >>
> >> >> ++
> >> >> Chris Mattmann, Ph.D.
> >> >> Chief Architect
> >> >> Instrument Software and Science Data Systems Section (398)
> >> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> >> Office: 168-519, Mailstop: 168-527
> >> >> Email: chris.a.mattm...@nasa.gov
> >> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> >> ++
> >> >> Adjunct Associate Professor, Computer Science Department
> >> >> University of Southern California, Los Angeles, CA 90089 USA
> >> >> ++
> >> >>
> >> >>
> >> >>
> >> >>
> >> >>
> >> >>
> >> >> -Original Message-
> >> >> From: Tyler Palsulich 
> >> >> Reply-To: "dev@tika.apache.org" 
> >> >> Date: Thursday, December 18, 2014 at 12:54 PM
> >> >> To: "dev@tika.apache.org" 
> >> >> Subject: Re: 1.7 release?
> >> >>
> >> >> >Hi All,
> >> >> >
> >> 

Re: 1.7 release?

2014-12-22 Thread Mattmann, Chris A (3980)
WOOO HOO! Go Tyler go! :0) Merry Christmas bud.

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Tyler Palsulich 
Reply-To: "dev@tika.apache.org" 
Date: Monday, December 22, 2014 at 10:57 AM
To: "dev@tika.apache.org" 
Subject: Re: 1.7 release?

>Hi All,
>
>Nick added the temporary fix for TIKA-1445 and made the POI updates for
>TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for
>1.7!
>:)
>
>I'll start the process this weekend or a couple days into the new year.
>
>Cheers,
>Tyler
>On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" <
>chris.a.mattm...@jpl.nasa.gov> wrote:
>
>> +1
>>
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++
>>
>>
>>
>>
>>
>>
>> -Original Message-
>> From: Tyler Palsulich 
>> Reply-To: "dev@tika.apache.org" 
>> Date: Thursday, December 18, 2014 at 9:15 PM
>> To: "dev@tika.apache.org" 
>> Subject: Re: 1.7 release?
>>
>> >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
>> >Nick
>> >just recommended, I'll try adding metadata extraction to Tesseract
>>soon,
>> >then adding the extensible solution in 1.8.
>> >
>> >Tyler
>> >
>> >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
>> >chris.a.mattm...@jpl.nasa.gov> wrote:
>> >>
>> >> I haven’t tried my hand at it - been super busy. tyler if you have a
>> >> chance go for it, I think that’s the remaining blocker.
>> >>
>> >> ++
>> >> Chris Mattmann, Ph.D.
>> >> Chief Architect
>> >> Instrument Software and Science Data Systems Section (398)
>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >> Office: 168-519, Mailstop: 168-527
>> >> Email: chris.a.mattm...@nasa.gov
>> >> WWW:  http://sunset.usc.edu/~mattmann/
>> >> ++
>> >> Adjunct Associate Professor, Computer Science Department
>> >> University of Southern California, Los Angeles, CA 90089 USA
>> >> ++
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >> -Original Message-
>> >> From: Tyler Palsulich 
>> >> Reply-To: "dev@tika.apache.org" 
>> >> Date: Thursday, December 18, 2014 at 12:54 PM
>> >> To: "dev@tika.apache.org" 
>> >> Subject: Re: 1.7 release?
>> >>
>> >> >Hi All,
>> >> >
>> >> >It's been a few months, so I just want to follow up on this thread.
>> >>We've
>> >> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA
>>marked as
>> >> >1.7
>> >> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
>> >>TIKA-1445?
>> >> >Has anyone tried their hand at the suggested (significant) fix?
>> >> >
>> >> >Are there any other issues someone would like to fit in?
>> >> >
>> >> >Cheers,
>> >> >Tyler
>> >> >
>> >> 

Re: 1.7 release?

2014-12-22 Thread Tyler Palsulich
Hi All,

Nick added the temporary fix for TIKA-1445 and made the POI updates for
TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7!
:)

I'll start the process this weekend or a couple days into the new year.

Cheers,
Tyler
On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" <
chris.a.mattm...@jpl.nasa.gov> wrote:

> +1
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Tyler Palsulich 
> Reply-To: "dev@tika.apache.org" 
> Date: Thursday, December 18, 2014 at 9:15 PM
> To: "dev@tika.apache.org" 
> Subject: Re: 1.7 release?
>
> >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
> >Nick
> >just recommended, I'll try adding metadata extraction to Tesseract soon,
> >then adding the extensible solution in 1.8.
> >
> >Tyler
> >
> >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >>
> >> I haven’t tried my hand at it - been super busy. tyler if you have a
> >> chance go for it, I think that’s the remaining blocker.
> >>
> >> ++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-527
> >> Email: chris.a.mattm...@nasa.gov
> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> ++
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++
> >>
> >>
> >>
> >>
> >>
> >>
> >> -Original Message-
> >> From: Tyler Palsulich 
> >> Reply-To: "dev@tika.apache.org" 
> >> Date: Thursday, December 18, 2014 at 12:54 PM
> >> To: "dev@tika.apache.org" 
> >> Subject: Re: 1.7 release?
> >>
> >> >Hi All,
> >> >
> >> >It's been a few months, so I just want to follow up on this thread.
> >>We've
> >> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as
> >> >1.7
> >> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
> >>TIKA-1445?
> >> >Has anyone tried their hand at the suggested (significant) fix?
> >> >
> >> >Are there any other issues someone would like to fit in?
> >> >
> >> >Cheers,
> >> >Tyler
> >> >
> >> >[0] -
> >> >
> >>
> >>
> https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select
> >>e
> >> >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
> >> >
> >> >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) <
> >> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >> >>
> >> >> Thanks Tim saw your patch and am looking now.
> >> >>
> >> >> ++
> >> >> Chris Mattmann, Ph.D.
> >> >> Chief Architect
> >> >> Instrument Software and Science Data Systems Section (398)
> >> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> >> Office: 168-519, Mailstop: 168-527
> >> >> Email: chris.a.mattm...@nasa.gov
> >> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> >> ++
> >> >> Adjunct Associate Professor, Computer Science Department
> >> >> University of Southern California, Los Angeles, CA 90089 USA
> >> >> 

Re: 1.7 release?

2014-12-18 Thread Mattmann, Chris A (3980)
+1

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Tyler Palsulich 
Reply-To: "dev@tika.apache.org" 
Date: Thursday, December 18, 2014 at 9:15 PM
To: "dev@tika.apache.org" 
Subject: Re: 1.7 release?

>I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As
>Nick
>just recommended, I'll try adding metadata extraction to Tesseract soon,
>then adding the extensible solution in 1.8.
>
>Tyler
>
>On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
>chris.a.mattm...@jpl.nasa.gov> wrote:
>>
>> I haven’t tried my hand at it - been super busy. tyler if you have a
>> chance go for it, I think that’s the remaining blocker.
>>
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++
>>
>>
>>
>>
>>
>>
>> -Original Message-
>> From: Tyler Palsulich 
>> Reply-To: "dev@tika.apache.org" 
>> Date: Thursday, December 18, 2014 at 12:54 PM
>> To: "dev@tika.apache.org" 
>> Subject: Re: 1.7 release?
>>
>> >Hi All,
>> >
>> >It's been a few months, so I just want to follow up on this thread.
>>We've
>> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as
>> >1.7
>> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with
>>TIKA-1445?
>> >Has anyone tried their hand at the suggested (significant) fix?
>> >
>> >Are there any other issues someone would like to fit in?
>> >
>> >Cheers,
>> >Tyler
>> >
>> >[0] -
>> >
>> 
>>https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select
>>e
>> >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
>> >
>> >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) <
>> >chris.a.mattm...@jpl.nasa.gov> wrote:
>> >>
>> >> Thanks Tim saw your patch and am looking now.
>> >>
>> >> ++
>> >> Chris Mattmann, Ph.D.
>> >> Chief Architect
>> >> Instrument Software and Science Data Systems Section (398)
>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >> Office: 168-519, Mailstop: 168-527
>> >> Email: chris.a.mattm...@nasa.gov
>> >> WWW:  http://sunset.usc.edu/~mattmann/
>> >> ++
>> >> Adjunct Associate Professor, Computer Science Department
>> >> University of Southern California, Los Angeles, CA 90089 USA
>> >> ++
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >> -Original Message-
>> >> From: , "Timothy B." 
>> >> Reply-To: "dev@tika.apache.org" 
>> >> Date: Monday, October 27, 2014 at 12:30 PM
>> >> To: "dev@tika.apache.org" 
>> >> Subject: RE: 1.7 release?
>> >>
>> >> >Sounds good.  As long as the default behavior remains the same, I'm
>> >> >happy.  I'm going to play with a combination of your patch and
>>Tyler's
>> >> >and see what the ramifications are for embedded docs.
>> >> >
>> >> >To confirm, the OCR integration is fantastic.  Thank you and Tyler!
>> >> >
>> >&g

Re: 1.7 release?

2014-12-18 Thread Tyler Palsulich
I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As Nick
just recommended, I'll try adding metadata extraction to Tesseract soon,
then adding the extensible solution in 1.8.

Tyler

On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:
>
> I haven’t tried my hand at it - been super busy. tyler if you have a
> chance go for it, I think that’s the remaining blocker.
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Tyler Palsulich 
> Reply-To: "dev@tika.apache.org" 
> Date: Thursday, December 18, 2014 at 12:54 PM
> To: "dev@tika.apache.org" 
> Subject: Re: 1.7 release?
>
> >Hi All,
> >
> >It's been a few months, so I just want to follow up on this thread. We've
> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as
> >1.7
> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445?
> >Has anyone tried their hand at the suggested (significant) fix?
> >
> >Are there any other issues someone would like to fit in?
> >
> >Cheers,
> >Tyler
> >
> >[0] -
> >
> https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selecte
> >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
> >
> >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) <
> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >>
> >> Thanks Tim saw your patch and am looking now.
> >>
> >> ++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-527
> >> Email: chris.a.mattm...@nasa.gov
> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> ++
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++
> >>
> >>
> >>
> >>
> >>
> >>
> >> -Original Message-
> >> From: , "Timothy B." 
> >> Reply-To: "dev@tika.apache.org" 
> >> Date: Monday, October 27, 2014 at 12:30 PM
> >> To: "dev@tika.apache.org" 
> >> Subject: RE: 1.7 release?
> >>
> >> >Sounds good.  As long as the default behavior remains the same, I'm
> >> >happy.  I'm going to play with a combination of your patch and Tyler's
> >> >and see what the ramifications are for embedded docs.
> >> >
> >> >To confirm, the OCR integration is fantastic.  Thank you and Tyler!
> >> >
> >> >
> >> >Best,
> >> >
> >> >   Tim
> >> >
> >> >-Original Message-
> >> >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov]
> >> >Sent: Friday, October 24, 2014 5:36 PM
> >> >To: dev@tika.apache.org
> >> >Subject: Re: 1.7 release?
> >> >
> >> >Hey Tim,
> >> >
> >> >What do you think about my existing patch for 1445? For example to
> >> >just call all the parsers? I thought I was seeing behavior that was
> >> >slow because of that, but it turned out to be Tesseract and my machine
> >> >at the time?
> >> >
> >> >I think my patch for 1445 may be enough, and we should get the metadata
> >> >I think? Thoughts?
> >> >
> >> >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll
> >> >even take it upon myself to try and experiment with the idea of
> >>multip

Re: 1.7 release?

2014-12-18 Thread Mattmann, Chris A (3980)
I haven’t tried my hand at it - been super busy. tyler if you have a
chance go for it, I think that’s the remaining blocker.

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Tyler Palsulich 
Reply-To: "dev@tika.apache.org" 
Date: Thursday, December 18, 2014 at 12:54 PM
To: "dev@tika.apache.org" 
Subject: Re: 1.7 release?

>Hi All,
>
>It's been a few months, so I just want to follow up on this thread. We've
>resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as
>1.7
>(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445?
>Has anyone tried their hand at the suggested (significant) fix?
>
>Are there any other issues someone would like to fit in?
>
>Cheers,
>Tyler
>
>[0] -
>https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selecte
>dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
>
>On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) <
>chris.a.mattm...@jpl.nasa.gov> wrote:
>>
>> Thanks Tim saw your patch and am looking now.
>>
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++
>>
>>
>>
>>
>>
>>
>> -Original Message-
>> From: , "Timothy B." 
>> Reply-To: "dev@tika.apache.org" 
>> Date: Monday, October 27, 2014 at 12:30 PM
>> To: "dev@tika.apache.org" 
>> Subject: RE: 1.7 release?
>>
>> >Sounds good.  As long as the default behavior remains the same, I'm
>> >happy.  I'm going to play with a combination of your patch and Tyler's
>> >and see what the ramifications are for embedded docs.
>> >
>> >To confirm, the OCR integration is fantastic.  Thank you and Tyler!
>> >
>> >
>> >Best,
>> >
>> >   Tim
>> >
>> >-Original Message-
>> >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov]
>> >Sent: Friday, October 24, 2014 5:36 PM
>> >To: dev@tika.apache.org
>> >Subject: Re: 1.7 release?
>> >
>> >Hey Tim,
>> >
>> >What do you think about my existing patch for 1445? For example to
>> >just call all the parsers? I thought I was seeing behavior that was
>> >slow because of that, but it turned out to be Tesseract and my machine
>> >at the time?
>> >
>> >I think my patch for 1445 may be enough, and we should get the metadata
>> >I think? Thoughts?
>> >
>> >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll
>> >even take it upon myself to try and experiment with the idea of
>>multiple
>> >parsers being called. I think a simple solution to the metadata key
>> >conflict issue is simply to have a policy to add values (by default)
>>and
>> >replace if a property is set in ParseContext. Some simple updates to
>> >CompositeParser would allow this.
>> >
>> >Thoughts?
>> >
>> >Cheers,
>> >Chris
>> >
>> >
>> >++
>> >Chris Mattmann, Ph.D.
>> >Chief Architect
>> >Instrument Software and Science Data Systems Section (398)
>> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >Office: 168-519, Mailstop: 168-527
>> >Email: chris.a.mattm...@nasa.gov
>> >WWW:  http://sunset.usc.edu/~mattmann/
>> >++
>> &

Re: 1.7 release?

2014-12-18 Thread Thomas Ledoux
Hi, it might be worth waiting until POI 3.11-FINAL is released so that the
TIKA release do not depend on a beta version. It's due on Sunday, corrects
a lot of old office parsing and just needs the patch in TIKA-1469 to
properly work.

Regards
  Thomas

2014-12-18 21:54 GMT+01:00 Tyler Palsulich :
>
> Hi All,
>
> It's been a few months, so I just want to follow up on this thread. We've
> resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as 1.7
> (TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445?
> Has anyone tried their hand at the suggested (significant) fix?
>
> Are there any other issues someone would like to fit in?
>
> Cheers,
> Tyler
>
> [0] -
>
> https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selectedTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel
>
> On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) <
> chris.a.mattm...@jpl.nasa.gov> wrote:
> >
> > Thanks Tim saw your patch and am looking now.
> >
> > ++
> > Chris Mattmann, Ph.D.
> > Chief Architect
> > Instrument Software and Science Data Systems Section (398)
> > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > Office: 168-519, Mailstop: 168-527
> > Email: chris.a.mattm...@nasa.gov
> > WWW:  http://sunset.usc.edu/~mattmann/
> > ++
> > Adjunct Associate Professor, Computer Science Department
> > University of Southern California, Los Angeles, CA 90089 USA
> > ++
> >
> >
> >
> >
> >
> >
> > -Original Message-
> > From: , "Timothy B." 
> > Reply-To: "dev@tika.apache.org" 
> > Date: Monday, October 27, 2014 at 12:30 PM
> > To: "dev@tika.apache.org" 
> > Subject: RE: 1.7 release?
> >
> > >Sounds good.  As long as the default behavior remains the same, I'm
> > >happy.  I'm going to play with a combination of your patch and Tyler's
> > >and see what the ramifications are for embedded docs.
> > >
> > >To confirm, the OCR integration is fantastic.  Thank you and Tyler!
> > >
> > >
> > >Best,
> > >
> > >   Tim
> > >
> > >-Original Message-
> > >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov]
> > >Sent: Friday, October 24, 2014 5:36 PM
> > >To: dev@tika.apache.org
> > >Subject: Re: 1.7 release?
> > >
> > >Hey Tim,
> > >
> > >What do you think about my existing patch for 1445? For example to
> > >just call all the parsers? I thought I was seeing behavior that was
> > >slow because of that, but it turned out to be Tesseract and my machine
> > >at the time?
> > >
> > >I think my patch for 1445 may be enough, and we should get the metadata
> > >I think? Thoughts?
> > >
> > >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll
> > >even take it upon myself to try and experiment with the idea of multiple
> > >parsers being called. I think a simple solution to the metadata key
> > >conflict issue is simply to have a policy to add values (by default) and
> > >replace if a property is set in ParseContext. Some simple updates to
> > >CompositeParser would allow this.
> > >
> > >Thoughts?
> > >
> > >Cheers,
> > >Chris
> > >
> > >
> > >++
> > >Chris Mattmann, Ph.D.
> > >Chief Architect
> > >Instrument Software and Science Data Systems Section (398)
> > >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > >Office: 168-519, Mailstop: 168-527
> > >Email: chris.a.mattm...@nasa.gov
> > >WWW:  http://sunset.usc.edu/~mattmann/
> > >++
> > >Adjunct Associate Professor, Computer Science Department
> > >University of Southern California, Los Angeles, CA 90089 USA
> > >++
> > >
> > >
> > >
> > >
> > >
> > >
> > >-Original Message-
> > >From: , "Timothy B." 
> > >Reply-To: "dev@tika.apache.org" 
> > >Date: Friday, October 24, 2014 at 2:24 PM
> > >To: "

Re: 1.7 release?

2014-12-18 Thread Tyler Palsulich
Hi All,

It's been a few months, so I just want to follow up on this thread. We've
resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as 1.7
(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445?
Has anyone tried their hand at the suggested (significant) fix?

Are there any other issues someone would like to fit in?

Cheers,
Tyler

[0] -
https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selectedTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel

On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:
>
> Thanks Tim saw your patch and am looking now.
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: , "Timothy B." 
> Reply-To: "dev@tika.apache.org" 
> Date: Monday, October 27, 2014 at 12:30 PM
> To: "dev@tika.apache.org" 
> Subject: RE: 1.7 release?
>
> >Sounds good.  As long as the default behavior remains the same, I'm
> >happy.  I'm going to play with a combination of your patch and Tyler's
> >and see what the ramifications are for embedded docs.
> >
> >To confirm, the OCR integration is fantastic.  Thank you and Tyler!
> >
> >
> >Best,
> >
> >       Tim
> >
> >-Original Message-
> >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov]
> >Sent: Friday, October 24, 2014 5:36 PM
> >To: dev@tika.apache.org
> >Subject: Re: 1.7 release?
> >
> >Hey Tim,
> >
> >What do you think about my existing patch for 1445? For example to
> >just call all the parsers? I thought I was seeing behavior that was
> >slow because of that, but it turned out to be Tesseract and my machine
> >at the time?
> >
> >I think my patch for 1445 may be enough, and we should get the metadata
> >I think? Thoughts?
> >
> >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll
> >even take it upon myself to try and experiment with the idea of multiple
> >parsers being called. I think a simple solution to the metadata key
> >conflict issue is simply to have a policy to add values (by default) and
> >replace if a property is set in ParseContext. Some simple updates to
> >CompositeParser would allow this.
> >
> >Thoughts?
> >
> >Cheers,
> >Chris
> >
> >
> >++
> >Chris Mattmann, Ph.D.
> >Chief Architect
> >Instrument Software and Science Data Systems Section (398)
> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >Office: 168-519, Mailstop: 168-527
> >Email: chris.a.mattm...@nasa.gov
> >WWW:  http://sunset.usc.edu/~mattmann/
> >++++++++++
> >Adjunct Associate Professor, Computer Science Department
> >University of Southern California, Los Angeles, CA 90089 USA
> >++
> >
> >
> >
> >
> >
> >
> >-Original Message-
> >From: , "Timothy B." 
> >Reply-To: "dev@tika.apache.org" 
> >Date: Friday, October 24, 2014 at 2:24 PM
> >To: "dev@tika.apache.org" 
> >Subject: RE: 1.7 release?
> >
> >>Sorry for coming late to the game on the implications of TIKA-1445.  I
> >>don't want to hold up the release of 1.7.
> >>
> >>However, would it be possible to return to the legacy default behavior of
> >>extracting metadata from images?
> >>
> >>We can then document on the OCR parser page on the wiki that you need to
> >>install Tesseract _and_ make a change in the parser/mime config file. If
> >>you want this new capability, it will take a small bit of work until we
> >>solve TIKA-1445.
> >>
> >>I worry that the current behavior of 1.7 would be surprising to most
> >>non-dev users (well, even to at least one dev :) ).
> >>
> >&g

Re: 1.7 release?

2014-10-27 Thread Mattmann, Chris A (3980)
Thanks Tim saw your patch and am looking now.

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: , "Timothy B." 
Reply-To: "dev@tika.apache.org" 
Date: Monday, October 27, 2014 at 12:30 PM
To: "dev@tika.apache.org" 
Subject: RE: 1.7 release?

>Sounds good.  As long as the default behavior remains the same, I'm
>happy.  I'm going to play with a combination of your patch and Tyler's
>and see what the ramifications are for embedded docs.
>
>To confirm, the OCR integration is fantastic.  Thank you and Tyler!
>
>
>Best,
>
>   Tim
>
>-Original Message-
>From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov]
>Sent: Friday, October 24, 2014 5:36 PM
>To: dev@tika.apache.org
>Subject: Re: 1.7 release?
>
>Hey Tim,
>
>What do you think about my existing patch for 1445? For example to
>just call all the parsers? I thought I was seeing behavior that was
>slow because of that, but it turned out to be Tesseract and my machine
>at the time?
>
>I think my patch for 1445 may be enough, and we should get the metadata
>I think? Thoughts?
>
>I honestly think we need to deliver Tesseract in 1.7. We're close. I'll
>even take it upon myself to try and experiment with the idea of multiple
>parsers being called. I think a simple solution to the metadata key
>conflict issue is simply to have a policy to add values (by default) and
>replace if a property is set in ParseContext. Some simple updates to
>CompositeParser would allow this.
>
>Thoughts?
>
>Cheers,
>Chris
>
>
>++
>Chris Mattmann, Ph.D.
>Chief Architect
>Instrument Software and Science Data Systems Section (398)
>NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>Office: 168-519, Mailstop: 168-527
>Email: chris.a.mattm...@nasa.gov
>WWW:  http://sunset.usc.edu/~mattmann/
>++
>Adjunct Associate Professor, Computer Science Department
>University of Southern California, Los Angeles, CA 90089 USA
>++++++++++
>
>
>
>
>
>
>-Original Message-
>From: , "Timothy B." 
>Reply-To: "dev@tika.apache.org" 
>Date: Friday, October 24, 2014 at 2:24 PM
>To: "dev@tika.apache.org" 
>Subject: RE: 1.7 release?
>
>>Sorry for coming late to the game on the implications of TIKA-1445.  I
>>don't want to hold up the release of 1.7.
>>
>>However, would it be possible to return to the legacy default behavior of
>>extracting metadata from images?
>>
>>We can then document on the OCR parser page on the wiki that you need to
>>install Tesseract _and_ make a change in the parser/mime config file. If
>>you want this new capability, it will take a small bit of work until we
>>solve TIKA-1445.
>>
>>I worry that the current behavior of 1.7 would be surprising to most
>>non-dev users (well, even to at least one dev :) ).
>>
>>Cheers,
>>  
>>  Tim
>>
>>
>>From: Oleg Tikhonov [olegtikho...@gmail.com]
>>Sent: Friday, October 24, 2014 2:24 PM
>>To: dev@tika.apache.org
>>Subject: Re: 1.7 release?
>>
>>Hi Tyler,
>>don't mention.
>>
>>Cheers,
>>Oleg
>>On Oct 24, 2014 8:02 PM, "Tyler Palsulich"  wrote:
>>
>>> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there
>>>any
>>> other issues anyone would like to resolve before a new release?
>>>
>>> Thanks,
>>> Tyler
>>>
>>> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov 
>>> wrote:
>>>
>>> > Sorry!!!
>>> >
>>> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) <
>>> > chris.a.mattm...@jpl.nasa.gov> wrote:
>>> >
>>> > > Thanks Oleg, will try tomorrow for me Los angeles time!
>>> > >
>>> > > +++

RE: 1.7 release?

2014-10-27 Thread Allison, Timothy B.
Sounds good.  As long as the default behavior remains the same, I'm happy.  I'm 
going to play with a combination of your patch and Tyler's and see what the 
ramifications are for embedded docs.

To confirm, the OCR integration is fantastic.  Thank you and Tyler!


Best,

   Tim

-Original Message-
From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] 
Sent: Friday, October 24, 2014 5:36 PM
To: dev@tika.apache.org
Subject: Re: 1.7 release?

Hey Tim,

What do you think about my existing patch for 1445? For example to
just call all the parsers? I thought I was seeing behavior that was
slow because of that, but it turned out to be Tesseract and my machine
at the time?

I think my patch for 1445 may be enough, and we should get the metadata
I think? Thoughts?

I honestly think we need to deliver Tesseract in 1.7. We're close. I'll
even take it upon myself to try and experiment with the idea of multiple
parsers being called. I think a simple solution to the metadata key
conflict issue is simply to have a policy to add values (by default) and
replace if a property is set in ParseContext. Some simple updates to
CompositeParser would allow this.

Thoughts?

Cheers,
Chris


++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: , "Timothy B." 
Reply-To: "dev@tika.apache.org" 
Date: Friday, October 24, 2014 at 2:24 PM
To: "dev@tika.apache.org" 
Subject: RE: 1.7 release?

>Sorry for coming late to the game on the implications of TIKA-1445.  I
>don't want to hold up the release of 1.7.
>
>However, would it be possible to return to the legacy default behavior of
>extracting metadata from images?
>
>We can then document on the OCR parser page on the wiki that you need to
>install Tesseract _and_ make a change in the parser/mime config file. If
>you want this new capability, it will take a small bit of work until we
>solve TIKA-1445.
>
>I worry that the current behavior of 1.7 would be surprising to most
>non-dev users (well, even to at least one dev :) ).
>
>Cheers,
>  
>  Tim
>
>
>From: Oleg Tikhonov [olegtikho...@gmail.com]
>Sent: Friday, October 24, 2014 2:24 PM
>To: dev@tika.apache.org
>Subject: Re: 1.7 release?
>
>Hi Tyler,
>don't mention.
>
>Cheers,
>Oleg
>On Oct 24, 2014 8:02 PM, "Tyler Palsulich"  wrote:
>
>> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there
>>any
>> other issues anyone would like to resolve before a new release?
>>
>> Thanks,
>> Tyler
>>
>> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov 
>> wrote:
>>
>> > Sorry!!!
>> >
>> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) <
>> > chris.a.mattm...@jpl.nasa.gov> wrote:
>> >
>> > > Thanks Oleg, will try tomorrow for me Los angeles time!
>> > >
>> > > ++
>> > > Chris Mattmann, Ph.D.
>> > > Chief Architect
>> > > Instrument Software and Science Data Systems Section (398)
>> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> > > Office: 168-519, Mailstop: 168-527
>> > > Email: chris.a.mattm...@nasa.gov
>> > > WWW:  http://sunset.usc.edu/~mattmann/
>> > > ++
>> > > Adjunct Associate Professor, Computer Science Department
>> > > University of Southern California, Los Angeles, CA 90089 USA
>> > > ++
>> > >
>> > >
>> > >
>> > >
>> > >
>> > >
>> > > -Original Message-
>> > > From: Oleg Tikhonov 
>> > > Reply-To: "dev@tika.apache.org" 
>> > > Date: Monday, October 20, 2014 at 11:20 PM
>> > > To: "dev@tika.apache.org" 
>> > > Subject: Re: 1.7 release?
>> > >
>> > > >Please take a try with newest patch.
>> > > >Cheers,
>> > > >Oleg
>&

Re: 1.7 release?

2014-10-24 Thread Mattmann, Chris A (3980)
Hey Tim,

What do you think about my existing patch for 1445? For example to
just call all the parsers? I thought I was seeing behavior that was
slow because of that, but it turned out to be Tesseract and my machine
at the time?

I think my patch for 1445 may be enough, and we should get the metadata
I think? Thoughts?

I honestly think we need to deliver Tesseract in 1.7. We're close. I'll
even take it upon myself to try and experiment with the idea of multiple
parsers being called. I think a simple solution to the metadata key
conflict issue is simply to have a policy to add values (by default) and
replace if a property is set in ParseContext. Some simple updates to
CompositeParser would allow this.

Thoughts?

Cheers,
Chris


++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: , "Timothy B." 
Reply-To: "dev@tika.apache.org" 
Date: Friday, October 24, 2014 at 2:24 PM
To: "dev@tika.apache.org" 
Subject: RE: 1.7 release?

>Sorry for coming late to the game on the implications of TIKA-1445.  I
>don't want to hold up the release of 1.7.
>
>However, would it be possible to return to the legacy default behavior of
>extracting metadata from images?
>
>We can then document on the OCR parser page on the wiki that you need to
>install Tesseract _and_ make a change in the parser/mime config file. If
>you want this new capability, it will take a small bit of work until we
>solve TIKA-1445.
>
>I worry that the current behavior of 1.7 would be surprising to most
>non-dev users (well, even to at least one dev :) ).
>
>Cheers,
>  
>  Tim
>
>
>From: Oleg Tikhonov [olegtikho...@gmail.com]
>Sent: Friday, October 24, 2014 2:24 PM
>To: dev@tika.apache.org
>Subject: Re: 1.7 release?
>
>Hi Tyler,
>don't mention.
>
>Cheers,
>Oleg
>On Oct 24, 2014 8:02 PM, "Tyler Palsulich"  wrote:
>
>> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there
>>any
>> other issues anyone would like to resolve before a new release?
>>
>> Thanks,
>> Tyler
>>
>> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov 
>> wrote:
>>
>> > Sorry!!!
>> >
>> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) <
>> > chris.a.mattm...@jpl.nasa.gov> wrote:
>> >
>> > > Thanks Oleg, will try tomorrow for me Los angeles time!
>> > >
>> > > ++
>> > > Chris Mattmann, Ph.D.
>> > > Chief Architect
>> > > Instrument Software and Science Data Systems Section (398)
>> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> > > Office: 168-519, Mailstop: 168-527
>> > > Email: chris.a.mattm...@nasa.gov
>> > > WWW:  http://sunset.usc.edu/~mattmann/
>> > > ++
>> > > Adjunct Associate Professor, Computer Science Department
>> > > University of Southern California, Los Angeles, CA 90089 USA
>> > > ++
>> > >
>> > >
>> > >
>> > >
>> > >
>> > >
>> > > -Original Message-
>> > > From: Oleg Tikhonov 
>> > > Reply-To: "dev@tika.apache.org" 
>> > > Date: Monday, October 20, 2014 at 11:20 PM
>> > > To: "dev@tika.apache.org" 
>> > > Subject: Re: 1.7 release?
>> > >
>> > > >Please take a try with newest patch.
>> > > >Cheers,
>> > > >Oleg
>> > > >
>> > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov <
>> olegtikho...@gmail.com>
>> > > >wrote:
>> > > >
>> > > >> Taken. Thanks. in progress ...
>> > > >>
>> > > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
>> > > >> chris.a.mattm...@jpl

RE: 1.7 release?

2014-10-24 Thread Allison, Timothy B.
Sorry for coming late to the game on the implications of TIKA-1445.  I don't 
want to hold up the release of 1.7.  

However, would it be possible to return to the legacy default behavior of 
extracting metadata from images?  

We can then document on the OCR parser page on the wiki that you need to 
install Tesseract _and_ make a change in the parser/mime config file. If you 
want this new capability, it will take a small bit of work until we solve 
TIKA-1445.

I worry that the current behavior of 1.7 would be surprising to most non-dev 
users (well, even to at least one dev :) ).

Cheers,
  
  Tim


From: Oleg Tikhonov [olegtikho...@gmail.com]
Sent: Friday, October 24, 2014 2:24 PM
To: dev@tika.apache.org
Subject: Re: 1.7 release?

Hi Tyler,
don't mention.

Cheers,
Oleg
On Oct 24, 2014 8:02 PM, "Tyler Palsulich"  wrote:

> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there any
> other issues anyone would like to resolve before a new release?
>
> Thanks,
> Tyler
>
> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov 
> wrote:
>
> > Sorry!!!
> >
> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) <
> > chris.a.mattm...@jpl.nasa.gov> wrote:
> >
> > > Thanks Oleg, will try tomorrow for me Los angeles time!
> > >
> > > ++
> > > Chris Mattmann, Ph.D.
> > > Chief Architect
> > > Instrument Software and Science Data Systems Section (398)
> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > > Office: 168-519, Mailstop: 168-527
> > > Email: chris.a.mattm...@nasa.gov
> > > WWW:  http://sunset.usc.edu/~mattmann/
> > > ++
> > > Adjunct Associate Professor, Computer Science Department
> > > University of Southern California, Los Angeles, CA 90089 USA
> > > ++
> > >
> > >
> > >
> > >
> > >
> > >
> > > -Original Message-
> > > From: Oleg Tikhonov 
> > > Reply-To: "dev@tika.apache.org" 
> > > Date: Monday, October 20, 2014 at 11:20 PM
> > > To: "dev@tika.apache.org" 
> > > Subject: Re: 1.7 release?
> > >
> > > >Please take a try with newest patch.
> > > >Cheers,
> > > >Oleg
> > > >
> > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov <
> olegtikho...@gmail.com>
> > > >wrote:
> > > >
> > > >> Taken. Thanks. in progress ...
> > > >>
> > > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
> > > >> chris.a.mattm...@jpl.nasa.gov> wrote:
> > > >>
> > > >>> Trunk is the current checkout/branch:
> > > >>>
> > > >>> http://svn.apache.org/repos/asf/tika/trunk
> > > >>>
> > > >>>
> > > >>> ++
> > > >>> Chris Mattmann, Ph.D.
> > > >>> Chief Architect
> > > >>> Instrument Software and Science Data Systems Section (398)
> > > >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > > >>> Office: 168-519, Mailstop: 168-527
> > > >>> Email: chris.a.mattm...@nasa.gov
> > > >>> WWW:  http://sunset.usc.edu/~mattmann/
> > > >>> ++
> > > >>> Adjunct Associate Professor, Computer Science Department
> > > >>> University of Southern California, Los Angeles, CA 90089 USA
> > > >>> ++
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>> -Original Message-
> > > >>> From: Oleg Tikhonov 
> > > >>> Reply-To: "dev@tika.apache.org" 
> > > >>> Date: Monday, October 20, 2014 at 10:16 PM
> > > >>> To: "dev@tika.apache.org" 
> > > >>> Subject: Re: 1.7 release?
> > > >>>
> > > >>> >Hi, I can try this on.
> > > &

Re: 1.7 release?

2014-10-24 Thread Oleg Tikhonov
Hi Tyler,
don't mention.

Cheers,
Oleg
On Oct 24, 2014 8:02 PM, "Tyler Palsulich"  wrote:

> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there any
> other issues anyone would like to resolve before a new release?
>
> Thanks,
> Tyler
>
> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov 
> wrote:
>
> > Sorry!!!
> >
> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) <
> > chris.a.mattm...@jpl.nasa.gov> wrote:
> >
> > > Thanks Oleg, will try tomorrow for me Los angeles time!
> > >
> > > ++
> > > Chris Mattmann, Ph.D.
> > > Chief Architect
> > > Instrument Software and Science Data Systems Section (398)
> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > > Office: 168-519, Mailstop: 168-527
> > > Email: chris.a.mattm...@nasa.gov
> > > WWW:  http://sunset.usc.edu/~mattmann/
> > > ++
> > > Adjunct Associate Professor, Computer Science Department
> > > University of Southern California, Los Angeles, CA 90089 USA
> > > ++
> > >
> > >
> > >
> > >
> > >
> > >
> > > -Original Message-
> > > From: Oleg Tikhonov 
> > > Reply-To: "dev@tika.apache.org" 
> > > Date: Monday, October 20, 2014 at 11:20 PM
> > > To: "dev@tika.apache.org" 
> > > Subject: Re: 1.7 release?
> > >
> > > >Please take a try with newest patch.
> > > >Cheers,
> > > >Oleg
> > > >
> > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov <
> olegtikho...@gmail.com>
> > > >wrote:
> > > >
> > > >> Taken. Thanks. in progress ...
> > > >>
> > > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
> > > >> chris.a.mattm...@jpl.nasa.gov> wrote:
> > > >>
> > > >>> Trunk is the current checkout/branch:
> > > >>>
> > > >>> http://svn.apache.org/repos/asf/tika/trunk
> > > >>>
> > > >>>
> > > >>> ++
> > > >>> Chris Mattmann, Ph.D.
> > > >>> Chief Architect
> > > >>> Instrument Software and Science Data Systems Section (398)
> > > >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > > >>> Office: 168-519, Mailstop: 168-527
> > > >>> Email: chris.a.mattm...@nasa.gov
> > > >>> WWW:  http://sunset.usc.edu/~mattmann/
> > > >>> ++
> > > >>> Adjunct Associate Professor, Computer Science Department
> > > >>> University of Southern California, Los Angeles, CA 90089 USA
> > > >>> ++
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>> -Original Message-
> > > >>> From: Oleg Tikhonov 
> > > >>> Reply-To: "dev@tika.apache.org" 
> > > >>> Date: Monday, October 20, 2014 at 10:16 PM
> > > >>> To: "dev@tika.apache.org" 
> > > >>> Subject: Re: 1.7 release?
> > > >>>
> > > >>> >Hi, I can try this on.
> > > >>> >What is a trunk?
> > > >>> >
> > > >>> >
> > > >>> >Thanks,
> > > >>> >Oleg
> > > >>> >
> > > >>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
> > > >>> >chris.a.mattm...@jpl.nasa.gov> wrote:
> > > >>> >
> > > >>> >> Hmm any idea why this is failing on Windows? Tyler P. and
> > > >>> >> I were talking the other day - maybe we shouldn't run the
> > > >>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
> > > >>> >>
> > > >>> >>
> ++
> > > >>> >> Chris Mattmann, Ph.D.
> > > >>> >&

Re: 1.7 release?

2014-10-24 Thread Tyler Palsulich
Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there any
other issues anyone would like to resolve before a new release?

Thanks,
Tyler

On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov 
wrote:

> Sorry!!!
>
> On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) <
> chris.a.mattm...@jpl.nasa.gov> wrote:
>
> > Thanks Oleg, will try tomorrow for me Los angeles time!
> >
> > ++
> > Chris Mattmann, Ph.D.
> > Chief Architect
> > Instrument Software and Science Data Systems Section (398)
> > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > Office: 168-519, Mailstop: 168-527
> > Email: chris.a.mattm...@nasa.gov
> > WWW:  http://sunset.usc.edu/~mattmann/
> > ++
> > Adjunct Associate Professor, Computer Science Department
> > University of Southern California, Los Angeles, CA 90089 USA
> > ++
> >
> >
> >
> >
> >
> >
> > -Original Message-
> > From: Oleg Tikhonov 
> > Reply-To: "dev@tika.apache.org" 
> > Date: Monday, October 20, 2014 at 11:20 PM
> > To: "dev@tika.apache.org" 
> > Subject: Re: 1.7 release?
> >
> > >Please take a try with newest patch.
> > >Cheers,
> > >Oleg
> > >
> > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov 
> > >wrote:
> > >
> > >> Taken. Thanks. in progress ...
> > >>
> > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
> > >> chris.a.mattm...@jpl.nasa.gov> wrote:
> > >>
> > >>> Trunk is the current checkout/branch:
> > >>>
> > >>> http://svn.apache.org/repos/asf/tika/trunk
> > >>>
> > >>>
> > >>> ++
> > >>> Chris Mattmann, Ph.D.
> > >>> Chief Architect
> > >>> Instrument Software and Science Data Systems Section (398)
> > >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > >>> Office: 168-519, Mailstop: 168-527
> > >>> Email: chris.a.mattm...@nasa.gov
> > >>> WWW:  http://sunset.usc.edu/~mattmann/
> > >>> ++
> > >>> Adjunct Associate Professor, Computer Science Department
> > >>> University of Southern California, Los Angeles, CA 90089 USA
> > >>> ++
> > >>>
> > >>>
> > >>>
> > >>>
> > >>>
> > >>>
> > >>> -Original Message-
> > >>> From: Oleg Tikhonov 
> > >>> Reply-To: "dev@tika.apache.org" 
> > >>> Date: Monday, October 20, 2014 at 10:16 PM
> > >>> To: "dev@tika.apache.org" 
> > >>> Subject: Re: 1.7 release?
> > >>>
> > >>> >Hi, I can try this on.
> > >>> >What is a trunk?
> > >>> >
> > >>> >
> > >>> >Thanks,
> > >>> >Oleg
> > >>> >
> > >>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
> > >>> >chris.a.mattm...@jpl.nasa.gov> wrote:
> > >>> >
> > >>> >> Hmm any idea why this is failing on Windows? Tyler P. and
> > >>> >> I were talking the other day - maybe we shouldn't run the
> > >>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
> > >>> >>
> > >>> >> ++
> > >>> >> Chris Mattmann, Ph.D.
> > >>> >> Chief Architect
> > >>> >> Instrument Software and Science Data Systems Section (398)
> > >>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > >>> >> Office: 168-519, Mailstop: 168-527
> > >>> >> Email: chris.a.mattm...@nasa.gov
> > >>> >> WWW:  http://sunset.usc.edu/~mattmann/
> > >>> >> ++
> > >>> >> Adjunct Associate Professor, Computer Science Department
> > >>> >> University of Southern California, Los Angeles, CA 90089 USA
> > >>> >> ++
> > >>> >>
> > >>> >>
> > >>> >>
> > >>> >>
> > >>> >>
> > >>> >>
> > >>> >> -Original Message-
> > >>> >> From: Hong-Thai Nguyen 
> > >>> >> Reply-To: "dev@tika.apache.org" 
> > >>> >> Date: Thursday, October 16, 2014 at 2:03 AM
> > >>> >> To: "dev@tika.apache.org" 
> > >>> >> Subject: Re: 1.7 release?
> > >>> >>
> > >>> >> >Hi Andrzej,
> > >>> >> >
> > >>> >> >We are impatient for 1.7 release too.
> > >>> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can
> > >>>build
> > >>> >> >successfully on Windows, I have no objection to release 1.7
> > >>> >> >
> > >>> >> >Thanks,
> > >>> >> >
> > >>> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki <
> a...@getopt.org>
> > >>> >>wrote:
> > >>> >> >
> > >>> >> >> Hi,
> > >>> >> >>
> > >>> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that
> > >>> >>includes
> > >>> >> >>the
> > >>> >> >> fix for broken ODF parsing...
> > >>> >> >>
> > >>> >> >> ---
> > >>> >> >> Best regards,
> > >>> >> >>
> > >>> >> >> Andrzej Bialecki
> > >>> >> >>
> > >>> >> >>
> > >>> >> >
> > >>> >> >
> > >>> >> >--
> > >>> >> >--
> > >>> >> >Hong-Thai
> > >>> >>
> > >>> >>
> > >>>
> > >>>
> > >>
> >
> >
>


Re: 1.7 release?

2014-10-20 Thread Oleg Tikhonov
Sorry!!!

On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> Thanks Oleg, will try tomorrow for me Los angeles time!
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Oleg Tikhonov 
> Reply-To: "dev@tika.apache.org" 
> Date: Monday, October 20, 2014 at 11:20 PM
> To: "dev@tika.apache.org" 
> Subject: Re: 1.7 release?
>
> >Please take a try with newest patch.
> >Cheers,
> >Oleg
> >
> >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov 
> >wrote:
> >
> >> Taken. Thanks. in progress ...
> >>
> >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
> >> chris.a.mattm...@jpl.nasa.gov> wrote:
> >>
> >>> Trunk is the current checkout/branch:
> >>>
> >>> http://svn.apache.org/repos/asf/tika/trunk
> >>>
> >>>
> >>> ++
> >>> Chris Mattmann, Ph.D.
> >>> Chief Architect
> >>> Instrument Software and Science Data Systems Section (398)
> >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>> Office: 168-519, Mailstop: 168-527
> >>> Email: chris.a.mattm...@nasa.gov
> >>> WWW:  http://sunset.usc.edu/~mattmann/
> >>> ++
> >>> Adjunct Associate Professor, Computer Science Department
> >>> University of Southern California, Los Angeles, CA 90089 USA
> >>> ++
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>> -Original Message-
> >>> From: Oleg Tikhonov 
> >>> Reply-To: "dev@tika.apache.org" 
> >>> Date: Monday, October 20, 2014 at 10:16 PM
> >>> To: "dev@tika.apache.org" 
> >>> Subject: Re: 1.7 release?
> >>>
> >>> >Hi, I can try this on.
> >>> >What is a trunk?
> >>> >
> >>> >
> >>> >Thanks,
> >>> >Oleg
> >>> >
> >>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
> >>> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >>> >
> >>> >> Hmm any idea why this is failing on Windows? Tyler P. and
> >>> >> I were talking the other day - maybe we shouldn't run the
> >>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
> >>> >>
> >>> >> ++
> >>> >> Chris Mattmann, Ph.D.
> >>> >> Chief Architect
> >>> >> Instrument Software and Science Data Systems Section (398)
> >>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>> >> Office: 168-519, Mailstop: 168-527
> >>> >> Email: chris.a.mattm...@nasa.gov
> >>> >> WWW:  http://sunset.usc.edu/~mattmann/
> >>> >> ++++++++++
> >>> >> Adjunct Associate Professor, Computer Science Department
> >>> >> University of Southern California, Los Angeles, CA 90089 USA
> >>> >> ++
> >>> >>
> >>> >>
> >>> >>
> >>> >>
> >>> >>
> >>> >>
> >>> >> -Original Message-
> >>> >> From: Hong-Thai Nguyen 
> >>> >> Reply-To: "dev@tika.apache.org" 
> >>> >> Date: Thursday, October 16, 2014 at 2:03 AM
> >>> >> To: "dev@tika.apache.org" 
> >>> >> Subject: Re: 1.7 release?
> >>> >>
> >>> >> >Hi Andrzej,
> >>> >> >
> >>> >> >We are impatient for 1.7 release too.
> >>> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can
> >>>build
> >>> >> >successfully on Windows, I have no objection to release 1.7
> >>> >> >
> >>> >> >Thanks,
> >>> >> >
> >>> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki 
> >>> >>wrote:
> >>> >> >
> >>> >> >> Hi,
> >>> >> >>
> >>> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that
> >>> >>includes
> >>> >> >>the
> >>> >> >> fix for broken ODF parsing...
> >>> >> >>
> >>> >> >> ---
> >>> >> >> Best regards,
> >>> >> >>
> >>> >> >> Andrzej Bialecki
> >>> >> >>
> >>> >> >>
> >>> >> >
> >>> >> >
> >>> >> >--
> >>> >> >--
> >>> >> >Hong-Thai
> >>> >>
> >>> >>
> >>>
> >>>
> >>
>
>


Re: 1.7 release?

2014-10-20 Thread Mattmann, Chris A (3980)
Thanks Oleg, will try tomorrow for me Los angeles time!

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Oleg Tikhonov 
Reply-To: "dev@tika.apache.org" 
Date: Monday, October 20, 2014 at 11:20 PM
To: "dev@tika.apache.org" 
Subject: Re: 1.7 release?

>Please take a try with newest patch.
>Cheers,
>Oleg
>
>On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov 
>wrote:
>
>> Taken. Thanks. in progress ...
>>
>> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
>> chris.a.mattm...@jpl.nasa.gov> wrote:
>>
>>> Trunk is the current checkout/branch:
>>>
>>> http://svn.apache.org/repos/asf/tika/trunk
>>>
>>>
>>> ++
>>> Chris Mattmann, Ph.D.
>>> Chief Architect
>>> Instrument Software and Science Data Systems Section (398)
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 168-519, Mailstop: 168-527
>>> Email: chris.a.mattm...@nasa.gov
>>> WWW:  http://sunset.usc.edu/~mattmann/
>>> ++
>>> Adjunct Associate Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++
>>>
>>>
>>>
>>>
>>>
>>>
>>> -Original Message-
>>> From: Oleg Tikhonov 
>>> Reply-To: "dev@tika.apache.org" 
>>> Date: Monday, October 20, 2014 at 10:16 PM
>>> To: "dev@tika.apache.org" 
>>> Subject: Re: 1.7 release?
>>>
>>> >Hi, I can try this on.
>>> >What is a trunk?
>>> >
>>> >
>>> >Thanks,
>>> >Oleg
>>> >
>>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
>>> >chris.a.mattm...@jpl.nasa.gov> wrote:
>>> >
>>> >> Hmm any idea why this is failing on Windows? Tyler P. and
>>> >> I were talking the other day - maybe we shouldn't run the
>>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
>>> >>
>>> >> ++
>>> >> Chris Mattmann, Ph.D.
>>> >> Chief Architect
>>> >> Instrument Software and Science Data Systems Section (398)
>>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> >> Office: 168-519, Mailstop: 168-527
>>> >> Email: chris.a.mattm...@nasa.gov
>>> >> WWW:  http://sunset.usc.edu/~mattmann/
>>> >> ++
>>> >> Adjunct Associate Professor, Computer Science Department
>>> >> University of Southern California, Los Angeles, CA 90089 USA
>>> >> ++
>>> >>
>>> >>
>>> >>
>>> >>
>>> >>
>>> >>
>>> >> -Original Message-
>>> >> From: Hong-Thai Nguyen 
>>> >> Reply-To: "dev@tika.apache.org" 
>>> >> Date: Thursday, October 16, 2014 at 2:03 AM
>>> >> To: "dev@tika.apache.org" 
>>> >> Subject: Re: 1.7 release?
>>> >>
>>> >> >Hi Andrzej,
>>> >> >
>>> >> >We are impatient for 1.7 release too.
>>> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can
>>>build
>>> >> >successfully on Windows, I have no objection to release 1.7
>>> >> >
>>> >> >Thanks,
>>> >> >
>>> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki 
>>> >>wrote:
>>> >> >
>>> >> >> Hi,
>>> >> >>
>>> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that
>>> >>includes
>>> >> >>the
>>> >> >> fix for broken ODF parsing...
>>> >> >>
>>> >> >> ---
>>> >> >> Best regards,
>>> >> >>
>>> >> >> Andrzej Bialecki
>>> >> >>
>>> >> >>
>>> >> >
>>> >> >
>>> >> >--
>>> >> >--
>>> >> >Hong-Thai
>>> >>
>>> >>
>>>
>>>
>>



Re: 1.7 release?

2014-10-20 Thread Oleg Tikhonov
Please take a try with newest patch.
Cheers,
Oleg

On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov 
wrote:

> Taken. Thanks. in progress ...
>
> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
> chris.a.mattm...@jpl.nasa.gov> wrote:
>
>> Trunk is the current checkout/branch:
>>
>> http://svn.apache.org/repos/asf/tika/trunk
>>
>>
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++
>>
>>
>>
>>
>>
>>
>> -Original Message-
>> From: Oleg Tikhonov 
>> Reply-To: "dev@tika.apache.org" 
>> Date: Monday, October 20, 2014 at 10:16 PM
>> To: "dev@tika.apache.org" 
>> Subject: Re: 1.7 release?
>>
>> >Hi, I can try this on.
>> >What is a trunk?
>> >
>> >
>> >Thanks,
>> >Oleg
>> >
>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
>> >chris.a.mattm...@jpl.nasa.gov> wrote:
>> >
>> >> Hmm any idea why this is failing on Windows? Tyler P. and
>> >> I were talking the other day - maybe we shouldn't run the
>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
>> >>
>> >> ++
>> >> Chris Mattmann, Ph.D.
>> >> Chief Architect
>> >> Instrument Software and Science Data Systems Section (398)
>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> >> Office: 168-519, Mailstop: 168-527
>> >> Email: chris.a.mattm...@nasa.gov
>> >> WWW:  http://sunset.usc.edu/~mattmann/
>> >> ++++++++++
>> >> Adjunct Associate Professor, Computer Science Department
>> >> University of Southern California, Los Angeles, CA 90089 USA
>> >> ++
>> >>
>> >>
>> >>
>> >>
>> >>
>> >>
>> >> -----Original Message-
>> >> From: Hong-Thai Nguyen 
>> >> Reply-To: "dev@tika.apache.org" 
>> >> Date: Thursday, October 16, 2014 at 2:03 AM
>> >> To: "dev@tika.apache.org" 
>> >> Subject: Re: 1.7 release?
>> >>
>> >> >Hi Andrzej,
>> >> >
>> >> >We are impatient for 1.7 release too.
>> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can build
>> >> >successfully on Windows, I have no objection to release 1.7
>> >> >
>> >> >Thanks,
>> >> >
>> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki 
>> >>wrote:
>> >> >
>> >> >> Hi,
>> >> >>
>> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that
>> >>includes
>> >> >>the
>> >> >> fix for broken ODF parsing...
>> >> >>
>> >> >> ---
>> >> >> Best regards,
>> >> >>
>> >> >> Andrzej Bialecki
>> >> >>
>> >> >>
>> >> >
>> >> >
>> >> >--
>> >> >--
>> >> >Hong-Thai
>> >>
>> >>
>>
>>
>


Re: 1.7 release?

2014-10-20 Thread Oleg Tikhonov
Taken. Thanks. in progress ...

On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> Trunk is the current checkout/branch:
>
> http://svn.apache.org/repos/asf/tika/trunk
>
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Oleg Tikhonov 
> Reply-To: "dev@tika.apache.org" 
> Date: Monday, October 20, 2014 at 10:16 PM
> To: "dev@tika.apache.org" 
> Subject: Re: 1.7 release?
>
> >Hi, I can try this on.
> >What is a trunk?
> >
> >
> >Thanks,
> >Oleg
> >
> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >
> >> Hmm any idea why this is failing on Windows? Tyler P. and
> >> I were talking the other day - maybe we shouldn't run the
> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
> >>
> >> ++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-527
> >> Email: chris.a.mattm...@nasa.gov
> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> ++
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++++++++++
> >>
> >>
> >>
> >>
> >>
> >>
> >> -Original Message-
> >> From: Hong-Thai Nguyen 
> >> Reply-To: "dev@tika.apache.org" 
> >> Date: Thursday, October 16, 2014 at 2:03 AM
> >> To: "dev@tika.apache.org" 
> >> Subject: Re: 1.7 release?
> >>
> >> >Hi Andrzej,
> >> >
> >> >We are impatient for 1.7 release too.
> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can build
> >> >successfully on Windows, I have no objection to release 1.7
> >> >
> >> >Thanks,
> >> >
> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki 
> >>wrote:
> >> >
> >> >> Hi,
> >> >>
> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that
> >>includes
> >> >>the
> >> >> fix for broken ODF parsing...
> >> >>
> >> >> ---
> >> >> Best regards,
> >> >>
> >> >> Andrzej Bialecki
> >> >>
> >> >>
> >> >
> >> >
> >> >--
> >> >--
> >> >Hong-Thai
> >>
> >>
>
>


Re: 1.7 release?

2014-10-20 Thread Mattmann, Chris A (3980)
Trunk is the current checkout/branch:

http://svn.apache.org/repos/asf/tika/trunk


++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Oleg Tikhonov 
Reply-To: "dev@tika.apache.org" 
Date: Monday, October 20, 2014 at 10:16 PM
To: "dev@tika.apache.org" 
Subject: Re: 1.7 release?

>Hi, I can try this on.
>What is a trunk?
>
>
>Thanks,
>Oleg
>
>On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
>chris.a.mattm...@jpl.nasa.gov> wrote:
>
>> Hmm any idea why this is failing on Windows? Tyler P. and
>> I were talking the other day - maybe we shouldn't run the
>> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
>>
>> ++
>> Chris Mattmann, Ph.D.
>> Chief Architect
>> Instrument Software and Science Data Systems Section (398)
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 168-519, Mailstop: 168-527
>> Email: chris.a.mattm...@nasa.gov
>> WWW:  http://sunset.usc.edu/~mattmann/
>> ++
>> Adjunct Associate Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++
>>
>>
>>
>>
>>
>>
>> -Original Message-
>> From: Hong-Thai Nguyen 
>> Reply-To: "dev@tika.apache.org" 
>> Date: Thursday, October 16, 2014 at 2:03 AM
>> To: "dev@tika.apache.org" 
>> Subject: Re: 1.7 release?
>>
>> >Hi Andrzej,
>> >
>> >We are impatient for 1.7 release too.
>> >I'm having compiling problem of TIKA-1422 on me. If anyone can build
>> >successfully on Windows, I have no objection to release 1.7
>> >
>> >Thanks,
>> >
>> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki 
>>wrote:
>> >
>> >> Hi,
>> >>
>> >> Any news on the 1.7 release? or at least a 1.6.1 release that
>>includes
>> >>the
>> >> fix for broken ODF parsing...
>> >>
>> >> ---
>> >> Best regards,
>> >>
>> >> Andrzej Bialecki
>> >>
>> >>
>> >
>> >
>> >--
>> >--
>> >Hong-Thai
>>
>>



Re: 1.7 release?

2014-10-20 Thread Oleg Tikhonov
Hi, I can try this on.
What is a trunk?


Thanks,
Oleg

On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> Hmm any idea why this is failing on Windows? Tyler P. and
> I were talking the other day - maybe we shouldn't run the
> tests from TIKA-1422 unless Tesseract is installed? Thoughts?
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Hong-Thai Nguyen 
> Reply-To: "dev@tika.apache.org" 
> Date: Thursday, October 16, 2014 at 2:03 AM
> To: "dev@tika.apache.org" 
> Subject: Re: 1.7 release?
>
> >Hi Andrzej,
> >
> >We are impatient for 1.7 release too.
> >I'm having compiling problem of TIKA-1422 on me. If anyone can build
> >successfully on Windows, I have no objection to release 1.7
> >
> >Thanks,
> >
> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki  wrote:
> >
> >> Hi,
> >>
> >> Any news on the 1.7 release? or at least a 1.6.1 release that includes
> >>the
> >> fix for broken ODF parsing...
> >>
> >> ---
> >> Best regards,
> >>
> >> Andrzej Bialecki
> >>
> >>
> >
> >
> >--
> >--
> >Hong-Thai
>
>


Re: 1.7 release?

2014-10-20 Thread Mattmann, Chris A (3980)
Hmm any idea why this is failing on Windows? Tyler P. and
I were talking the other day - maybe we shouldn't run the
tests from TIKA-1422 unless Tesseract is installed? Thoughts?

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++






-Original Message-
From: Hong-Thai Nguyen 
Reply-To: "dev@tika.apache.org" 
Date: Thursday, October 16, 2014 at 2:03 AM
To: "dev@tika.apache.org" 
Subject: Re: 1.7 release?

>Hi Andrzej,
>
>We are impatient for 1.7 release too.
>I'm having compiling problem of TIKA-1422 on me. If anyone can build
>successfully on Windows, I have no objection to release 1.7
>
>Thanks,
>
>On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki  wrote:
>
>> Hi,
>>
>> Any news on the 1.7 release? or at least a 1.6.1 release that includes
>>the
>> fix for broken ODF parsing...
>>
>> ---
>> Best regards,
>>
>> Andrzej Bialecki
>>
>>
>
>
>-- 
>--
>Hong-Thai



Re: 1.7 release?

2014-10-16 Thread Hong-Thai Nguyen
Hi Andrzej,

We are impatient for 1.7 release too.
I'm having compiling problem of TIKA-1422 on me. If anyone can build
successfully on Windows, I have no objection to release 1.7

Thanks,

On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki  wrote:

> Hi,
>
> Any news on the 1.7 release? or at least a 1.6.1 release that includes the
> fix for broken ODF parsing…
>
> ---
> Best regards,
>
> Andrzej Bialecki
>
>


-- 
--
Hong-Thai


1.7 release?

2014-10-16 Thread Andrzej Białecki
Hi,

Any news on the 1.7 release? or at least a 1.6.1 release that includes the fix 
for broken ODF parsing…

---
Best regards,

Andrzej Bialecki