Re: [VOTE] Apache Tika 1.7 Release
woot ;) Knew you would ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Tyler Palsulich Reply-To: "dev@tika.apache.org" Date: Thursday, January 15, 2015 at 1:36 PM To: "dev@tika.apache.org" Subject: Re: [VOTE] Apache Tika 1.7 Release >Found it: >https://github.com/chrismattmann/apachestuff/blob/master/extract-tika-cont >ribs >:) > >Thanks! >Tyler > >On Thu, Jan 15, 2015 at 8:57 AM, Tyler Palsulich >wrote: > >> Thanks, Chris! That sounds useful. Let me know when you get a chance to >> upload it somewhere. >> >> Tyler >> >> On Wed, Jan 14, 2015 at 11:22 PM, Mattmann, Chris A (3980) < >> chris.a.mattm...@jpl.nasa.gov> wrote: >> >>> yeah good idea Nick. Also I had a script that would partially >>> auto-generate >>> the contributors and so forth - let me see if I can find it (it used >>> Tika!) >>> Hah, how’s THAT for eating our own dogfood? >>> >>> ++ >>> Chris Mattmann, Ph.D. >>> Chief Architect >>> Instrument Software and Science Data Systems Section (398) >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>> Office: 168-519, Mailstop: 168-527 >>> Email: chris.a.mattm...@nasa.gov >>> WWW: http://sunset.usc.edu/~mattmann/ >>> ++ >>> Adjunct Associate Professor, Computer Science Department >>> University of Southern California, Los Angeles, CA 90089 USA >>> ++++++++++ >>> >>> >>> >>> >>> >>> >>> -Original Message- >>> From: Nick Burch >>> Reply-To: "dev@tika.apache.org" >>> Date: Wednesday, January 14, 2015 at 11:49 AM >>> To: "dev@tika.apache.org" >>> Subject: Re: [VOTE] Apache Tika 1.7 Release >>> >>> >On Wed, 14 Jan 2015, Tyler Palsulich wrote: >>> >> Nick, thanks for building the site! We still need to rebuild the >>>index, >>> >> right? >>> > >>> >You'll need to build the 1.7 index page (based on the changelog), then >>> >update the download page + homepage + menu, and finally rebuild the >>>site >>> > >>> >(All I did was finish off the formats page, which we update as >>> >development >>> >progresses, copy over + tweak the remaining pages, and start a formats >>> >page for the new version. Maybe it's worth adding all these steps to >>>the >>> >release process docs?) >>> > >>> >Nick >>> >>> >>
Re: [VOTE] Apache Tika 1.7 Release
Found it: https://github.com/chrismattmann/apachestuff/blob/master/extract-tika-contribs :) Thanks! Tyler On Thu, Jan 15, 2015 at 8:57 AM, Tyler Palsulich wrote: > Thanks, Chris! That sounds useful. Let me know when you get a chance to > upload it somewhere. > > Tyler > > On Wed, Jan 14, 2015 at 11:22 PM, Mattmann, Chris A (3980) < > chris.a.mattm...@jpl.nasa.gov> wrote: > >> yeah good idea Nick. Also I had a script that would partially >> auto-generate >> the contributors and so forth - let me see if I can find it (it used >> Tika!) >> Hah, how’s THAT for eating our own dogfood? >> >> ++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> ++ >> >> >> >> >> >> >> -Original Message- >> From: Nick Burch >> Reply-To: "dev@tika.apache.org" >> Date: Wednesday, January 14, 2015 at 11:49 AM >> To: "dev@tika.apache.org" >> Subject: Re: [VOTE] Apache Tika 1.7 Release >> >> >On Wed, 14 Jan 2015, Tyler Palsulich wrote: >> >> Nick, thanks for building the site! We still need to rebuild the index, >> >> right? >> > >> >You'll need to build the 1.7 index page (based on the changelog), then >> >update the download page + homepage + menu, and finally rebuild the site >> > >> >(All I did was finish off the formats page, which we update as >> >development >> >progresses, copy over + tweak the remaining pages, and start a formats >> >page for the new version. Maybe it's worth adding all these steps to the >> >release process docs?) >> > >> >Nick >> >> >
[RESULT] [VOTE] Apache Tika 1.7 Release Candidate #3
Hi All, The VOTE for releasing Apache Tika 1.7 RC#3 finished with the following tally: +1: Chris Mattmann David Meikle Hong-Thai Nguyen Nick Burch Tim Allison Tyler Palsulich +0: [None] -1: [None] Thank you everyone for voting! I will move forward with the release. Have a good day, Tyler
Re: [VOTE] Apache Tika 1.7 Release
Thanks, Chris! That sounds useful. Let me know when you get a chance to upload it somewhere. Tyler On Wed, Jan 14, 2015 at 11:22 PM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > yeah good idea Nick. Also I had a script that would partially auto-generate > the contributors and so forth - let me see if I can find it (it used Tika!) > Hah, how’s THAT for eating our own dogfood? > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Nick Burch > Reply-To: "dev@tika.apache.org" > Date: Wednesday, January 14, 2015 at 11:49 AM > To: "dev@tika.apache.org" > Subject: Re: [VOTE] Apache Tika 1.7 Release > > >On Wed, 14 Jan 2015, Tyler Palsulich wrote: > >> Nick, thanks for building the site! We still need to rebuild the index, > >> right? > > > >You'll need to build the 1.7 index page (based on the changelog), then > >update the download page + homepage + menu, and finally rebuild the site > > > >(All I did was finish off the formats page, which we update as > >development > >progresses, copy over + tweak the remaining pages, and start a formats > >page for the new version. Maybe it's worth adding all these steps to the > >release process docs?) > > > >Nick > >
Re: [VOTE] Apache Tika 1.7 Release
yeah good idea Nick. Also I had a script that would partially auto-generate the contributors and so forth - let me see if I can find it (it used Tika!) Hah, how’s THAT for eating our own dogfood? ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Nick Burch Reply-To: "dev@tika.apache.org" Date: Wednesday, January 14, 2015 at 11:49 AM To: "dev@tika.apache.org" Subject: Re: [VOTE] Apache Tika 1.7 Release >On Wed, 14 Jan 2015, Tyler Palsulich wrote: >> Nick, thanks for building the site! We still need to rebuild the index, >> right? > >You'll need to build the 1.7 index page (based on the changelog), then >update the download page + homepage + menu, and finally rebuild the site > >(All I did was finish off the formats page, which we update as >development >progresses, copy over + tweak the remaining pages, and start a formats >page for the new version. Maybe it's worth adding all these steps to the >release process docs?) > >Nick
Re: [VOTE] Apache Tika 1.7 Release
Hi Tyler, > On 9 Jan 2015, at 22:02, Tyler Palsulich wrote: > > A candidate for the Tika 1.7 release is available at: > https://dist.apache.org/repos/dist/dev/tika/ > <https://dist.apache.org/repos/dist/dev/tika/> > > The release candidate is a zip archive of the sources in: > http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/ > <http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/> +1 from me. Cheers, Dave
Re: [VOTE] Apache Tika 1.7 Release
On Wed, 14 Jan 2015, Tyler Palsulich wrote: Nick, thanks for building the site! We still need to rebuild the index, right? You'll need to build the 1.7 index page (based on the changelog), then update the download page + homepage + menu, and finally rebuild the site (All I did was finish off the formats page, which we update as development progresses, copy over + tweak the remaining pages, and start a formats page for the new version. Maybe it's worth adding all these steps to the release process docs?) Nick
Re: [VOTE] Apache Tika 1.7 Release
Thanks everyone! I'll close off this VOTE and roll the release tomorrow morning. Nick, thanks for building the site! We still need to rebuild the index, right? Tyler On Wed, Jan 14, 2015 at 8:37 AM, Allison, Timothy B. wrote: > +1 > > Built successfully on both Windows 7 and RHEL 6.5 for me...no Tesseract > installed. Relying on post rc2 release eval for TIKA 1445 against trunk > for no new regressions. Manually confirmed image metadata is being > extracted. > > Thank you, Tyler! > > Best, > > Tim > > -Original Message- > From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] > Sent: Wednesday, January 14, 2015 10:54 AM > To: u...@tika.apache.org; dev@tika.apache.org > Subject: Re: [VOTE] Apache Tika 1.7 Release > > +1 to release > > GPG sigs and Checksums good (after import of tika.asc) > > Great work Tyler and team! > > Cheers, > Chris > > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/stage_apache_rc > tika 1.7-src https://dist.apache.org/repos/dist/dev/tika/ > % Total% Received % Xferd Average Speed TimeTime Time > Current > Dload Upload Total SpentLeft > Speed > 100 58.8M 100 58.8M0 0 839k 0 0:01:11 0:01:11 --:--:-- > 1137k > % Total% Received % Xferd Average Speed TimeTime Time > Current > Dload Upload Total SpentLeft > Speed > 100 473 100 4730 0901 0 --:--:-- --:--:-- --:--:-- > 900 > % Total% Received % Xferd Average Speed TimeTime Time > Current > Dload Upload Total SpentLeft > Speed > 10033 100330 0 58 0 --:--:-- --:--:-- --:--:-- > 58 > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ls > tika-1.7-src.zip tika-1.7-src.zip.asc tika-1.7-src.zip.md5 > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs > Verifying Signature for file tika-1.7-src.zip.asc > gpg: Signature made Fri Jan 9 16:36:49 2015 EST using RSA key ID D4F10117 > gpg: Can't check signature: public key not found > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% curl -O > https://people.apache.org/keys/group/tika.asc > % Total% Received % Xferd Average Speed TimeTime Time > Current > Dload Upload Total SpentLeft > Speed > 100 153k 100 153k0 0 149k 0 0:00:01 0:00:01 --:--:-- > 149k > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika > tika: No such file or directory. > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika.asc > gpg: key B876884A: "Chris Mattmann (CODE SIGNING KEY) > " not changed > gpg: key A355A63E: "Jukka Zitting " not changed > gpg: key 8A26D9A6: "Jukka Zitting " not changed > gpg: key 42CFAE07: "Jukka Zitting (CODE SIGNING KEY) " > not changed > gpg: key 0EB30B07: "David Meikle (CODE SIGNING KEY) " > 2 new signatures > gpg: key D84E41AE: "Nick Burch " 16 new signatures > gpg: key 6E68DA61: "Michael McCandless (CODE SIGNING KEY) > " not changed > gpg: key 95D21F2E: "Ray Gauss II (CODE SIGNING KEY) " > not changed > gpg: key DEDEAB92: "Sergey Beryozkin (Release Management) > " not changed > gpg: key 97EDDE66: "tallison (apache_distro_keys) " > not changed > gpg: key D4F10117: public key "Tyler Palsulich " > imported > gpg: key 48BAEBF6: "Lewis John McGibbney (CODE SIGNING KEY) > " not changed > gpg: key 0890B1AB: public key "Konstantin Gribov (gross) > " imported > gpg: Total number processed: 13 > gpg: imported: 2 (RSA: 2) > gpg: unchanged: 9 > gpg: new signatures: 18 > gpg: 3 marginal(s) needed, 1 complete(s) needed, PGP trust model > gpg: depth: 0 valid: 3 signed: 0 trust: 0-, 0q, 0n, 0m, 0f, 3u > gpg: next trustdb check due at 2015-08-18 > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs > Verifying Signature for file tika-1.7-src.zip.asc > gpg: Signature made Fri Jan 9 16:36:49 2015 EST using RSA key ID D4F10117 > gpg: Good signature from "Tyler Palsulich " > gpg: WARNING: This key is not certified with a trusted signature! > gpg: There is no indication that the signature belongs to the > owner. > Primary key fingerprint: 1D32 9CC2 D69C 821B FBE4 183E 8810 BB19 D4F1 0117 > Verifying Signature for file tika.asc > gpg: verify signatures failed: unexpected data > [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% > $HOME/bin/verify_md5_
RE: [VOTE] Apache Tika 1.7 Release
+1 Built successfully on both Windows 7 and RHEL 6.5 for me...no Tesseract installed. Relying on post rc2 release eval for TIKA 1445 against trunk for no new regressions. Manually confirmed image metadata is being extracted. Thank you, Tyler! Best, Tim -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Wednesday, January 14, 2015 10:54 AM To: u...@tika.apache.org; dev@tika.apache.org Subject: Re: [VOTE] Apache Tika 1.7 Release +1 to release GPG sigs and Checksums good (after import of tika.asc) Great work Tyler and team! Cheers, Chris [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/stage_apache_rc tika 1.7-src https://dist.apache.org/repos/dist/dev/tika/ % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 100 58.8M 100 58.8M0 0 839k 0 0:01:11 0:01:11 --:--:-- 1137k % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 100 473 100 4730 0901 0 --:--:-- --:--:-- --:--:-- 900 % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 10033 100330 0 58 0 --:--:-- --:--:-- --:--:-- 58 [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ls tika-1.7-src.zip tika-1.7-src.zip.asc tika-1.7-src.zip.md5 [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs Verifying Signature for file tika-1.7-src.zip.asc gpg: Signature made Fri Jan 9 16:36:49 2015 EST using RSA key ID D4F10117 gpg: Can't check signature: public key not found [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% curl -O https://people.apache.org/keys/group/tika.asc % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 100 153k 100 153k0 0 149k 0 0:00:01 0:00:01 --:--:-- 149k [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika tika: No such file or directory. [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika.asc gpg: key B876884A: "Chris Mattmann (CODE SIGNING KEY) " not changed gpg: key A355A63E: "Jukka Zitting " not changed gpg: key 8A26D9A6: "Jukka Zitting " not changed gpg: key 42CFAE07: "Jukka Zitting (CODE SIGNING KEY) " not changed gpg: key 0EB30B07: "David Meikle (CODE SIGNING KEY) " 2 new signatures gpg: key D84E41AE: "Nick Burch " 16 new signatures gpg: key 6E68DA61: "Michael McCandless (CODE SIGNING KEY) " not changed gpg: key 95D21F2E: "Ray Gauss II (CODE SIGNING KEY) " not changed gpg: key DEDEAB92: "Sergey Beryozkin (Release Management) " not changed gpg: key 97EDDE66: "tallison (apache_distro_keys) " not changed gpg: key D4F10117: public key "Tyler Palsulich " imported gpg: key 48BAEBF6: "Lewis John McGibbney (CODE SIGNING KEY) " not changed gpg: key 0890B1AB: public key "Konstantin Gribov (gross) " imported gpg: Total number processed: 13 gpg: imported: 2 (RSA: 2) gpg: unchanged: 9 gpg: new signatures: 18 gpg: 3 marginal(s) needed, 1 complete(s) needed, PGP trust model gpg: depth: 0 valid: 3 signed: 0 trust: 0-, 0q, 0n, 0m, 0f, 3u gpg: next trustdb check due at 2015-08-18 [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs Verifying Signature for file tika-1.7-src.zip.asc gpg: Signature made Fri Jan 9 16:36:49 2015 EST using RSA key ID D4F10117 gpg: Good signature from "Tyler Palsulich " gpg: WARNING: This key is not certified with a trusted signature! gpg: There is no indication that the signature belongs to the owner. Primary key fingerprint: 1D32 9CC2 D69C 821B FBE4 183E 8810 BB19 D4F1 0117 Verifying Signature for file tika.asc gpg: verify signatures failed: unexpected data [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_md5_checksums md5sum: stat '*.tar.gz': No such file or directory md5sum: stat '*.bz2': No such file or directory md5sum: stat '*.tgz': No such file or directory tika-1.7-src.zip: OK [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA
Re: [VOTE] Apache Tika 1.7 Release
+1 to release GPG sigs and Checksums good (after import of tika.asc) Great work Tyler and team! Cheers, Chris [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/stage_apache_rc tika 1.7-src https://dist.apache.org/repos/dist/dev/tika/ % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 100 58.8M 100 58.8M0 0 839k 0 0:01:11 0:01:11 --:--:-- 1137k % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 100 473 100 4730 0901 0 --:--:-- --:--:-- --:--:-- 900 % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 10033 100330 0 58 0 --:--:-- --:--:-- --:--:-- 58 [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ls tika-1.7-src.zip tika-1.7-src.zip.asc tika-1.7-src.zip.md5 [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs Verifying Signature for file tika-1.7-src.zip.asc gpg: Signature made Fri Jan 9 16:36:49 2015 EST using RSA key ID D4F10117 gpg: Can't check signature: public key not found [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% curl -O https://people.apache.org/keys/group/tika.asc % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 100 153k 100 153k0 0 149k 0 0:00:01 0:00:01 --:--:-- 149k [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika tika: No such file or directory. [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% gpg --import < tika.asc gpg: key B876884A: "Chris Mattmann (CODE SIGNING KEY) " not changed gpg: key A355A63E: "Jukka Zitting " not changed gpg: key 8A26D9A6: "Jukka Zitting " not changed gpg: key 42CFAE07: "Jukka Zitting (CODE SIGNING KEY) " not changed gpg: key 0EB30B07: "David Meikle (CODE SIGNING KEY) " 2 new signatures gpg: key D84E41AE: "Nick Burch " 16 new signatures gpg: key 6E68DA61: "Michael McCandless (CODE SIGNING KEY) " not changed gpg: key 95D21F2E: "Ray Gauss II (CODE SIGNING KEY) " not changed gpg: key DEDEAB92: "Sergey Beryozkin (Release Management) " not changed gpg: key 97EDDE66: "tallison (apache_distro_keys) " not changed gpg: key D4F10117: public key "Tyler Palsulich " imported gpg: key 48BAEBF6: "Lewis John McGibbney (CODE SIGNING KEY) " not changed gpg: key 0890B1AB: public key "Konstantin Gribov (gross) " imported gpg: Total number processed: 13 gpg: imported: 2 (RSA: 2) gpg: unchanged: 9 gpg: new signatures: 18 gpg: 3 marginal(s) needed, 1 complete(s) needed, PGP trust model gpg: depth: 0 valid: 3 signed: 0 trust: 0-, 0q, 0n, 0m, 0f, 3u gpg: next trustdb check due at 2015-08-18 [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_gpg_sigs Verifying Signature for file tika-1.7-src.zip.asc gpg: Signature made Fri Jan 9 16:36:49 2015 EST using RSA key ID D4F10117 gpg: Good signature from "Tyler Palsulich " gpg: WARNING: This key is not certified with a trusted signature! gpg: There is no indication that the signature belongs to the owner. Primary key fingerprint: 1D32 9CC2 D69C 821B FBE4 183E 8810 BB19 D4F1 0117 Verifying Signature for file tika.asc gpg: verify signatures failed: unexpected data [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% $HOME/bin/verify_md5_checksums md5sum: stat '*.tar.gz': No such file or directory md5sum: stat '*.bz2': No such file or directory md5sum: stat '*.tgz': No such file or directory tika-1.7-src.zip: OK [chipotle:~/tmp/apache-tika-1.7-rc3] mattmann% ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++ -Original Message- From: Tyler Palsulich Reply-To: Date: Friday, January 9, 2015 at 5:02 PM To: , "u...@tika.apache.org" Subject: [VOTE] Apache Tika 1.7 Release >Hi All, > >A candidate for the Tika 1.7 release is available at: >https://dist.apache.org/repos/dist/dev/tika/ > > >The release candidate is a zip archive of the sources in: >http://svn.apache.org/repos/asf/tika/ta
Re: [VOTE] Apache Tika 1.7 Release
I've checked again some regression tests. Seem fine for me too. So +1 Great job Tyler ! On Fri, Jan 9, 2015 at 11:02 PM, Tyler Palsulich wrote: > Hi All, > > A candidate for the Tika 1.7 release is available at: > https://dist.apache.org/repos/dist/dev/tika/ > > The release candidate is a zip archive of the sources in: > http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/ > > The SHA1 checksum of the archive is > b2190c267433e62c08560576ab7197e506bfdc11 > > In addition, a staged maven repository is available here: > > > https://repository.apache.org/content/repositories/orgapachetika-1007/org/apache/tika/ > > Please vote on releasing this package as Apache Tika 1.7. > > The vote is open for the next 72 hours and passes if a majority of at least > three +1 Tika PMC votes are cast. > > [ ] +1 Release this package as Apache Tika 1.7 > [ ] -1 Do not release this package because... > > Thanks! > Tyler > > P.S. Here is my +1! > -- -- Hong-Thai
Re: [VOTE] Apache Tika 1.7 Release
On Fri, 9 Jan 2015, Tyler Palsulich wrote: A candidate for the Tika 1.7 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/ All looks good to me (signatures, hashes, metadata etc), so I'm +1 Nick
Re: [VOTE] Apache Tika 1.7 Release
+1, works for me 2015-01-13 9:23 GMT+01:00 Tyler Palsulich : > Hi Folks, > > Let's mark this RC#2 as failed and shift the vote to the updated RC#3 ( > http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract > metadata > fixes and David's test fix. > > Thanks, > Tyler > > On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer > wrote: > > > +1. > > > > Worked great once I manually > > edited > > > tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties > > and set useNonSequentialParser to true > > > > Peter > > >
Re: [VOTE] Apache Tika 1.7 Release
+1 On Tue, Jan 13, 2015 at 3:23 AM, Tyler Palsulich wrote: > Hi Folks, > > Let's mark this RC#2 as failed and shift the vote to the updated RC#3 ( > http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract > metadata fixes and David's test fix. > > Thanks, > Tyler > > On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer > wrote: > >> +1. >> >> Worked great once I manually >> edited >> tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties >> and set useNonSequentialParser to true >> >> Peter >> > > -- *Lewis*
Re: [VOTE] Apache Tika 1.7 Release
Hi Folks, Let's mark this RC#2 as failed and shift the vote to the updated RC#3 ( http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract metadata fixes and David's test fix. Thanks, Tyler On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer wrote: > +1. > > Worked great once I manually > edited > tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties > and set useNonSequentialParser to true > > Peter >
[VOTE] Apache Tika 1.7 Release
Hi All, A candidate for the Tika 1.7 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/ The SHA1 checksum of the archive is b2190c267433e62c08560576ab7197e506bfdc11 In addition, a staged maven repository is available here: https://repository.apache.org/content/repositories/orgapachetika-1007/org/apache/tika/ Please vote on releasing this package as Apache Tika 1.7. The vote is open for the next 72 hours and passes if a majority of at least three +1 Tika PMC votes are cast. [ ] +1 Release this package as Apache Tika 1.7 [ ] -1 Do not release this package because... Thanks! Tyler P.S. Here is my +1!
Re: [VOTE] Apache Tika 1.7 Release
+1. Worked great once I manually edited tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties and set useNonSequentialParser to true Peter
Re: [VOTE] Apache Tika 1.7 Release
Seems fine for me: +1 No big regression on our corpus test of 23K docs: 15-01-07 18:19:27 INFO (DocumentConversionErrorPlugin.java : 116) [pool-3-thread-1] Summary of document conversion errors: - pdf (4) * (2) org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.ParserDecorator$1@4b0b2006 * (1) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@4b0b2006 * (1) org.apache.tika.exception.TikaException: Unable to extract PDF content - ps (3) * (3) org.apache.tika.exception.TikaException: Unable to unpack document stream - pptx (10) * (9) org.apache.tika.exception.TikaException: Error creating OOXML extractor * (1) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@45df8db8 - doc (6) * (6) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@58797499 - ppt (14) * (13) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@58797499 * (1) org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.ParserDecorator$1@58797499 - xls (9) * (9) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@58797499 - vsd (3) * (3) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@58797499 - odp (2) * (2) org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.ParserDecorator$1@753ce4d8 - chm (1) * (1) org.apache.tika.exception.TikaException: CHM file extract error: extracted Length is wrong. - dwg (4) * (4) org.apache.tika.exception.TikaException: Unsupported AutoCAD drawing version: AC1014 - pps (2) * (2) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@58797499 - chw (1) * (1) org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.ParserDecorator$1@a0b8fca Thank Tyler, On Tue, Jan 6, 2015 at 7:59 AM, Tyler Palsulich wrote: > Hi All, > > A candidate for the Tika 1.7 release is available at: > https://dist.apache.org/repos/dist/dev/tika/ > > The release candidate is a zip archive of the sources in: > http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/ > > The SHA1 checksum of the archive is > 0307a8367ae6f8b1103824fd11337fd89e24e6a4. > > In addition, a staged maven repository is available here: > > > https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/ > > Please vote on releasing this package as Apache Tika 1.7. > > The vote is open for the next 72 hours and passes if a majority of at least > three +1 Tika PMC votes are cast. > > [ ] +1 Release this package as Apache Tika 1.7 > [ ] -1 Do not release this package because... > > Thanks! > Tyler > > P.S. Count this as my +1! > -- -- Hong-Thai
Re: [VOTE] Apache Tika 1.7 Release
-1 on this for me too as there is a small unit test failure from ODFParser on Windows from TIKA-1412. I have added the tweak to fix this on trunk. (I have also tested the latest changes added by Tim and Tyler in TIKA-1445 on Windows, Mac and Ubuntu with a decent batch of files, and everything is working nicely at this end.) On 7 January 2015 at 01:11, Allison, Timothy B. wrote: > -1 > > I'm sorry that I haven't had a chance to kick the tires on the recent > changes to the metadata extraction from images until now, but it looks like > 1.7-rc2 and trunk are not pulling metadata from embedded images. > > I've posted a test file from govdocs1 to TIKA-1445. I may have time > tomorrow to see what's going on. I should also have time tomorrow to > finish the analysis of the comparison between 1.6 and 1.7 on govdocs1. > > Sorry for my delay, all! And even greater apologies if user error is at > fault and metadata is successfully being extracted from embedded images. :) > > Thank you, Tyler, for running this release! > > > -Original Message- > From: Nick Burch [mailto:apa...@gagravarr.org] > Sent: Tuesday, January 06, 2015 11:36 AM > To: dev@tika.apache.org > Subject: Re: [VOTE] Apache Tika 1.7 Release > > On Tue, 6 Jan 2015, Tyler Palsulich wrote: > > A candidate for the Tika 1.7 release is available at: > >https://dist.apache.org/repos/dist/dev/tika/ > > > > The release candidate is a zip archive of the sources in: > >http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/ > > > > The SHA1 checksum of the archive is > >0307a8367ae6f8b1103824fd11337fd89e24e6a4. > > > > In addition, a staged maven repository is available here: > > > > > https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/ > > Looks good to me, I'm +1 > > Nick >
RE: [VOTE] Apache Tika 1.7 Release
-1 I'm sorry that I haven't had a chance to kick the tires on the recent changes to the metadata extraction from images until now, but it looks like 1.7-rc2 and trunk are not pulling metadata from embedded images. I've posted a test file from govdocs1 to TIKA-1445. I may have time tomorrow to see what's going on. I should also have time tomorrow to finish the analysis of the comparison between 1.6 and 1.7 on govdocs1. Sorry for my delay, all! And even greater apologies if user error is at fault and metadata is successfully being extracted from embedded images. :) Thank you, Tyler, for running this release! -Original Message- From: Nick Burch [mailto:apa...@gagravarr.org] Sent: Tuesday, January 06, 2015 11:36 AM To: dev@tika.apache.org Subject: Re: [VOTE] Apache Tika 1.7 Release On Tue, 6 Jan 2015, Tyler Palsulich wrote: > A candidate for the Tika 1.7 release is available at: >https://dist.apache.org/repos/dist/dev/tika/ > > The release candidate is a zip archive of the sources in: >http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/ > > The SHA1 checksum of the archive is >0307a8367ae6f8b1103824fd11337fd89e24e6a4. > > In addition, a staged maven repository is available here: > > https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/ Looks good to me, I'm +1 Nick
Re: [VOTE] Apache Tika 1.7 Release
On Tue, 6 Jan 2015, Tyler Palsulich wrote: A candidate for the Tika 1.7 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/ The SHA1 checksum of the archive is 0307a8367ae6f8b1103824fd11337fd89e24e6a4. In addition, a staged maven repository is available here: https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/ Looks good to me, I'm +1 Nick
[VOTE] Apache Tika 1.7 Release
Hi All, A candidate for the Tika 1.7 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/ The SHA1 checksum of the archive is 0307a8367ae6f8b1103824fd11337fd89e24e6a4. In addition, a staged maven repository is available here: https://repository.apache.org/content/repositories/orgapachetika-1006/org/apache/tika/ Please vote on releasing this package as Apache Tika 1.7. The vote is open for the next 72 hours and passes if a majority of at least three +1 Tika PMC votes are cast. [ ] +1 Release this package as Apache Tika 1.7 [ ] -1 Do not release this package because... Thanks! Tyler P.S. Count this as my +1!
Re: 1.7 release? | potential blocker?
Thanks, Nick! You were right. OK -- Technically, RC#1 is up at https://dist.apache.org/repos/dist/dev/tika/. > Should I also patch the rc1 branch or will you re-branch from trunk? I'll re-branch. Tyler On Mon, Jan 5, 2015 at 12:03 PM, Allison, Timothy B. wrote: > I'll patch trunk tonight (with null check, of course :)). Should I also > patch the rc1 branch or will you re-branch from trunk? > > -Original Message- > From: Tyler Palsulich [mailto:tpalsul...@gmail.com] > Sent: Monday, January 05, 2015 11:38 AM > To: dev@tika.apache.org > Subject: Re: 1.7 release? | potential blocker? > > Works for me. I got stalled midway through the process of getting RC#1 out > (authentication issues). But, going to try to finish it right now (best way > to upload to dist.apache.org? > http://www.apache.org/dev/release.html#upload-scp each file?). I won't > send > a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2. > > Sound good? > > Tyler > > On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. > wrote: > > > All, > > > > I think I may have found a problem with the interaction of > > OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7. > > > > If you use the AutoDetectParser instead of the OutlookPSTParser() in > > OutlookPSTParserTest: > > > > // OutlookPSTParser pstParser = new OutlookPSTParser(); > > Parser pstParser = new AutoDetectParser(); > > > > I'm seeing this exception: > > > > org.apache.tika.exception.TikaException: Failed to close temporary > > resources > > at > > > org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152) > > at > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127) > > > > Are others seeing this? > > > > I'll try to dig into this today, might not get to it until tomorrow. > > > > Best, > > > > Tim > > > > > > > > -Original Message- > > From: Tyler Palsulich [mailto:tpalsul...@gmail.com] > > Sent: Monday, December 22, 2014 1:58 PM > > To: dev@tika.apache.org > > Subject: Re: 1.7 release? > > > > Hi All, > > > > Nick added the temporary fix for TIKA-1445 and made the POI updates for > > TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for > 1.7! > > :) > > > > I'll start the process this weekend or a couple days into the new year. > > > > Cheers, > > Tyler > > On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" < > > chris.a.mattm...@jpl.nasa.gov> wrote: > > > > > +1 > > > > > > ++ > > > Chris Mattmann, Ph.D. > > > Chief Architect > > > Instrument Software and Science Data Systems Section (398) > > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > Office: 168-519, Mailstop: 168-527 > > > Email: chris.a.mattm...@nasa.gov > > > WWW: http://sunset.usc.edu/~mattmann/ > > > ++ > > > Adjunct Associate Professor, Computer Science Department > > > University of Southern California, Los Angeles, CA 90089 USA > > > ++ > > > > > > > > > > > > > > > > > > > > > -Original Message- > > > From: Tyler Palsulich > > > Reply-To: "dev@tika.apache.org" > > > Date: Thursday, December 18, 2014 at 9:15 PM > > > To: "dev@tika.apache.org" > > > Subject: Re: 1.7 release? > > > > > > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As > > > >Nick > > > >just recommended, I'll try adding metadata extraction to Tesseract > soon, > > > >then adding the extensible solution in 1.8. > > > > > > > >Tyler > > > > > > > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < > > > >chris.a.mattm...@jpl.nasa.gov> wrote: > > > >> > > > >> I haven’t tried my hand at it - been super busy. tyler if you have a > > > >> chance go for it, I think that’s the remaining blocker. > > > >> > > > >> ++++++++++ > > > >> Chris Mattmann, Ph.D. > > > >> Chief Architect > > > &g
RE: 1.7 release? | potential blocker?
I'll patch trunk tonight (with null check, of course :)). Should I also patch the rc1 branch or will you re-branch from trunk? -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Monday, January 05, 2015 11:38 AM To: dev@tika.apache.org Subject: Re: 1.7 release? | potential blocker? Works for me. I got stalled midway through the process of getting RC#1 out (authentication issues). But, going to try to finish it right now (best way to upload to dist.apache.org? http://www.apache.org/dev/release.html#upload-scp each file?). I won't send a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2. Sound good? Tyler On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. wrote: > All, > > I think I may have found a problem with the interaction of > OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7. > > If you use the AutoDetectParser instead of the OutlookPSTParser() in > OutlookPSTParserTest: > > // OutlookPSTParser pstParser = new OutlookPSTParser(); > Parser pstParser = new AutoDetectParser(); > > I'm seeing this exception: > > org.apache.tika.exception.TikaException: Failed to close temporary > resources > at > org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152) > at > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127) > > Are others seeing this? > > I'll try to dig into this today, might not get to it until tomorrow. > > Best, > > Tim > > > > -Original Message- > From: Tyler Palsulich [mailto:tpalsul...@gmail.com] > Sent: Monday, December 22, 2014 1:58 PM > To: dev@tika.apache.org > Subject: Re: 1.7 release? > > Hi All, > > Nick added the temporary fix for TIKA-1445 and made the POI updates for > TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7! > :) > > I'll start the process this weekend or a couple days into the new year. > > Cheers, > Tyler > On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" < > chris.a.mattm...@jpl.nasa.gov> wrote: > > > +1 > > > > ++ > > Chris Mattmann, Ph.D. > > Chief Architect > > Instrument Software and Science Data Systems Section (398) > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > Office: 168-519, Mailstop: 168-527 > > Email: chris.a.mattm...@nasa.gov > > WWW: http://sunset.usc.edu/~mattmann/ > > ++ > > Adjunct Associate Professor, Computer Science Department > > University of Southern California, Los Angeles, CA 90089 USA > > ++ > > > > > > > > > > > > > > -Original Message- > > From: Tyler Palsulich > > Reply-To: "dev@tika.apache.org" > > Date: Thursday, December 18, 2014 at 9:15 PM > > To: "dev@tika.apache.org" > > Subject: Re: 1.7 release? > > > > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As > > >Nick > > >just recommended, I'll try adding metadata extraction to Tesseract soon, > > >then adding the extensible solution in 1.8. > > > > > >Tyler > > > > > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < > > >chris.a.mattm...@jpl.nasa.gov> wrote: > > >> > > >> I haven’t tried my hand at it - been super busy. tyler if you have a > > >> chance go for it, I think that’s the remaining blocker. > > >> > > >> ++ > > >> Chris Mattmann, Ph.D. > > >> Chief Architect > > >> Instrument Software and Science Data Systems Section (398) > > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > >> Office: 168-519, Mailstop: 168-527 > > >> Email: chris.a.mattm...@nasa.gov > > >> WWW: http://sunset.usc.edu/~mattmann/ > > >> ++ > > >> Adjunct Associate Professor, Computer Science Department > > >> University of Southern California, Los Angeles, CA 90089 USA > > >> ++ > > >> > > >> > > >> > > >> > > >> > > >> > > >> -Original Message- > > >> From: Tyler Palsulich > > >>
Re: 1.7 release? | potential blocker?
On Mon, 5 Jan 2015, Tyler Palsulich wrote: Works for me. I got stalled midway through the process of getting RC#1 out (authentication issues). But, going to try to finish it right now (best way to upload to dist.apache.org? That's a svn checkout For the RC, assuming it's the same process as for Apache POI, you checkout https://dist.apache.org/repos/dist/dev/tika and put the files there Then, if the vote passes, you svn mv them to https://dist.apache.org/repos/dist/release/tika/ + upload things to maven central Nick
Re: 1.7 release? | potential blocker?
Works for me. I got stalled midway through the process of getting RC#1 out (authentication issues). But, going to try to finish it right now (best way to upload to dist.apache.org? http://www.apache.org/dev/release.html#upload-scp each file?). I won't send a VOTE for RC#1, though -- I'll wait for Tim's patch then send an RC#2. Sound good? Tyler On Mon, Jan 5, 2015 at 8:09 AM, Allison, Timothy B. wrote: > All, > > I think I may have found a problem with the interaction of > OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7. > > If you use the AutoDetectParser instead of the OutlookPSTParser() in > OutlookPSTParserTest: > > // OutlookPSTParser pstParser = new OutlookPSTParser(); > Parser pstParser = new AutoDetectParser(); > > I'm seeing this exception: > > org.apache.tika.exception.TikaException: Failed to close temporary > resources > at > org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152) > at > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127) > > Are others seeing this? > > I'll try to dig into this today, might not get to it until tomorrow. > > Best, > > Tim > > > > -Original Message- > From: Tyler Palsulich [mailto:tpalsul...@gmail.com] > Sent: Monday, December 22, 2014 1:58 PM > To: dev@tika.apache.org > Subject: Re: 1.7 release? > > Hi All, > > Nick added the temporary fix for TIKA-1445 and made the POI updates for > TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7! > :) > > I'll start the process this weekend or a couple days into the new year. > > Cheers, > Tyler > On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" < > chris.a.mattm...@jpl.nasa.gov> wrote: > > > +1 > > > > ++ > > Chris Mattmann, Ph.D. > > Chief Architect > > Instrument Software and Science Data Systems Section (398) > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > Office: 168-519, Mailstop: 168-527 > > Email: chris.a.mattm...@nasa.gov > > WWW: http://sunset.usc.edu/~mattmann/ > > ++ > > Adjunct Associate Professor, Computer Science Department > > University of Southern California, Los Angeles, CA 90089 USA > > ++++++ > > > > > > > > > > > > > > -Original Message- > > From: Tyler Palsulich > > Reply-To: "dev@tika.apache.org" > > Date: Thursday, December 18, 2014 at 9:15 PM > > To: "dev@tika.apache.org" > > Subject: Re: 1.7 release? > > > > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As > > >Nick > > >just recommended, I'll try adding metadata extraction to Tesseract soon, > > >then adding the extensible solution in 1.8. > > > > > >Tyler > > > > > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < > > >chris.a.mattm...@jpl.nasa.gov> wrote: > > >> > > >> I haven’t tried my hand at it - been super busy. tyler if you have a > > >> chance go for it, I think that’s the remaining blocker. > > >> > > >> ++ > > >> Chris Mattmann, Ph.D. > > >> Chief Architect > > >> Instrument Software and Science Data Systems Section (398) > > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > >> Office: 168-519, Mailstop: 168-527 > > >> Email: chris.a.mattm...@nasa.gov > > >> WWW: http://sunset.usc.edu/~mattmann/ > > >> ++ > > >> Adjunct Associate Professor, Computer Science Department > > >> University of Southern California, Los Angeles, CA 90089 USA > > >> ++ > > >> > > >> > > >> > > >> > > >> > > >> > > >> -Original Message- > > >> From: Tyler Palsulich > > >> Reply-To: "dev@tika.apache.org" > > >> Date: Thursday, December 18, 2014 at 12:54 PM > > >> To: "dev@tika.apache.org" > > >> Subject: Re: 1.7 release? > > >> > > >> >Hi
RE: 1.7 release? | potential blocker?
All, I think I may have found a problem with the interaction of OutlookPSTParser with AutoDetectParser that I'd want to fix before 1.7. If you use the AutoDetectParser instead of the OutlookPSTParser() in OutlookPSTParserTest: // OutlookPSTParser pstParser = new OutlookPSTParser(); Parser pstParser = new AutoDetectParser(); I'm seeing this exception: org.apache.tika.exception.TikaException: Failed to close temporary resources at org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127) Are others seeing this? I'll try to dig into this today, might not get to it until tomorrow. Best, Tim -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Monday, December 22, 2014 1:58 PM To: dev@tika.apache.org Subject: Re: 1.7 release? Hi All, Nick added the temporary fix for TIKA-1445 and made the POI updates for TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7! :) I'll start the process this weekend or a couple days into the new year. Cheers, Tyler On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > +1 > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Tyler Palsulich > Reply-To: "dev@tika.apache.org" > Date: Thursday, December 18, 2014 at 9:15 PM > To: "dev@tika.apache.org" > Subject: Re: 1.7 release? > > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As > >Nick > >just recommended, I'll try adding metadata extraction to Tesseract soon, > >then adding the extensible solution in 1.8. > > > >Tyler > > > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < > >chris.a.mattm...@jpl.nasa.gov> wrote: > >> > >> I haven’t tried my hand at it - been super busy. tyler if you have a > >> chance go for it, I think that’s the remaining blocker. > >> > >> ++ > >> Chris Mattmann, Ph.D. > >> Chief Architect > >> Instrument Software and Science Data Systems Section (398) > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >> Office: 168-519, Mailstop: 168-527 > >> Email: chris.a.mattm...@nasa.gov > >> WWW: http://sunset.usc.edu/~mattmann/ > >> ++ > >> Adjunct Associate Professor, Computer Science Department > >> University of Southern California, Los Angeles, CA 90089 USA > >> ++ > >> > >> > >> > >> > >> > >> > >> -Original Message- > >> From: Tyler Palsulich > >> Reply-To: "dev@tika.apache.org" > >> Date: Thursday, December 18, 2014 at 12:54 PM > >> To: "dev@tika.apache.org" > >> Subject: Re: 1.7 release? > >> > >> >Hi All, > >> > > >> >It's been a few months, so I just want to follow up on this thread. > >>We've > >> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as > >> >1.7 > >> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with > >>TIKA-1445? > >> >Has anyone tried their hand at the suggested (significant) fix? > >> > > >> >Are there any other issues someone would like to fit in? > >> > > >> >Cheers, > >> >Tyler > >> > > >> >[0] - > >> > > >> > >> > https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select > >>e > >> >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel > >> > > >> >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) < > >> >chris.a.mattm...@jpl.nasa.gov> wrote: > >> >> > >>
Re: 1.7 release?
> On 22 Dec 2014, at 18:57, Tyler Palsulich wrote: > > Hi All, > > Nick added the temporary fix for TIKA-1445 and made the POI updates for > TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7! > :) > > I'll start the process this weekend or a couple days into the new year. Nice one Tyler! Cheers, Dave
Re: 1.7 release?
+1 for going. Many thanks to Tyler and to Nick to take the POI upgrade. So many christmas gifts in advance or just after :-) Merry christmas to all 2014-12-22 19:59 GMT+01:00 Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov>: > WOOO HOO! Go Tyler go! :0) Merry Christmas bud. > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Tyler Palsulich > Reply-To: "dev@tika.apache.org" > Date: Monday, December 22, 2014 at 10:57 AM > To: "dev@tika.apache.org" > Subject: Re: 1.7 release? > > >Hi All, > > > >Nick added the temporary fix for TIKA-1445 and made the POI updates for > >TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for > >1.7! > >:) > > > >I'll start the process this weekend or a couple days into the new year. > > > >Cheers, > >Tyler > >On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" < > >chris.a.mattm...@jpl.nasa.gov> wrote: > > > >> +1 > >> > >> ++ > >> Chris Mattmann, Ph.D. > >> Chief Architect > >> Instrument Software and Science Data Systems Section (398) > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >> Office: 168-519, Mailstop: 168-527 > >> Email: chris.a.mattm...@nasa.gov > >> WWW: http://sunset.usc.edu/~mattmann/ > >> ++ > >> Adjunct Associate Professor, Computer Science Department > >> University of Southern California, Los Angeles, CA 90089 USA > >> ++ > >> > >> > >> > >> > >> > >> > >> -Original Message- > >> From: Tyler Palsulich > >> Reply-To: "dev@tika.apache.org" > >> Date: Thursday, December 18, 2014 at 9:15 PM > >> To: "dev@tika.apache.org" > >> Subject: Re: 1.7 release? > >> > >> >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As > >> >Nick > >> >just recommended, I'll try adding metadata extraction to Tesseract > >>soon, > >> >then adding the extensible solution in 1.8. > >> > > >> >Tyler > >> > > >> >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < > >> >chris.a.mattm...@jpl.nasa.gov> wrote: > >> >> > >> >> I haven’t tried my hand at it - been super busy. tyler if you have a > >> >> chance go for it, I think that’s the remaining blocker. > >> >> > >> >> ++ > >> >> Chris Mattmann, Ph.D. > >> >> Chief Architect > >> >> Instrument Software and Science Data Systems Section (398) > >> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >> >> Office: 168-519, Mailstop: 168-527 > >> >> Email: chris.a.mattm...@nasa.gov > >> >> WWW: http://sunset.usc.edu/~mattmann/ > >> >> ++ > >> >> Adjunct Associate Professor, Computer Science Department > >> >> University of Southern California, Los Angeles, CA 90089 USA > >> >> ++ > >> >> > >> >> > >> >> > >> >> > >> >> > >> >> > >> >> -Original Message- > >> >> From: Tyler Palsulich > >> >> Reply-To: "dev@tika.apache.org" > >> >> Date: Thursday, December 18, 2014 at 12:54 PM > >> >> To: "dev@tika.apache.org" > >> >> Subject: Re: 1.7 release? > >> >> > >> >> >Hi All, > >> >> > > >>
Re: 1.7 release?
WOOO HOO! Go Tyler go! :0) Merry Christmas bud. ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Tyler Palsulich Reply-To: "dev@tika.apache.org" Date: Monday, December 22, 2014 at 10:57 AM To: "dev@tika.apache.org" Subject: Re: 1.7 release? >Hi All, > >Nick added the temporary fix for TIKA-1445 and made the POI updates for >TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for >1.7! >:) > >I'll start the process this weekend or a couple days into the new year. > >Cheers, >Tyler >On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" < >chris.a.mattm...@jpl.nasa.gov> wrote: > >> +1 >> >> ++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> ++ >> >> >> >> >> >> >> -Original Message- >> From: Tyler Palsulich >> Reply-To: "dev@tika.apache.org" >> Date: Thursday, December 18, 2014 at 9:15 PM >> To: "dev@tika.apache.org" >> Subject: Re: 1.7 release? >> >> >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As >> >Nick >> >just recommended, I'll try adding metadata extraction to Tesseract >>soon, >> >then adding the extensible solution in 1.8. >> > >> >Tyler >> > >> >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < >> >chris.a.mattm...@jpl.nasa.gov> wrote: >> >> >> >> I haven’t tried my hand at it - been super busy. tyler if you have a >> >> chance go for it, I think that’s the remaining blocker. >> >> >> >> ++ >> >> Chris Mattmann, Ph.D. >> >> Chief Architect >> >> Instrument Software and Science Data Systems Section (398) >> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> >> Office: 168-519, Mailstop: 168-527 >> >> Email: chris.a.mattm...@nasa.gov >> >> WWW: http://sunset.usc.edu/~mattmann/ >> >> ++ >> >> Adjunct Associate Professor, Computer Science Department >> >> University of Southern California, Los Angeles, CA 90089 USA >> >> ++ >> >> >> >> >> >> >> >> >> >> >> >> >> >> -Original Message- >> >> From: Tyler Palsulich >> >> Reply-To: "dev@tika.apache.org" >> >> Date: Thursday, December 18, 2014 at 12:54 PM >> >> To: "dev@tika.apache.org" >> >> Subject: Re: 1.7 release? >> >> >> >> >Hi All, >> >> > >> >> >It's been a few months, so I just want to follow up on this thread. >> >>We've >> >> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA >>marked as >> >> >1.7 >> >> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with >> >>TIKA-1445? >> >> >Has anyone tried their hand at the suggested (significant) fix? >> >> > >> >> >Are there any other issues someone would like to fit in? >> >> > >> >> >Cheers, >> >> >Tyler >> >> > >> >>
Re: 1.7 release?
Hi All, Nick added the temporary fix for TIKA-1445 and made the POI updates for TIKA-1469 (thanks!). And, I'll volunteer to be the Release Manager for 1.7! :) I'll start the process this weekend or a couple days into the new year. Cheers, Tyler On Dec 18, 2014 9:45 PM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > +1 > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Tyler Palsulich > Reply-To: "dev@tika.apache.org" > Date: Thursday, December 18, 2014 at 9:15 PM > To: "dev@tika.apache.org" > Subject: Re: 1.7 release? > > >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As > >Nick > >just recommended, I'll try adding metadata extraction to Tesseract soon, > >then adding the extensible solution in 1.8. > > > >Tyler > > > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < > >chris.a.mattm...@jpl.nasa.gov> wrote: > >> > >> I haven’t tried my hand at it - been super busy. tyler if you have a > >> chance go for it, I think that’s the remaining blocker. > >> > >> ++ > >> Chris Mattmann, Ph.D. > >> Chief Architect > >> Instrument Software and Science Data Systems Section (398) > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >> Office: 168-519, Mailstop: 168-527 > >> Email: chris.a.mattm...@nasa.gov > >> WWW: http://sunset.usc.edu/~mattmann/ > >> ++ > >> Adjunct Associate Professor, Computer Science Department > >> University of Southern California, Los Angeles, CA 90089 USA > >> ++ > >> > >> > >> > >> > >> > >> > >> -Original Message- > >> From: Tyler Palsulich > >> Reply-To: "dev@tika.apache.org" > >> Date: Thursday, December 18, 2014 at 12:54 PM > >> To: "dev@tika.apache.org" > >> Subject: Re: 1.7 release? > >> > >> >Hi All, > >> > > >> >It's been a few months, so I just want to follow up on this thread. > >>We've > >> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as > >> >1.7 > >> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with > >>TIKA-1445? > >> >Has anyone tried their hand at the suggested (significant) fix? > >> > > >> >Are there any other issues someone would like to fit in? > >> > > >> >Cheers, > >> >Tyler > >> > > >> >[0] - > >> > > >> > >> > https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select > >>e > >> >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel > >> > > >> >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) < > >> >chris.a.mattm...@jpl.nasa.gov> wrote: > >> >> > >> >> Thanks Tim saw your patch and am looking now. > >> >> > >> >> ++ > >> >> Chris Mattmann, Ph.D. > >> >> Chief Architect > >> >> Instrument Software and Science Data Systems Section (398) > >> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >> >> Office: 168-519, Mailstop: 168-527 > >> >> Email: chris.a.mattm...@nasa.gov > >> >> WWW: http://sunset.usc.edu/~mattmann/ > >> >> ++ > >> >> Adjunct Associate Professor, Computer Science Department > >> >> University of Southern California, Los Angeles, CA 90089 USA > >> >>
Re: 1.7 release?
+1 ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Tyler Palsulich Reply-To: "dev@tika.apache.org" Date: Thursday, December 18, 2014 at 9:15 PM To: "dev@tika.apache.org" Subject: Re: 1.7 release? >I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As >Nick >just recommended, I'll try adding metadata extraction to Tesseract soon, >then adding the extensible solution in 1.8. > >Tyler > >On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < >chris.a.mattm...@jpl.nasa.gov> wrote: >> >> I haven’t tried my hand at it - been super busy. tyler if you have a >> chance go for it, I think that’s the remaining blocker. >> >> ++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> ++ >> >> >> >> >> >> >> -Original Message- >> From: Tyler Palsulich >> Reply-To: "dev@tika.apache.org" >> Date: Thursday, December 18, 2014 at 12:54 PM >> To: "dev@tika.apache.org" >> Subject: Re: 1.7 release? >> >> >Hi All, >> > >> >It's been a few months, so I just want to follow up on this thread. >>We've >> >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as >> >1.7 >> >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with >>TIKA-1445? >> >Has anyone tried their hand at the suggested (significant) fix? >> > >> >Are there any other issues someone would like to fit in? >> > >> >Cheers, >> >Tyler >> > >> >[0] - >> > >> >>https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?select >>e >> >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel >> > >> >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) < >> >chris.a.mattm...@jpl.nasa.gov> wrote: >> >> >> >> Thanks Tim saw your patch and am looking now. >> >> >> >> ++ >> >> Chris Mattmann, Ph.D. >> >> Chief Architect >> >> Instrument Software and Science Data Systems Section (398) >> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> >> Office: 168-519, Mailstop: 168-527 >> >> Email: chris.a.mattm...@nasa.gov >> >> WWW: http://sunset.usc.edu/~mattmann/ >> >> ++ >> >> Adjunct Associate Professor, Computer Science Department >> >> University of Southern California, Los Angeles, CA 90089 USA >> >> ++ >> >> >> >> >> >> >> >> >> >> >> >> >> >> -Original Message- >> >> From: , "Timothy B." >> >> Reply-To: "dev@tika.apache.org" >> >> Date: Monday, October 27, 2014 at 12:30 PM >> >> To: "dev@tika.apache.org" >> >> Subject: RE: 1.7 release? >> >> >> >> >Sounds good. As long as the default behavior remains the same, I'm >> >> >happy. I'm going to play with a combination of your patch and >>Tyler's >> >> >and see what the ramifications are for embedded docs. >> >> > >> >> >To confirm, the OCR integration is fantastic. Thank you and Tyler! >> >> > >> >&g
Re: 1.7 release?
I'm OK with trying the fix in 1.8 (or 1.7 if people feel strongly). As Nick just recommended, I'll try adding metadata extraction to Tesseract soon, then adding the extensible solution in 1.8. Tyler On Thu, Dec 18, 2014 at 11:58 PM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > > I haven’t tried my hand at it - been super busy. tyler if you have a > chance go for it, I think that’s the remaining blocker. > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Tyler Palsulich > Reply-To: "dev@tika.apache.org" > Date: Thursday, December 18, 2014 at 12:54 PM > To: "dev@tika.apache.org" > Subject: Re: 1.7 release? > > >Hi All, > > > >It's been a few months, so I just want to follow up on this thread. We've > >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as > >1.7 > >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445? > >Has anyone tried their hand at the suggested (significant) fix? > > > >Are there any other issues someone would like to fit in? > > > >Cheers, > >Tyler > > > >[0] - > > > https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selecte > >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel > > > >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) < > >chris.a.mattm...@jpl.nasa.gov> wrote: > >> > >> Thanks Tim saw your patch and am looking now. > >> > >> ++ > >> Chris Mattmann, Ph.D. > >> Chief Architect > >> Instrument Software and Science Data Systems Section (398) > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >> Office: 168-519, Mailstop: 168-527 > >> Email: chris.a.mattm...@nasa.gov > >> WWW: http://sunset.usc.edu/~mattmann/ > >> ++ > >> Adjunct Associate Professor, Computer Science Department > >> University of Southern California, Los Angeles, CA 90089 USA > >> ++ > >> > >> > >> > >> > >> > >> > >> -Original Message- > >> From: , "Timothy B." > >> Reply-To: "dev@tika.apache.org" > >> Date: Monday, October 27, 2014 at 12:30 PM > >> To: "dev@tika.apache.org" > >> Subject: RE: 1.7 release? > >> > >> >Sounds good. As long as the default behavior remains the same, I'm > >> >happy. I'm going to play with a combination of your patch and Tyler's > >> >and see what the ramifications are for embedded docs. > >> > > >> >To confirm, the OCR integration is fantastic. Thank you and Tyler! > >> > > >> > > >> >Best, > >> > > >> > Tim > >> > > >> >-Original Message- > >> >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] > >> >Sent: Friday, October 24, 2014 5:36 PM > >> >To: dev@tika.apache.org > >> >Subject: Re: 1.7 release? > >> > > >> >Hey Tim, > >> > > >> >What do you think about my existing patch for 1445? For example to > >> >just call all the parsers? I thought I was seeing behavior that was > >> >slow because of that, but it turned out to be Tesseract and my machine > >> >at the time? > >> > > >> >I think my patch for 1445 may be enough, and we should get the metadata > >> >I think? Thoughts? > >> > > >> >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll > >> >even take it upon myself to try and experiment with the idea of > >>multip
Re: 1.7 release?
I haven’t tried my hand at it - been super busy. tyler if you have a chance go for it, I think that’s the remaining blocker. ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Tyler Palsulich Reply-To: "dev@tika.apache.org" Date: Thursday, December 18, 2014 at 12:54 PM To: "dev@tika.apache.org" Subject: Re: 1.7 release? >Hi All, > >It's been a few months, so I just want to follow up on this thread. We've >resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as >1.7 >(TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445? >Has anyone tried their hand at the suggested (significant) fix? > >Are there any other issues someone would like to fit in? > >Cheers, >Tyler > >[0] - >https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selecte >dTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel > >On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) < >chris.a.mattm...@jpl.nasa.gov> wrote: >> >> Thanks Tim saw your patch and am looking now. >> >> ++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> ++ >> >> >> >> >> >> >> -Original Message- >> From: , "Timothy B." >> Reply-To: "dev@tika.apache.org" >> Date: Monday, October 27, 2014 at 12:30 PM >> To: "dev@tika.apache.org" >> Subject: RE: 1.7 release? >> >> >Sounds good. As long as the default behavior remains the same, I'm >> >happy. I'm going to play with a combination of your patch and Tyler's >> >and see what the ramifications are for embedded docs. >> > >> >To confirm, the OCR integration is fantastic. Thank you and Tyler! >> > >> > >> >Best, >> > >> > Tim >> > >> >-Original Message- >> >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] >> >Sent: Friday, October 24, 2014 5:36 PM >> >To: dev@tika.apache.org >> >Subject: Re: 1.7 release? >> > >> >Hey Tim, >> > >> >What do you think about my existing patch for 1445? For example to >> >just call all the parsers? I thought I was seeing behavior that was >> >slow because of that, but it turned out to be Tesseract and my machine >> >at the time? >> > >> >I think my patch for 1445 may be enough, and we should get the metadata >> >I think? Thoughts? >> > >> >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll >> >even take it upon myself to try and experiment with the idea of >>multiple >> >parsers being called. I think a simple solution to the metadata key >> >conflict issue is simply to have a policy to add values (by default) >>and >> >replace if a property is set in ParseContext. Some simple updates to >> >CompositeParser would allow this. >> > >> >Thoughts? >> > >> >Cheers, >> >Chris >> > >> > >> >++ >> >Chris Mattmann, Ph.D. >> >Chief Architect >> >Instrument Software and Science Data Systems Section (398) >> >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> >Office: 168-519, Mailstop: 168-527 >> >Email: chris.a.mattm...@nasa.gov >> >WWW: http://sunset.usc.edu/~mattmann/ >> >++ >> &
Re: 1.7 release?
Hi, it might be worth waiting until POI 3.11-FINAL is released so that the TIKA release do not depend on a beta version. It's due on Sunday, corrects a lot of old office parsing and just needs the patch in TIKA-1469 to properly work. Regards Thomas 2014-12-18 21:54 GMT+01:00 Tyler Palsulich : > > Hi All, > > It's been a few months, so I just want to follow up on this thread. We've > resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as 1.7 > (TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445? > Has anyone tried their hand at the suggested (significant) fix? > > Are there any other issues someone would like to fit in? > > Cheers, > Tyler > > [0] - > > https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selectedTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel > > On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) < > chris.a.mattm...@jpl.nasa.gov> wrote: > > > > Thanks Tim saw your patch and am looking now. > > > > ++ > > Chris Mattmann, Ph.D. > > Chief Architect > > Instrument Software and Science Data Systems Section (398) > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > Office: 168-519, Mailstop: 168-527 > > Email: chris.a.mattm...@nasa.gov > > WWW: http://sunset.usc.edu/~mattmann/ > > ++ > > Adjunct Associate Professor, Computer Science Department > > University of Southern California, Los Angeles, CA 90089 USA > > ++ > > > > > > > > > > > > > > -Original Message- > > From: , "Timothy B." > > Reply-To: "dev@tika.apache.org" > > Date: Monday, October 27, 2014 at 12:30 PM > > To: "dev@tika.apache.org" > > Subject: RE: 1.7 release? > > > > >Sounds good. As long as the default behavior remains the same, I'm > > >happy. I'm going to play with a combination of your patch and Tyler's > > >and see what the ramifications are for embedded docs. > > > > > >To confirm, the OCR integration is fantastic. Thank you and Tyler! > > > > > > > > >Best, > > > > > > Tim > > > > > >-Original Message- > > >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] > > >Sent: Friday, October 24, 2014 5:36 PM > > >To: dev@tika.apache.org > > >Subject: Re: 1.7 release? > > > > > >Hey Tim, > > > > > >What do you think about my existing patch for 1445? For example to > > >just call all the parsers? I thought I was seeing behavior that was > > >slow because of that, but it turned out to be Tesseract and my machine > > >at the time? > > > > > >I think my patch for 1445 may be enough, and we should get the metadata > > >I think? Thoughts? > > > > > >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll > > >even take it upon myself to try and experiment with the idea of multiple > > >parsers being called. I think a simple solution to the metadata key > > >conflict issue is simply to have a policy to add values (by default) and > > >replace if a property is set in ParseContext. Some simple updates to > > >CompositeParser would allow this. > > > > > >Thoughts? > > > > > >Cheers, > > >Chris > > > > > > > > >++ > > >Chris Mattmann, Ph.D. > > >Chief Architect > > >Instrument Software and Science Data Systems Section (398) > > >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > >Office: 168-519, Mailstop: 168-527 > > >Email: chris.a.mattm...@nasa.gov > > >WWW: http://sunset.usc.edu/~mattmann/ > > >++ > > >Adjunct Associate Professor, Computer Science Department > > >University of Southern California, Los Angeles, CA 90089 USA > > >++ > > > > > > > > > > > > > > > > > > > > >-Original Message- > > >From: , "Timothy B." > > >Reply-To: "dev@tika.apache.org" > > >Date: Friday, October 24, 2014 at 2:24 PM > > >To: "
Re: 1.7 release?
Hi All, It's been a few months, so I just want to follow up on this thread. We've resolved/closed 51 issues for v1.7 [0]. There are two on JIRA marked as 1.7 (TIKA-1465 and TIKA-894). Do we still want to aim for 1.7 with TIKA-1445? Has anyone tried their hand at the suggested (significant) fix? Are there any other issues someone would like to fit in? Cheers, Tyler [0] - https://issues.apache.org/jira/browse/TIKA/fixforversion/12327096/?selectedTab=com.atlassian.jira.jira-projects-plugin:version-issues-panel On Tue, Oct 28, 2014 at 1:46 AM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > > Thanks Tim saw your patch and am looking now. > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: , "Timothy B." > Reply-To: "dev@tika.apache.org" > Date: Monday, October 27, 2014 at 12:30 PM > To: "dev@tika.apache.org" > Subject: RE: 1.7 release? > > >Sounds good. As long as the default behavior remains the same, I'm > >happy. I'm going to play with a combination of your patch and Tyler's > >and see what the ramifications are for embedded docs. > > > >To confirm, the OCR integration is fantastic. Thank you and Tyler! > > > > > >Best, > > > > Tim > > > >-Original Message- > >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] > >Sent: Friday, October 24, 2014 5:36 PM > >To: dev@tika.apache.org > >Subject: Re: 1.7 release? > > > >Hey Tim, > > > >What do you think about my existing patch for 1445? For example to > >just call all the parsers? I thought I was seeing behavior that was > >slow because of that, but it turned out to be Tesseract and my machine > >at the time? > > > >I think my patch for 1445 may be enough, and we should get the metadata > >I think? Thoughts? > > > >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll > >even take it upon myself to try and experiment with the idea of multiple > >parsers being called. I think a simple solution to the metadata key > >conflict issue is simply to have a policy to add values (by default) and > >replace if a property is set in ParseContext. Some simple updates to > >CompositeParser would allow this. > > > >Thoughts? > > > >Cheers, > >Chris > > > > > >++ > >Chris Mattmann, Ph.D. > >Chief Architect > >Instrument Software and Science Data Systems Section (398) > >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >Office: 168-519, Mailstop: 168-527 > >Email: chris.a.mattm...@nasa.gov > >WWW: http://sunset.usc.edu/~mattmann/ > >++++++++++ > >Adjunct Associate Professor, Computer Science Department > >University of Southern California, Los Angeles, CA 90089 USA > >++ > > > > > > > > > > > > > >-Original Message- > >From: , "Timothy B." > >Reply-To: "dev@tika.apache.org" > >Date: Friday, October 24, 2014 at 2:24 PM > >To: "dev@tika.apache.org" > >Subject: RE: 1.7 release? > > > >>Sorry for coming late to the game on the implications of TIKA-1445. I > >>don't want to hold up the release of 1.7. > >> > >>However, would it be possible to return to the legacy default behavior of > >>extracting metadata from images? > >> > >>We can then document on the OCR parser page on the wiki that you need to > >>install Tesseract _and_ make a change in the parser/mime config file. If > >>you want this new capability, it will take a small bit of work until we > >>solve TIKA-1445. > >> > >>I worry that the current behavior of 1.7 would be surprising to most > >>non-dev users (well, even to at least one dev :) ). > >> > >&g
Re: 1.7 release?
Thanks Tim saw your patch and am looking now. ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: , "Timothy B." Reply-To: "dev@tika.apache.org" Date: Monday, October 27, 2014 at 12:30 PM To: "dev@tika.apache.org" Subject: RE: 1.7 release? >Sounds good. As long as the default behavior remains the same, I'm >happy. I'm going to play with a combination of your patch and Tyler's >and see what the ramifications are for embedded docs. > >To confirm, the OCR integration is fantastic. Thank you and Tyler! > > >Best, > > Tim > >-Original Message- >From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] >Sent: Friday, October 24, 2014 5:36 PM >To: dev@tika.apache.org >Subject: Re: 1.7 release? > >Hey Tim, > >What do you think about my existing patch for 1445? For example to >just call all the parsers? I thought I was seeing behavior that was >slow because of that, but it turned out to be Tesseract and my machine >at the time? > >I think my patch for 1445 may be enough, and we should get the metadata >I think? Thoughts? > >I honestly think we need to deliver Tesseract in 1.7. We're close. I'll >even take it upon myself to try and experiment with the idea of multiple >parsers being called. I think a simple solution to the metadata key >conflict issue is simply to have a policy to add values (by default) and >replace if a property is set in ParseContext. Some simple updates to >CompositeParser would allow this. > >Thoughts? > >Cheers, >Chris > > >++ >Chris Mattmann, Ph.D. >Chief Architect >Instrument Software and Science Data Systems Section (398) >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >Office: 168-519, Mailstop: 168-527 >Email: chris.a.mattm...@nasa.gov >WWW: http://sunset.usc.edu/~mattmann/ >++ >Adjunct Associate Professor, Computer Science Department >University of Southern California, Los Angeles, CA 90089 USA >++++++++++ > > > > > > >-Original Message- >From: , "Timothy B." >Reply-To: "dev@tika.apache.org" >Date: Friday, October 24, 2014 at 2:24 PM >To: "dev@tika.apache.org" >Subject: RE: 1.7 release? > >>Sorry for coming late to the game on the implications of TIKA-1445. I >>don't want to hold up the release of 1.7. >> >>However, would it be possible to return to the legacy default behavior of >>extracting metadata from images? >> >>We can then document on the OCR parser page on the wiki that you need to >>install Tesseract _and_ make a change in the parser/mime config file. If >>you want this new capability, it will take a small bit of work until we >>solve TIKA-1445. >> >>I worry that the current behavior of 1.7 would be surprising to most >>non-dev users (well, even to at least one dev :) ). >> >>Cheers, >> >> Tim >> >> >>From: Oleg Tikhonov [olegtikho...@gmail.com] >>Sent: Friday, October 24, 2014 2:24 PM >>To: dev@tika.apache.org >>Subject: Re: 1.7 release? >> >>Hi Tyler, >>don't mention. >> >>Cheers, >>Oleg >>On Oct 24, 2014 8:02 PM, "Tyler Palsulich" wrote: >> >>> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there >>>any >>> other issues anyone would like to resolve before a new release? >>> >>> Thanks, >>> Tyler >>> >>> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov >>> wrote: >>> >>> > Sorry!!! >>> > >>> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) < >>> > chris.a.mattm...@jpl.nasa.gov> wrote: >>> > >>> > > Thanks Oleg, will try tomorrow for me Los angeles time! >>> > > >>> > > +++
RE: 1.7 release?
Sounds good. As long as the default behavior remains the same, I'm happy. I'm going to play with a combination of your patch and Tyler's and see what the ramifications are for embedded docs. To confirm, the OCR integration is fantastic. Thank you and Tyler! Best, Tim -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Friday, October 24, 2014 5:36 PM To: dev@tika.apache.org Subject: Re: 1.7 release? Hey Tim, What do you think about my existing patch for 1445? For example to just call all the parsers? I thought I was seeing behavior that was slow because of that, but it turned out to be Tesseract and my machine at the time? I think my patch for 1445 may be enough, and we should get the metadata I think? Thoughts? I honestly think we need to deliver Tesseract in 1.7. We're close. I'll even take it upon myself to try and experiment with the idea of multiple parsers being called. I think a simple solution to the metadata key conflict issue is simply to have a policy to add values (by default) and replace if a property is set in ParseContext. Some simple updates to CompositeParser would allow this. Thoughts? Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: , "Timothy B." Reply-To: "dev@tika.apache.org" Date: Friday, October 24, 2014 at 2:24 PM To: "dev@tika.apache.org" Subject: RE: 1.7 release? >Sorry for coming late to the game on the implications of TIKA-1445. I >don't want to hold up the release of 1.7. > >However, would it be possible to return to the legacy default behavior of >extracting metadata from images? > >We can then document on the OCR parser page on the wiki that you need to >install Tesseract _and_ make a change in the parser/mime config file. If >you want this new capability, it will take a small bit of work until we >solve TIKA-1445. > >I worry that the current behavior of 1.7 would be surprising to most >non-dev users (well, even to at least one dev :) ). > >Cheers, > > Tim > > >From: Oleg Tikhonov [olegtikho...@gmail.com] >Sent: Friday, October 24, 2014 2:24 PM >To: dev@tika.apache.org >Subject: Re: 1.7 release? > >Hi Tyler, >don't mention. > >Cheers, >Oleg >On Oct 24, 2014 8:02 PM, "Tyler Palsulich" wrote: > >> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there >>any >> other issues anyone would like to resolve before a new release? >> >> Thanks, >> Tyler >> >> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov >> wrote: >> >> > Sorry!!! >> > >> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) < >> > chris.a.mattm...@jpl.nasa.gov> wrote: >> > >> > > Thanks Oleg, will try tomorrow for me Los angeles time! >> > > >> > > ++ >> > > Chris Mattmann, Ph.D. >> > > Chief Architect >> > > Instrument Software and Science Data Systems Section (398) >> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> > > Office: 168-519, Mailstop: 168-527 >> > > Email: chris.a.mattm...@nasa.gov >> > > WWW: http://sunset.usc.edu/~mattmann/ >> > > ++ >> > > Adjunct Associate Professor, Computer Science Department >> > > University of Southern California, Los Angeles, CA 90089 USA >> > > ++ >> > > >> > > >> > > >> > > >> > > >> > > >> > > -Original Message- >> > > From: Oleg Tikhonov >> > > Reply-To: "dev@tika.apache.org" >> > > Date: Monday, October 20, 2014 at 11:20 PM >> > > To: "dev@tika.apache.org" >> > > Subject: Re: 1.7 release? >> > > >> > > >Please take a try with newest patch. >> > > >Cheers, >> > > >Oleg >&
Re: 1.7 release?
Hey Tim, What do you think about my existing patch for 1445? For example to just call all the parsers? I thought I was seeing behavior that was slow because of that, but it turned out to be Tesseract and my machine at the time? I think my patch for 1445 may be enough, and we should get the metadata I think? Thoughts? I honestly think we need to deliver Tesseract in 1.7. We're close. I'll even take it upon myself to try and experiment with the idea of multiple parsers being called. I think a simple solution to the metadata key conflict issue is simply to have a policy to add values (by default) and replace if a property is set in ParseContext. Some simple updates to CompositeParser would allow this. Thoughts? Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: , "Timothy B." Reply-To: "dev@tika.apache.org" Date: Friday, October 24, 2014 at 2:24 PM To: "dev@tika.apache.org" Subject: RE: 1.7 release? >Sorry for coming late to the game on the implications of TIKA-1445. I >don't want to hold up the release of 1.7. > >However, would it be possible to return to the legacy default behavior of >extracting metadata from images? > >We can then document on the OCR parser page on the wiki that you need to >install Tesseract _and_ make a change in the parser/mime config file. If >you want this new capability, it will take a small bit of work until we >solve TIKA-1445. > >I worry that the current behavior of 1.7 would be surprising to most >non-dev users (well, even to at least one dev :) ). > >Cheers, > > Tim > > >From: Oleg Tikhonov [olegtikho...@gmail.com] >Sent: Friday, October 24, 2014 2:24 PM >To: dev@tika.apache.org >Subject: Re: 1.7 release? > >Hi Tyler, >don't mention. > >Cheers, >Oleg >On Oct 24, 2014 8:02 PM, "Tyler Palsulich" wrote: > >> Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there >>any >> other issues anyone would like to resolve before a new release? >> >> Thanks, >> Tyler >> >> On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov >> wrote: >> >> > Sorry!!! >> > >> > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) < >> > chris.a.mattm...@jpl.nasa.gov> wrote: >> > >> > > Thanks Oleg, will try tomorrow for me Los angeles time! >> > > >> > > ++ >> > > Chris Mattmann, Ph.D. >> > > Chief Architect >> > > Instrument Software and Science Data Systems Section (398) >> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> > > Office: 168-519, Mailstop: 168-527 >> > > Email: chris.a.mattm...@nasa.gov >> > > WWW: http://sunset.usc.edu/~mattmann/ >> > > ++ >> > > Adjunct Associate Professor, Computer Science Department >> > > University of Southern California, Los Angeles, CA 90089 USA >> > > ++ >> > > >> > > >> > > >> > > >> > > >> > > >> > > -Original Message- >> > > From: Oleg Tikhonov >> > > Reply-To: "dev@tika.apache.org" >> > > Date: Monday, October 20, 2014 at 11:20 PM >> > > To: "dev@tika.apache.org" >> > > Subject: Re: 1.7 release? >> > > >> > > >Please take a try with newest patch. >> > > >Cheers, >> > > >Oleg >> > > > >> > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov < >> olegtikho...@gmail.com> >> > > >wrote: >> > > > >> > > >> Taken. Thanks. in progress ... >> > > >> >> > > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < >> > > >> chris.a.mattm...@jpl
RE: 1.7 release?
Sorry for coming late to the game on the implications of TIKA-1445. I don't want to hold up the release of 1.7. However, would it be possible to return to the legacy default behavior of extracting metadata from images? We can then document on the OCR parser page on the wiki that you need to install Tesseract _and_ make a change in the parser/mime config file. If you want this new capability, it will take a small bit of work until we solve TIKA-1445. I worry that the current behavior of 1.7 would be surprising to most non-dev users (well, even to at least one dev :) ). Cheers, Tim From: Oleg Tikhonov [olegtikho...@gmail.com] Sent: Friday, October 24, 2014 2:24 PM To: dev@tika.apache.org Subject: Re: 1.7 release? Hi Tyler, don't mention. Cheers, Oleg On Oct 24, 2014 8:02 PM, "Tyler Palsulich" wrote: > Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there any > other issues anyone would like to resolve before a new release? > > Thanks, > Tyler > > On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov > wrote: > > > Sorry!!! > > > > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) < > > chris.a.mattm...@jpl.nasa.gov> wrote: > > > > > Thanks Oleg, will try tomorrow for me Los angeles time! > > > > > > ++ > > > Chris Mattmann, Ph.D. > > > Chief Architect > > > Instrument Software and Science Data Systems Section (398) > > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > Office: 168-519, Mailstop: 168-527 > > > Email: chris.a.mattm...@nasa.gov > > > WWW: http://sunset.usc.edu/~mattmann/ > > > ++ > > > Adjunct Associate Professor, Computer Science Department > > > University of Southern California, Los Angeles, CA 90089 USA > > > ++ > > > > > > > > > > > > > > > > > > > > > -Original Message- > > > From: Oleg Tikhonov > > > Reply-To: "dev@tika.apache.org" > > > Date: Monday, October 20, 2014 at 11:20 PM > > > To: "dev@tika.apache.org" > > > Subject: Re: 1.7 release? > > > > > > >Please take a try with newest patch. > > > >Cheers, > > > >Oleg > > > > > > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov < > olegtikho...@gmail.com> > > > >wrote: > > > > > > > >> Taken. Thanks. in progress ... > > > >> > > > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < > > > >> chris.a.mattm...@jpl.nasa.gov> wrote: > > > >> > > > >>> Trunk is the current checkout/branch: > > > >>> > > > >>> http://svn.apache.org/repos/asf/tika/trunk > > > >>> > > > >>> > > > >>> ++ > > > >>> Chris Mattmann, Ph.D. > > > >>> Chief Architect > > > >>> Instrument Software and Science Data Systems Section (398) > > > >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > >>> Office: 168-519, Mailstop: 168-527 > > > >>> Email: chris.a.mattm...@nasa.gov > > > >>> WWW: http://sunset.usc.edu/~mattmann/ > > > >>> ++ > > > >>> Adjunct Associate Professor, Computer Science Department > > > >>> University of Southern California, Los Angeles, CA 90089 USA > > > >>> ++ > > > >>> > > > >>> > > > >>> > > > >>> > > > >>> > > > >>> > > > >>> -Original Message- > > > >>> From: Oleg Tikhonov > > > >>> Reply-To: "dev@tika.apache.org" > > > >>> Date: Monday, October 20, 2014 at 10:16 PM > > > >>> To: "dev@tika.apache.org" > > > >>> Subject: Re: 1.7 release? > > > >>> > > > >>> >Hi, I can try this on. > > > &
Re: 1.7 release?
Hi Tyler, don't mention. Cheers, Oleg On Oct 24, 2014 8:02 PM, "Tyler Palsulich" wrote: > Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there any > other issues anyone would like to resolve before a new release? > > Thanks, > Tyler > > On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov > wrote: > > > Sorry!!! > > > > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) < > > chris.a.mattm...@jpl.nasa.gov> wrote: > > > > > Thanks Oleg, will try tomorrow for me Los angeles time! > > > > > > ++ > > > Chris Mattmann, Ph.D. > > > Chief Architect > > > Instrument Software and Science Data Systems Section (398) > > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > Office: 168-519, Mailstop: 168-527 > > > Email: chris.a.mattm...@nasa.gov > > > WWW: http://sunset.usc.edu/~mattmann/ > > > ++ > > > Adjunct Associate Professor, Computer Science Department > > > University of Southern California, Los Angeles, CA 90089 USA > > > ++ > > > > > > > > > > > > > > > > > > > > > -Original Message- > > > From: Oleg Tikhonov > > > Reply-To: "dev@tika.apache.org" > > > Date: Monday, October 20, 2014 at 11:20 PM > > > To: "dev@tika.apache.org" > > > Subject: Re: 1.7 release? > > > > > > >Please take a try with newest patch. > > > >Cheers, > > > >Oleg > > > > > > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov < > olegtikho...@gmail.com> > > > >wrote: > > > > > > > >> Taken. Thanks. in progress ... > > > >> > > > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < > > > >> chris.a.mattm...@jpl.nasa.gov> wrote: > > > >> > > > >>> Trunk is the current checkout/branch: > > > >>> > > > >>> http://svn.apache.org/repos/asf/tika/trunk > > > >>> > > > >>> > > > >>> ++ > > > >>> Chris Mattmann, Ph.D. > > > >>> Chief Architect > > > >>> Instrument Software and Science Data Systems Section (398) > > > >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > >>> Office: 168-519, Mailstop: 168-527 > > > >>> Email: chris.a.mattm...@nasa.gov > > > >>> WWW: http://sunset.usc.edu/~mattmann/ > > > >>> ++ > > > >>> Adjunct Associate Professor, Computer Science Department > > > >>> University of Southern California, Los Angeles, CA 90089 USA > > > >>> ++ > > > >>> > > > >>> > > > >>> > > > >>> > > > >>> > > > >>> > > > >>> -Original Message- > > > >>> From: Oleg Tikhonov > > > >>> Reply-To: "dev@tika.apache.org" > > > >>> Date: Monday, October 20, 2014 at 10:16 PM > > > >>> To: "dev@tika.apache.org" > > > >>> Subject: Re: 1.7 release? > > > >>> > > > >>> >Hi, I can try this on. > > > >>> >What is a trunk? > > > >>> > > > > >>> > > > > >>> >Thanks, > > > >>> >Oleg > > > >>> > > > > >>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < > > > >>> >chris.a.mattm...@jpl.nasa.gov> wrote: > > > >>> > > > > >>> >> Hmm any idea why this is failing on Windows? Tyler P. and > > > >>> >> I were talking the other day - maybe we shouldn't run the > > > >>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts? > > > >>> >> > > > >>> >> > ++ > > > >>> >> Chris Mattmann, Ph.D. > > > >>> >&
Re: 1.7 release?
Thank you for the help, Oleg! I just resolved TIKA-1422. So, are there any other issues anyone would like to resolve before a new release? Thanks, Tyler On Tue, Oct 21, 2014 at 2:42 AM, Oleg Tikhonov wrote: > Sorry!!! > > On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) < > chris.a.mattm...@jpl.nasa.gov> wrote: > > > Thanks Oleg, will try tomorrow for me Los angeles time! > > > > ++ > > Chris Mattmann, Ph.D. > > Chief Architect > > Instrument Software and Science Data Systems Section (398) > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > Office: 168-519, Mailstop: 168-527 > > Email: chris.a.mattm...@nasa.gov > > WWW: http://sunset.usc.edu/~mattmann/ > > ++ > > Adjunct Associate Professor, Computer Science Department > > University of Southern California, Los Angeles, CA 90089 USA > > ++ > > > > > > > > > > > > > > -Original Message- > > From: Oleg Tikhonov > > Reply-To: "dev@tika.apache.org" > > Date: Monday, October 20, 2014 at 11:20 PM > > To: "dev@tika.apache.org" > > Subject: Re: 1.7 release? > > > > >Please take a try with newest patch. > > >Cheers, > > >Oleg > > > > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov > > >wrote: > > > > > >> Taken. Thanks. in progress ... > > >> > > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < > > >> chris.a.mattm...@jpl.nasa.gov> wrote: > > >> > > >>> Trunk is the current checkout/branch: > > >>> > > >>> http://svn.apache.org/repos/asf/tika/trunk > > >>> > > >>> > > >>> ++ > > >>> Chris Mattmann, Ph.D. > > >>> Chief Architect > > >>> Instrument Software and Science Data Systems Section (398) > > >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > >>> Office: 168-519, Mailstop: 168-527 > > >>> Email: chris.a.mattm...@nasa.gov > > >>> WWW: http://sunset.usc.edu/~mattmann/ > > >>> ++ > > >>> Adjunct Associate Professor, Computer Science Department > > >>> University of Southern California, Los Angeles, CA 90089 USA > > >>> ++ > > >>> > > >>> > > >>> > > >>> > > >>> > > >>> > > >>> -Original Message- > > >>> From: Oleg Tikhonov > > >>> Reply-To: "dev@tika.apache.org" > > >>> Date: Monday, October 20, 2014 at 10:16 PM > > >>> To: "dev@tika.apache.org" > > >>> Subject: Re: 1.7 release? > > >>> > > >>> >Hi, I can try this on. > > >>> >What is a trunk? > > >>> > > > >>> > > > >>> >Thanks, > > >>> >Oleg > > >>> > > > >>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < > > >>> >chris.a.mattm...@jpl.nasa.gov> wrote: > > >>> > > > >>> >> Hmm any idea why this is failing on Windows? Tyler P. and > > >>> >> I were talking the other day - maybe we shouldn't run the > > >>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts? > > >>> >> > > >>> >> ++ > > >>> >> Chris Mattmann, Ph.D. > > >>> >> Chief Architect > > >>> >> Instrument Software and Science Data Systems Section (398) > > >>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > >>> >> Office: 168-519, Mailstop: 168-527 > > >>> >> Email: chris.a.mattm...@nasa.gov > > >>> >> WWW: http://sunset.usc.edu/~mattmann/ > > >>> >> ++ > > >>> >> Adjunct Associate Professor, Computer Science Department > > >>> >> University of Southern California, Los Angeles, CA 90089 USA > > >>> >> ++ > > >>> >> > > >>> >> > > >>> >> > > >>> >> > > >>> >> > > >>> >> > > >>> >> -Original Message- > > >>> >> From: Hong-Thai Nguyen > > >>> >> Reply-To: "dev@tika.apache.org" > > >>> >> Date: Thursday, October 16, 2014 at 2:03 AM > > >>> >> To: "dev@tika.apache.org" > > >>> >> Subject: Re: 1.7 release? > > >>> >> > > >>> >> >Hi Andrzej, > > >>> >> > > > >>> >> >We are impatient for 1.7 release too. > > >>> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can > > >>>build > > >>> >> >successfully on Windows, I have no objection to release 1.7 > > >>> >> > > > >>> >> >Thanks, > > >>> >> > > > >>> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki < > a...@getopt.org> > > >>> >>wrote: > > >>> >> > > > >>> >> >> Hi, > > >>> >> >> > > >>> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that > > >>> >>includes > > >>> >> >>the > > >>> >> >> fix for broken ODF parsing... > > >>> >> >> > > >>> >> >> --- > > >>> >> >> Best regards, > > >>> >> >> > > >>> >> >> Andrzej Bialecki > > >>> >> >> > > >>> >> >> > > >>> >> > > > >>> >> > > > >>> >> >-- > > >>> >> >-- > > >>> >> >Hong-Thai > > >>> >> > > >>> >> > > >>> > > >>> > > >> > > > > >
Re: 1.7 release?
Sorry!!! On Tue, Oct 21, 2014 at 9:37 AM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Thanks Oleg, will try tomorrow for me Los angeles time! > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Oleg Tikhonov > Reply-To: "dev@tika.apache.org" > Date: Monday, October 20, 2014 at 11:20 PM > To: "dev@tika.apache.org" > Subject: Re: 1.7 release? > > >Please take a try with newest patch. > >Cheers, > >Oleg > > > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov > >wrote: > > > >> Taken. Thanks. in progress ... > >> > >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < > >> chris.a.mattm...@jpl.nasa.gov> wrote: > >> > >>> Trunk is the current checkout/branch: > >>> > >>> http://svn.apache.org/repos/asf/tika/trunk > >>> > >>> > >>> ++ > >>> Chris Mattmann, Ph.D. > >>> Chief Architect > >>> Instrument Software and Science Data Systems Section (398) > >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >>> Office: 168-519, Mailstop: 168-527 > >>> Email: chris.a.mattm...@nasa.gov > >>> WWW: http://sunset.usc.edu/~mattmann/ > >>> ++ > >>> Adjunct Associate Professor, Computer Science Department > >>> University of Southern California, Los Angeles, CA 90089 USA > >>> ++ > >>> > >>> > >>> > >>> > >>> > >>> > >>> -Original Message- > >>> From: Oleg Tikhonov > >>> Reply-To: "dev@tika.apache.org" > >>> Date: Monday, October 20, 2014 at 10:16 PM > >>> To: "dev@tika.apache.org" > >>> Subject: Re: 1.7 release? > >>> > >>> >Hi, I can try this on. > >>> >What is a trunk? > >>> > > >>> > > >>> >Thanks, > >>> >Oleg > >>> > > >>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < > >>> >chris.a.mattm...@jpl.nasa.gov> wrote: > >>> > > >>> >> Hmm any idea why this is failing on Windows? Tyler P. and > >>> >> I were talking the other day - maybe we shouldn't run the > >>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts? > >>> >> > >>> >> ++ > >>> >> Chris Mattmann, Ph.D. > >>> >> Chief Architect > >>> >> Instrument Software and Science Data Systems Section (398) > >>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >>> >> Office: 168-519, Mailstop: 168-527 > >>> >> Email: chris.a.mattm...@nasa.gov > >>> >> WWW: http://sunset.usc.edu/~mattmann/ > >>> >> ++++++++++ > >>> >> Adjunct Associate Professor, Computer Science Department > >>> >> University of Southern California, Los Angeles, CA 90089 USA > >>> >> ++ > >>> >> > >>> >> > >>> >> > >>> >> > >>> >> > >>> >> > >>> >> -Original Message- > >>> >> From: Hong-Thai Nguyen > >>> >> Reply-To: "dev@tika.apache.org" > >>> >> Date: Thursday, October 16, 2014 at 2:03 AM > >>> >> To: "dev@tika.apache.org" > >>> >> Subject: Re: 1.7 release? > >>> >> > >>> >> >Hi Andrzej, > >>> >> > > >>> >> >We are impatient for 1.7 release too. > >>> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can > >>>build > >>> >> >successfully on Windows, I have no objection to release 1.7 > >>> >> > > >>> >> >Thanks, > >>> >> > > >>> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki > >>> >>wrote: > >>> >> > > >>> >> >> Hi, > >>> >> >> > >>> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that > >>> >>includes > >>> >> >>the > >>> >> >> fix for broken ODF parsing... > >>> >> >> > >>> >> >> --- > >>> >> >> Best regards, > >>> >> >> > >>> >> >> Andrzej Bialecki > >>> >> >> > >>> >> >> > >>> >> > > >>> >> > > >>> >> >-- > >>> >> >-- > >>> >> >Hong-Thai > >>> >> > >>> >> > >>> > >>> > >> > >
Re: 1.7 release?
Thanks Oleg, will try tomorrow for me Los angeles time! ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Oleg Tikhonov Reply-To: "dev@tika.apache.org" Date: Monday, October 20, 2014 at 11:20 PM To: "dev@tika.apache.org" Subject: Re: 1.7 release? >Please take a try with newest patch. >Cheers, >Oleg > >On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov >wrote: > >> Taken. Thanks. in progress ... >> >> On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < >> chris.a.mattm...@jpl.nasa.gov> wrote: >> >>> Trunk is the current checkout/branch: >>> >>> http://svn.apache.org/repos/asf/tika/trunk >>> >>> >>> ++ >>> Chris Mattmann, Ph.D. >>> Chief Architect >>> Instrument Software and Science Data Systems Section (398) >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>> Office: 168-519, Mailstop: 168-527 >>> Email: chris.a.mattm...@nasa.gov >>> WWW: http://sunset.usc.edu/~mattmann/ >>> ++ >>> Adjunct Associate Professor, Computer Science Department >>> University of Southern California, Los Angeles, CA 90089 USA >>> ++++++++++ >>> >>> >>> >>> >>> >>> >>> -Original Message- >>> From: Oleg Tikhonov >>> Reply-To: "dev@tika.apache.org" >>> Date: Monday, October 20, 2014 at 10:16 PM >>> To: "dev@tika.apache.org" >>> Subject: Re: 1.7 release? >>> >>> >Hi, I can try this on. >>> >What is a trunk? >>> > >>> > >>> >Thanks, >>> >Oleg >>> > >>> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < >>> >chris.a.mattm...@jpl.nasa.gov> wrote: >>> > >>> >> Hmm any idea why this is failing on Windows? Tyler P. and >>> >> I were talking the other day - maybe we shouldn't run the >>> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts? >>> >> >>> >> ++ >>> >> Chris Mattmann, Ph.D. >>> >> Chief Architect >>> >> Instrument Software and Science Data Systems Section (398) >>> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>> >> Office: 168-519, Mailstop: 168-527 >>> >> Email: chris.a.mattm...@nasa.gov >>> >> WWW: http://sunset.usc.edu/~mattmann/ >>> >> ++ >>> >> Adjunct Associate Professor, Computer Science Department >>> >> University of Southern California, Los Angeles, CA 90089 USA >>> >> ++ >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> -Original Message- >>> >> From: Hong-Thai Nguyen >>> >> Reply-To: "dev@tika.apache.org" >>> >> Date: Thursday, October 16, 2014 at 2:03 AM >>> >> To: "dev@tika.apache.org" >>> >> Subject: Re: 1.7 release? >>> >> >>> >> >Hi Andrzej, >>> >> > >>> >> >We are impatient for 1.7 release too. >>> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can >>>build >>> >> >successfully on Windows, I have no objection to release 1.7 >>> >> > >>> >> >Thanks, >>> >> > >>> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki >>> >>wrote: >>> >> > >>> >> >> Hi, >>> >> >> >>> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that >>> >>includes >>> >> >>the >>> >> >> fix for broken ODF parsing... >>> >> >> >>> >> >> --- >>> >> >> Best regards, >>> >> >> >>> >> >> Andrzej Bialecki >>> >> >> >>> >> >> >>> >> > >>> >> > >>> >> >-- >>> >> >-- >>> >> >Hong-Thai >>> >> >>> >> >>> >>> >>
Re: 1.7 release?
Please take a try with newest patch. Cheers, Oleg On Tue, Oct 21, 2014 at 9:08 AM, Oleg Tikhonov wrote: > Taken. Thanks. in progress ... > > On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < > chris.a.mattm...@jpl.nasa.gov> wrote: > >> Trunk is the current checkout/branch: >> >> http://svn.apache.org/repos/asf/tika/trunk >> >> >> ++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> ++ >> >> >> >> >> >> >> -Original Message- >> From: Oleg Tikhonov >> Reply-To: "dev@tika.apache.org" >> Date: Monday, October 20, 2014 at 10:16 PM >> To: "dev@tika.apache.org" >> Subject: Re: 1.7 release? >> >> >Hi, I can try this on. >> >What is a trunk? >> > >> > >> >Thanks, >> >Oleg >> > >> >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < >> >chris.a.mattm...@jpl.nasa.gov> wrote: >> > >> >> Hmm any idea why this is failing on Windows? Tyler P. and >> >> I were talking the other day - maybe we shouldn't run the >> >> tests from TIKA-1422 unless Tesseract is installed? Thoughts? >> >> >> >> ++ >> >> Chris Mattmann, Ph.D. >> >> Chief Architect >> >> Instrument Software and Science Data Systems Section (398) >> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> >> Office: 168-519, Mailstop: 168-527 >> >> Email: chris.a.mattm...@nasa.gov >> >> WWW: http://sunset.usc.edu/~mattmann/ >> >> ++++++++++ >> >> Adjunct Associate Professor, Computer Science Department >> >> University of Southern California, Los Angeles, CA 90089 USA >> >> ++ >> >> >> >> >> >> >> >> >> >> >> >> >> >> -----Original Message- >> >> From: Hong-Thai Nguyen >> >> Reply-To: "dev@tika.apache.org" >> >> Date: Thursday, October 16, 2014 at 2:03 AM >> >> To: "dev@tika.apache.org" >> >> Subject: Re: 1.7 release? >> >> >> >> >Hi Andrzej, >> >> > >> >> >We are impatient for 1.7 release too. >> >> >I'm having compiling problem of TIKA-1422 on me. If anyone can build >> >> >successfully on Windows, I have no objection to release 1.7 >> >> > >> >> >Thanks, >> >> > >> >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki >> >>wrote: >> >> > >> >> >> Hi, >> >> >> >> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that >> >>includes >> >> >>the >> >> >> fix for broken ODF parsing... >> >> >> >> >> >> --- >> >> >> Best regards, >> >> >> >> >> >> Andrzej Bialecki >> >> >> >> >> >> >> >> > >> >> > >> >> >-- >> >> >-- >> >> >Hong-Thai >> >> >> >> >> >> >
Re: 1.7 release?
Taken. Thanks. in progress ... On Tue, Oct 21, 2014 at 8:54 AM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Trunk is the current checkout/branch: > > http://svn.apache.org/repos/asf/tika/trunk > > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Oleg Tikhonov > Reply-To: "dev@tika.apache.org" > Date: Monday, October 20, 2014 at 10:16 PM > To: "dev@tika.apache.org" > Subject: Re: 1.7 release? > > >Hi, I can try this on. > >What is a trunk? > > > > > >Thanks, > >Oleg > > > >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < > >chris.a.mattm...@jpl.nasa.gov> wrote: > > > >> Hmm any idea why this is failing on Windows? Tyler P. and > >> I were talking the other day - maybe we shouldn't run the > >> tests from TIKA-1422 unless Tesseract is installed? Thoughts? > >> > >> ++ > >> Chris Mattmann, Ph.D. > >> Chief Architect > >> Instrument Software and Science Data Systems Section (398) > >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >> Office: 168-519, Mailstop: 168-527 > >> Email: chris.a.mattm...@nasa.gov > >> WWW: http://sunset.usc.edu/~mattmann/ > >> ++ > >> Adjunct Associate Professor, Computer Science Department > >> University of Southern California, Los Angeles, CA 90089 USA > >> ++++++++++ > >> > >> > >> > >> > >> > >> > >> -Original Message- > >> From: Hong-Thai Nguyen > >> Reply-To: "dev@tika.apache.org" > >> Date: Thursday, October 16, 2014 at 2:03 AM > >> To: "dev@tika.apache.org" > >> Subject: Re: 1.7 release? > >> > >> >Hi Andrzej, > >> > > >> >We are impatient for 1.7 release too. > >> >I'm having compiling problem of TIKA-1422 on me. If anyone can build > >> >successfully on Windows, I have no objection to release 1.7 > >> > > >> >Thanks, > >> > > >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki > >>wrote: > >> > > >> >> Hi, > >> >> > >> >> Any news on the 1.7 release? or at least a 1.6.1 release that > >>includes > >> >>the > >> >> fix for broken ODF parsing... > >> >> > >> >> --- > >> >> Best regards, > >> >> > >> >> Andrzej Bialecki > >> >> > >> >> > >> > > >> > > >> >-- > >> >-- > >> >Hong-Thai > >> > >> > >
Re: 1.7 release?
Trunk is the current checkout/branch: http://svn.apache.org/repos/asf/tika/trunk ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Oleg Tikhonov Reply-To: "dev@tika.apache.org" Date: Monday, October 20, 2014 at 10:16 PM To: "dev@tika.apache.org" Subject: Re: 1.7 release? >Hi, I can try this on. >What is a trunk? > > >Thanks, >Oleg > >On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < >chris.a.mattm...@jpl.nasa.gov> wrote: > >> Hmm any idea why this is failing on Windows? Tyler P. and >> I were talking the other day - maybe we shouldn't run the >> tests from TIKA-1422 unless Tesseract is installed? Thoughts? >> >> ++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> ++ >> >> >> >> >> >> >> -Original Message- >> From: Hong-Thai Nguyen >> Reply-To: "dev@tika.apache.org" >> Date: Thursday, October 16, 2014 at 2:03 AM >> To: "dev@tika.apache.org" >> Subject: Re: 1.7 release? >> >> >Hi Andrzej, >> > >> >We are impatient for 1.7 release too. >> >I'm having compiling problem of TIKA-1422 on me. If anyone can build >> >successfully on Windows, I have no objection to release 1.7 >> > >> >Thanks, >> > >> >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki >>wrote: >> > >> >> Hi, >> >> >> >> Any news on the 1.7 release? or at least a 1.6.1 release that >>includes >> >>the >> >> fix for broken ODF parsing... >> >> >> >> --- >> >> Best regards, >> >> >> >> Andrzej Bialecki >> >> >> >> >> > >> > >> >-- >> >-- >> >Hong-Thai >> >>
Re: 1.7 release?
Hi, I can try this on. What is a trunk? Thanks, Oleg On Tue, Oct 21, 2014 at 6:21 AM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Hmm any idea why this is failing on Windows? Tyler P. and > I were talking the other day - maybe we shouldn't run the > tests from TIKA-1422 unless Tesseract is installed? Thoughts? > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++ > > > > > > > -Original Message- > From: Hong-Thai Nguyen > Reply-To: "dev@tika.apache.org" > Date: Thursday, October 16, 2014 at 2:03 AM > To: "dev@tika.apache.org" > Subject: Re: 1.7 release? > > >Hi Andrzej, > > > >We are impatient for 1.7 release too. > >I'm having compiling problem of TIKA-1422 on me. If anyone can build > >successfully on Windows, I have no objection to release 1.7 > > > >Thanks, > > > >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki wrote: > > > >> Hi, > >> > >> Any news on the 1.7 release? or at least a 1.6.1 release that includes > >>the > >> fix for broken ODF parsing... > >> > >> --- > >> Best regards, > >> > >> Andrzej Bialecki > >> > >> > > > > > >-- > >-- > >Hong-Thai > >
Re: 1.7 release?
Hmm any idea why this is failing on Windows? Tyler P. and I were talking the other day - maybe we shouldn't run the tests from TIKA-1422 unless Tesseract is installed? Thoughts? ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Hong-Thai Nguyen Reply-To: "dev@tika.apache.org" Date: Thursday, October 16, 2014 at 2:03 AM To: "dev@tika.apache.org" Subject: Re: 1.7 release? >Hi Andrzej, > >We are impatient for 1.7 release too. >I'm having compiling problem of TIKA-1422 on me. If anyone can build >successfully on Windows, I have no objection to release 1.7 > >Thanks, > >On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki wrote: > >> Hi, >> >> Any news on the 1.7 release? or at least a 1.6.1 release that includes >>the >> fix for broken ODF parsing... >> >> --- >> Best regards, >> >> Andrzej Bialecki >> >> > > >-- >-- >Hong-Thai
Re: 1.7 release?
Hi Andrzej, We are impatient for 1.7 release too. I'm having compiling problem of TIKA-1422 on me. If anyone can build successfully on Windows, I have no objection to release 1.7 Thanks, On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki wrote: > Hi, > > Any news on the 1.7 release? or at least a 1.6.1 release that includes the > fix for broken ODF parsing… > > --- > Best regards, > > Andrzej Bialecki > > -- -- Hong-Thai
1.7 release?
Hi, Any news on the 1.7 release? or at least a 1.6.1 release that includes the fix for broken ODF parsing… --- Best regards, Andrzej Bialecki