Hi Tim, Taking a fast look at Nick's fix on TIKA-2419 seems conservative to me, restricted to corrupted xml, so I think there is no need to rerun the regression tests.
So +1 from me, ++1 with age detection :) 2017-07-05 22:35 GMT-03:00 Allison, Timothy B. <talli...@mitre.org>: > All, > I'm waiting to get some resolution on TIKA-2399. The regression tests > came back with nothing surprising. I fixed the npe that they uncovered in > the new ppt macro extraction code. > Will I need to rerun with the updates to mime detection that Nick just > made? Or are we good enough to go once we figure out what we can do w > TIKA-2399? > > Onward. > > Cheers, > Tim > > -----Original Message----- > From: Allison, Timothy B. [mailto:talli...@mitre.org] > Sent: Monday, July 3, 2017 2:35 PM > To: dev@tika.apache.org > Subject: RE: Tika 1.15.1? -> 1.16 > > Sounds good. I'll kick off regression tests now, with a goal of creating > 1.16-rc1 on Wednesday 14:00 UTC? > > -----Original Message----- > From: Mattmann, Chris A (3010) [mailto:chris.a.mattm...@jpl.nasa.gov] > Sent: Monday, July 3, 2017 2:24 PM > To: dev@tika.apache.org > Subject: Re: Tika 1.15.1? -> 1.16 > > Hey Tim, if I don’t get it done by today, push 1.16 and we’ll put Age > Detection in 1.17. > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Chris Mattmann, Ph.D. > Principal Data Scientist, Engineering Administrative Office (3010) > Manager, NSF & Open Source Projects Formulation and Development Offices > (8212) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 180-503E, Mailstop: 180-503 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Director, Information Retrieval and Data Science Group (IRDS) Adjunct > Associate Professor, Computer Science Department University of Southern > California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > On 7/3/17, 7:17 AM, "Allison, Timothy B." <talli...@mitre.org> wrote: > > All, > I think we're now solidly at 1.16. Anyone still strongly in favor > of 1.15.1? > > Chris, > Will age detection be ready soon, or should we push that to 1.17? > > -----Original Message----- > From: Allison, Timothy B. [mailto:talli...@mitre.org] > Sent: Friday, June 30, 2017 7:01 AM > To: dev@tika.apache.org; lfcnas...@gmail.com > Subject: RE: Tika 1.15.1? -> 1.16 > > Y, I was thinking that I may have already pushed us over this > threshold with the * below. 1.16 it is then? > > Chris, let us know when the age detection is good to go or if 1.17 is > a better target. > > > * Allow extraction of scripts as embedded "MACRO". Users > must turn this on via TikaConfig (TIKA-2391). > > * Allow users to turn off extraction of headers and footers > from .doc, .docx, .xls, .xlsx, .xlsb (TIKA-2362) > > * Extract text from charts in .docx, .pptx, .xlsx and .xlsb > (TIKA-2254). > > * Extract text from diagrams in .docx, .pptx, .xlsx and .xlsb > (TIKA-1945). > > * Enable base32 encoding of digests and enable BouncyCastle > implementations > of digest algorithms (TIKA-2386). > > -----Original Message----- > From: Luís Filipe Nassif [mailto:lfcnas...@gmail.com] > Sent: Thursday, June 29, 2017 4:12 PM > To: dev@tika.apache.org > Subject: Re: Tika 1.15.1? > > Agreed. > > Luis > > > 2017-06-29 15:45 GMT-03:00 Bob Paulin <b...@bobpaulin.com>: > > > If we're adding features does it make sense just to bump to 1.16 > > rather than 1.15.1? Traditionally point releases would be bug fixes > only [1]. > > > > > > - Bob > > > > [1] http://semver.org/ > > On 6/29/2017 1:18 PM, Allison, Timothy B. wrote: > > > K. > > > > > > -----Original Message----- > > > From: Mattmann, Chris A (3010) > > > [mailto:chris.a.mattm...@jpl.nasa.gov] > > > Sent: Thursday, June 29, 2017 1:59 PM > > > To: dev@tika.apache.org > > > Subject: Re: Tika 1.15.1? > > > > > > Hey Tim, I’d like to try and get in: > > > > > > https://issues.apache.org/jira/browse/TIKA-1988 > > > > > > today for 15.1. I am working on integrating it now and adding some > > > docs > > to the wiki. > > > > > > I’ll keep you posted. > > > > > > Cheers, > > > Chris > > > > > > > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > ++++++++++++++ > > > Chris Mattmann, Ph.D. > > > Principal Data Scientist, Engineering Administrative Office (3010) > > Manager, NSF & Open Source Projects Formulation and Development > > Offices > > (8212) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > Office: 180-503E, Mailstop: 180-503 > > > Email: chris.a.mattm...@nasa.gov > > > WWW: http://sunset.usc.edu/~mattmann/ > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > ++++++++++++++ > > > Director, Information Retrieval and Data Science Group (IRDS) > > > Adjunct > > Associate Professor, Computer Science Department University of > > Southern California, Los Angeles, CA 90089 USA > > > WWW: http://irds.usc.edu/ > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > ++++++++++++++ > > > > > > > > > On 6/28/17, 12:24 PM, "Allison, Timothy B." <talli...@mitre.org> > wrote: > > > > > > POI is available on maven, and I just upgraded. > > > > > > Unless there are objections, I'll change our > > > > > > org.apache.tika.parser.sentiment.analysis.SentimentParser > > > > > > to > > > > > > > > > org.apache.tika.parser.sentiment.analysis.SentimentAnalysisParser > > > > > > and we should be good to go for 1.15.1? > > > > > > Let me know if you'd like to hold off for a bit, but there's > > > always > > 1.15.2. :) > > > > > > Cheers, > > > > > > Tim > > > > > > -----Original Message----- > > > From: Mattmann, Chris A (3010) > > > [mailto:chris.a.mattm...@jpl.nasa.gov > > ] > > > Sent: Friday, June 23, 2017 3:39 PM > > > To: dev@tika.apache.org > > > Subject: Re: Tika 1.15.1? > > > > > > Let me get back to you I’d like to see if we can get some > > > progress > > on the Age Detector Parser > > > > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > ++++++++++++++ > > > Chris Mattmann, Ph.D. > > > Principal Data Scientist, Engineering Administrative Office > > > (3010) > > Manager, NSF & Open Source Projects Formulation and Development > > Offices > > (8212) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > > > Office: 180-503E, Mailstop: 180-503 > > > Email: chris.a.mattm...@nasa.gov > > > WWW: http://sunset.usc.edu/~mattmann/ > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > ++++++++++++++ > > > Director, Information Retrieval and Data Science Group (IRDS) > > Adjunct Associate Professor, Computer Science Department University > of > > Southern California, Los Angeles, CA 90089 USA > > > WWW: http://irds.usc.edu/ > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > ++++++++++++++ > > > > > > > > > On 6/23/17, 10:01 AM, "Allison, Timothy B." < > talli...@mitre.org> > > wrote: > > > > > > All, > > > With the exception of the SentimentParser (which we have > a > > path forward on), I think we're good to go. It looks like POI is > > about to kick off the release process for 3.17-beta1, and the batch > > results look good. I propose waiting a week or so to incorporate > that. > > > Anything else we need to get in for 1.15.1? > > > > > > Cheers, > > > > > > Tim > > > > > > -----Original Message----- > > > From: Chris Mattmann [mailto:mattm...@apache.org] > > > Sent: Friday, June 16, 2017 2:43 PM > > > To: dev@tika.apache.org > > > Subject: Re: Tika 1.15.1? > > > > > > Yep agreed on both Tim. If I don’t get it done this > weekend, > > we’ll apply the approach you mention below. > > > > > > Great seeing you yesterday! > > > > > > > > > > > > > > > On 6/16/17, 11:40 AM, "Allison, Timothy B." > > > <talli...@mitre.org> > > wrote: > > > > > > All, > > > > > > I'm hoping to wrap up the TEIParser next week (I'm > > > thinking > > about modifying code to handle DOM)...and this should rid us of > > org.json licensing issues. Run a release for 1.15.1 probably the > following week? > > > > > > Anything else we want to get in to 1.15.1? > > > > > > Chris, I'm not sure where you are on the > SentimentParser. > > If there will be a quick fix, great; otherwise, we should be ok with > > the added exclusions (TIKA-2397) and if we rename the class in Tika > so > > that we don't have a conflict over oat.parsers.SentimentParser > (TIKA-2368). > > > > > > Cheers, > > > > > > Tim > > > > > > -----Original Message----- > > > From: Tyler Bui-Palsulich [mailto: > tbpalsul...@gmail.com] > > > Sent: Friday, June 2, 2017 8:39 PM > > > To: dev@tika.apache.org > > > Subject: Re: Tika 1.16? > > > > > > +1 to 1.15.1. > > > > > > It would also be nice to be able to have "cheap" > > > security > > releases as they come up. > > > > > > Tyler > > > > > > On Jun 2, 2017 6:12 AM, "Bob Paulin" < > b...@bobpaulin.com> > > wrote: > > > > > > > Would be breaking a bit from the current release > > > numbering > > but I'd > > > > fully support moving to semantic versioning. +1 to a > > 1.15.1 > > > > > > > > - Bob > > > > > > > > > > > > On 6/2/2017 8:06 AM, Luís Filipe Nassif wrote: > > > > > Maybe 1.15.1? > > > > > > > > > > Em 1 de jun de 2017 10:03 AM, "Bob Paulin" < > > b...@bobpaulin.com> escreveu: > > > > > > > > > >> +1 > > > > >> > > > > >> > > > > >> On 6/1/2017 6:50 AM, Allison, Timothy B. wrote: > > > > >>> Given the broken OSGi and the org.json issues > with > > 1.15, does it > > > > >>> make > > > > >> sense to aim for 1.16 fairly soon, say 3-4 weeks? > > > > >>> Cheers, > > > > >>> > > > > >>> Tim > > > > >>> > > > > >>> > > > > >> > > > > >> > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >