Hi Zabrane,

I think that -T only outputs the text within the body of the XHTML. Not positive on that, though...
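
One way to check would be to run both options on the same file and diff the results. A rough sketch, assuming a POSIX shell with mktemp and diff available. compare_tika_text is a hypothetical helper name, not part of Tika; only the -t and -T flags come from the usage message quoted in this thread:

```shell
# Hypothetical helper: run tika-app with -t and -T on the same input and
# print the lines on which the two outputs differ.
compare_tika_text() {
  jar="$1"     # path to the tika-app jar, e.g. tika-app-0.8.jar
  input="$2"   # any document Tika can parse (file name is an assumption)
  workdir="$(mktemp -d)"

  java -jar "$jar" -t "$input" > "$workdir/text.txt"
  java -jar "$jar" -T "$input" > "$workdir/text-main.txt"

  # diff exits non-zero when the files differ; that is not an error here.
  diff "$workdir/text.txt" "$workdir/text-main.txt" || true
  rm -rf "$workdir"
}
```

For example, `compare_tika_text tika-app-0.8.jar page.html` (page.html being whatever file you have at hand) prints nothing if the two options produce identical output for that file.
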

Cheers,
Chris

On 11/10/10 9:09 AM, "zabrane Mikael" <[email protected]> wrote:

Hi Chris,

Congratulations on Tika 0.8.

Could someone please explain to me what the aim of the new option "-T" is:

$ java -jar tika-app-0.8.jar --help
usage: tika [option] [file]

Options:
    -? or --help        Print this usage message
    -v or --verbose     Print debug level messages
    -x or --xml         Output XHTML content (default)
    -h or --html        Output HTML content
    -t or --text        Output plain text content
    -T or --text-main   Output plain text content (main content only)
    ...

Is there any real difference between the "-t" and "-T" options?

--
Regards
Zabrane

2010/11/9 Mattmann, Chris A (388J) <[email protected]>:
> Hi Folks,
>
> I have posted a candidate for the Apache Tika 0.8 release. The source code
> is at:
>
> http://people.apache.org/~mattmann/apache-tika-0.8/rc1/
>
> See the included CHANGES.txt file for details on release contents and latest
> changes. The release was made using the Maven2 release plugin, according to
> Jukka Zitting's notes:
>
> http://tinyurl.com/yz2cqls
>
> This plugin creates a Tika 0.8 tag at:
>
> http://svn.apache.org/repos/asf/tika/tags/0.8/
>
> And a staged M2 repository at repository.apache.org, would normally be
> available, but I accidentally created the real thing at Central. Assuming
> this VOTE passes, we'll just leave it there. You can check out the staging
> repo for 0.8 here:
>
> https://repository.apache.org/content/groups/staging/org/apache/tika/
>
> Please vote on releasing these packages as Apache Tika 0.8. The vote is open
> for the next 72 hours. Only votes from Tika PMC are binding, but everyone
> is welcome to check the release candidate and voice their approval or
> disapproval. The vote passes if at least three binding +1 votes are cast.
>
> [ ] +1 Release the packages as Apache Tika 0.8.
>
> [ ] -1 Do not release the packages because...
>
> Thanks!
>
> Cheers,
> Chris
>
> P.S. Here's my +1.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: [email protected]
> WWW: http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
