Go to http://www.textmining.org for a platform independent library to extract text from Word documents. I wrote 99.99% of the Word component of POI and all of the textmining.org library.
I have seen several discussions and web pages that point to textmining.org that say I simply wrap POI classes (For example, the JGuru GAQ http://www.jguru.com ) This is totally false. * The textmining.org library is optimized for extracting text. POI is not. * The textmining.org libraries supports extracting text from Word 6/95. POI does not. * The textmining.org libraries do not extract deleted text that is still in the document for the purposes of revision marking. POI does not handle this. -Ryan Ackley ----- Original Message ----- From: "Chandan Tamrakar" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Tuesday, August 24, 2004 7:31 AM Subject: Re: worddoucments search > please look at Apache POI project. > http://jakarta.apache.org > > Words documents can be extracted using POI apis and later can be indexed. > > regards > > ----- Original Message ----- > From: "Santosh" <[EMAIL PROTECTED]> > To: "Lucene Users List" <[EMAIL PROTECTED]> > Sent: Tuesday, August 24, 2004 6:00 PM > Subject: worddoucments search > > > Can lucene be able to search word documents? if so please give me > information about it > > regards > Santosh kumar > > > -----------------------SOFTPRO DISCLAIMER------------------------------ > > Information contained in this E-MAIL and any attachments are > confidential being proprietary to SOFTPRO SYSTEMS is 'privileged' > and 'confidential'. > > If you are not an intended or authorised recipient of this E-MAIL or > have received it in error, You are notified that any use, copying or > dissemination of the information contained in this E-MAIL in any > manner whatsoever is strictly prohibited. Please delete it immediately > and notify the sender by E-MAIL. > > In such a case reading, reproducing, printing or further dissemination > of this E-MAIL is strictly prohibited and may be unlawful. > > SOFTPRO SYSYTEMS does not REPRESENT or WARRANT that an attachment > hereto is free from computer viruses or other defects. > > The opinions expressed in this E-MAIL and any ATTACHEMENTS may be > those of the author and are not necessarily those of SOFTPRO SYSTEMS. > ------------------------------------------------------------------------ > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]