Re: Does Lucene Java 2.3.2 supports parsing of Microsoft office 2007 documents...

2008-06-28 Thread Nick Burch
On Fri, 27 Jun 2008, Hasan Diwan wrote:
> The new ODF-compatible Office 2007 is not supported by POI.
Actually, it is, just not the version in trunk. You can download nightly builds of the ooxml branch from http://encore.torchbox.com/poi-svn-build/OOXML-Branch/ And there ought to be a

Re: Does Lucene Java 2.3.2 supports parsing of Microsoft office 2007 documents...

2008-06-27 Thread Hasan Diwan
Kumar: Assuming you want to index a pre-parsed document...
2008/6/27 Erick Erickson <[EMAIL PROTECTED]>:
>> If it supports, what should be done in Lucene demo 2.3.2 to search queries
>> on file with above mentioned extensions?
The new ODF-compatible Office 2007 is not supported by POI. However, yo

Re: Does Lucene Java 2.3.2 supports parsing of Microsoft office 2007 documents...

2008-06-27 Thread Erick Erickson
Lucene doesn't actually support any of the document types. What happens is that some program is used to parse the files into an indexable stream and that stream is indexed. That used to be POI in the old days. I confess I haven't used the latest demo, but I assume that under the covers there's som
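To make the "parse first, then index the resulting text" flow concrete, here is a minimal sketch in plain Java. It is a toy in-memory index, not Lucene's API: the class `ParseThenIndexSketch`, the `TextExtractor` interface, and the methods `register`, `indexFile`, and `search` are all hypothetical names made up for illustration. The assumption is that a file type only becomes searchable once some extractor (POI or similar, in a real setup) has been registered for its extension.

```java
import java.nio.charset.StandardCharsets;
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Toy illustration of "parse to text, then index the text". Not Lucene's API. */
class ParseThenIndexSketch {

    /** Hypothetical hook: turns the raw bytes of one file format into plain text. */
    interface TextExtractor {
        String extract(byte[] raw);
    }

    // One extractor per file extension (assumption: the extension identifies the format).
    private final Map<String, TextExtractor> extractorsByExtension = new HashMap<>();

    // Toy inverted index: lower-cased token -> names of the files containing it.
    private final Map<String, Set<String>> postings = new HashMap<>();

    public void register(String extension, TextExtractor extractor) {
        extractorsByExtension.put(extension.toLowerCase(Locale.ROOT), extractor);
    }

    /** Returns false when no extractor is registered, so nothing from the file is searchable. */
    public boolean indexFile(String fileName, byte[] raw) {
        int dot = fileName.lastIndexOf('.');
        String extension = dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        TextExtractor extractor = extractorsByExtension.get(extension);
        if (extractor == null) {
            return false;
        }
        // The indexer only ever sees the extracted text, never the original format.
        String text = extractor.extract(raw);
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                postings.computeIfAbsent(token, k -> new TreeSet<>()).add(fileName);
            }
        }
        return true;
    }

    public Set<String> search(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet());
    }

    public static void main(String[] args) {
        ParseThenIndexSketch index = new ParseThenIndexSketch();
        // Plain text needs no real parsing; a binary format would plug in a real parser here.
        index.register("txt", raw -> new String(raw, StandardCharsets.UTF_8));

        byte[] body = "Lucene indexes text streams".getBytes(StandardCharsets.UTF_8);
        System.out.println("notes.txt indexed: " + index.indexFile("notes.txt", body));
        System.out.println("report.docx indexed: " + index.indexFile("report.docx", body));
        System.out.println("hits for 'lucene': " + index.search("lucene"));
    }
}
```

In this sketch, supporting a new format means registering one more extractor; the indexing and search code stays unchanged. Whether the 2.3.2 demo is organized this way internally is not stated in the thread.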

Does Lucene Java 2.3.2 supports parsing of Microsoft office 2007 documents...

2008-06-27 Thread Kumar Gaurav
Dear all, Currently I am using the Lucene Java 2.3.2 demo to parse Microsoft 2003 and 2007 docs and PDF files. It is able to parse files with *.pdf, *.doc, *.xls etc. But searches find nothing in Microsoft 2007 documents, even though the demo reports indexing *.docx and other Microsoft 2007 files. Does Lu