RE: Looking to Index Various Document Types.

2008-03-12 Thread Steven A Rowe
'sup, DD: You should have posted your question, which is about *using* Lucene, to the java-user mailing list; the java-dev mailing list is instead intended for discussion of *development of* Lucene. Here's a Lius tutorial, in both French and English: http://www.doculibre.com/lius/ And here's

Looking to Index Various Document Types.

2008-03-12 Thread DURGA DEEP
HI Folks, I was looking at the Lucene FAQ and I found this very interesting. How can I index OpenOffice.org files? These files (.sxw, .sxc, etc) are ZIP archives that contain XML files. Uncompress the file using Java's ZIP support, then parse meta.xml to get title etc. and content.xml to get the