Building Lucene index for XML document

2007-01-24 Thread maureen tanuwidjaja
Hi... I am a Final Year Undergrad.My Final year project is about search engine for XML Document..I am currently building this system using Lucene. The example of XML element from an XML document : -- This is my

Re: Building Lucene index for XML document

2007-01-24 Thread Daniel Noll
maureen tanuwidjaja wrote: Before implementing this search engine,I have designed to build the index in such a way that every XML tag is converted using binary value,in order to reduce the size index and perhaps for faster searching.To illustrate: article will be converted to 0 article/body

Re: Building Lucene index for XML document

2007-01-25 Thread maureen tanuwidjaja
Thanks a lot Daniel :) Regards, Maureen Daniel Noll <[EMAIL PROTECTED]> wrote: maureen tanuwidjaja wrote: > Before implementing this search engine,I have designed to build the > index in such a way that every XML tag is converted using binary > value,in order to reduce the size ind

Re: Building Lucene index for XML document

2007-01-25 Thread maureen tanuwidjaja
btw Daniel,can please give me the reference to find the explanation about SegmentTermEnum/Field Infos if such one exist? I search but best can only find http://lucene.apache.org/java/docs/clover/org/apache/lucene/index/SegmentTermEnum.html which is the source code only... Many thanks and B

Re: Building Lucene index for XML document

2007-01-25 Thread Doron Cohen
Hi Maureen, Some relevant info in the file formats doc - http://lucene.apache.org/java/docs/fileformats.html Regards, Doron maureen tanuwidjaja <[EMAIL PROTECTED]> wrote on 25/01/2007 01:31:25: > btw Daniel,can please give me the reference to find the explanation > about SegmentTermEnum/Field I

Re: Building Lucene index for XML document

2007-01-25 Thread maureen tanuwidjaja
Thanks Doron =) Regards, Maureen Doron Cohen <[EMAIL PROTECTED]> wrote: Hi Maureen, Some relevant info in the file formats doc - http://lucene.apache.org/java/docs/fileformats.html Regards, Doron maureen tanuwidjaja wrote on 25/01/2007 01:31:25: > btw Daniel,can please give me the ref