Re: What is flexible indexing in Lucene 4.0 if it's not the ability to make new postings codecs?

2012-11-30 Thread Johannes.Lichtenberger
On 11/28/2012 01:11 AM, Michael McCandless wrote: Flexible indexing is the ability to make your own codec, which controls the reading and writing of all index parts (postings, stored fields, term vectors, deleted docs, etc.). So for example if you want to store some postings as a bit set

native, versioned XML-DBMS (that is full text search in versioned document collections)

2012-11-27 Thread Johannes.Lichtenberger
Hello, as posted some time ago I'm working on a native, versioned XML-DBMS [1]. I'd like to provide a full text index and I recently read about customized Codecs which can be plugged in. Usually data (for instance XML nodes) are stored on RecordPages. I'm still not sure if it is possible and

Re: Indexing and searching across versioned document collections

2012-11-09 Thread Johannes.Lichtenberger
On 11/09/2012 09:41 AM, jake dsouza wrote: Hello, Has any one worked on making Lucene index and search versioned document collections i.e any corpus with multiple versions of documents similar to wikipedia or source code. I am working on a project to index and search versioned collections while

Revisioned Lucene Index

2012-07-28 Thread Johannes.Lichtenberger
Hello, I'm currently working on revisioned index structures for a treebased storage system[1] and I want to provide an index-structure for fulltext search (more or less on XML text-nodes). Either I'm going to implement a Radix/PATRICIA-tree or I'm opting for Lucene. I thought about adding a