AW: Revisioned Lucene Index

2012-07-28 Thread Lutz Fechner
Hi, Lucene is storing it's data pretty much flat. You have Documents representing a seach result. This Documents are created during the indexing process you have to implement. For XML data I would recommend to store the Xpath of the indexed data in a field in the lucene documents in order to ge

Revisioned Lucene Index

2012-07-28 Thread Johannes.Lichtenberger
Hello, I'm currently working on revisioned index structures for a treebased storage system[1] and I want to provide an index-structure for fulltext search (more or less on XML text-nodes). Either I'm going to implement a Radix/PATRICIA-tree or I'm opting for Lucene. I thought about adding a n

Re: Reindexing after database change

2012-07-28 Thread Jack Krupansky
Lucene itself doesn't "pull" data in. Rather, you need to "push" data into it. With Solr you can push data in from the command line using standard tools such as curl or Java programs or any number of scripting tools or external "push" connector systems such as Nutch or ManifoldCF, but with Luce

Reindexing after database change

2012-07-28 Thread Rodrigo P. Bregalanti
Hello, I am working on a Data warehouse project and importing a huge mass of data directly to the database (MysQL). After conclude the project and roll out to production, I have discovered that one of the websites, which has to access this database, is using Lucene. Result: the data I have inser