Hierachical documents is a key concept towads a unified
structured+unstructured search. It should allow us to fully implement
things such as XQuery + Full-Text
(http://www.w3.org/TR/xquery-full-text/)

Additionally it solves a century old problem: how to deal with
section/sub-sections in very large documents. Long time ago I was
indexing text books (in PDF) and had to break down the book into pages
and store the main doc id in a field as pointer to maintain the
relation.

Mark, way to go!

-- Joaquin

On Mon, May 10, 2010 at 8:03 AM, Grant Ingersoll <gsing...@apache.org> wrote:
> Very cool stuff, Mark.
>
> Can you just open a JIRA and attach there?
>
> On May 10, 2010, at 8:38 AM, mark harwood wrote:
>
>> I've put up code, example data and tests for the Nested Document feature 
>> here: http://www.inperspective.com/lucene/LuceneNestedDocumentSupport.zip
>>
>> The data used in the unit tests is chosen to illustrate practical use of 
>> real-world content.
>> The final unit tests will work on more abstract data for more 
>> formal/exhaustive testing of functionality.
>>
>> This packaging changes no existing Lucene code and is bundled with 3.0.1 but 
>> should work with 2.9.1. The readme.txt highlights the issues with segment 
>> flushing that may need addressing before adoption.
>>
>>
>> Cheers
>> Mark
>>
>>
>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
>> For additional commands, e-mail: dev-h...@lucene.apache.org
>>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to