[ 
https://issues.apache.org/jira/browse/LUCENE-550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karl Wettin updated LUCENE-550:
-------------------------------

    Attachment: LUCENE-550_20070928_no_core_changes.txt

New in this patch:

 * Payloads added to TestIndicesEquals
 * Package level java docs with UMLet class diagram
 * Some additional todo-tags in the code that shows what can be improved

I've noticed that there are some differences in the behavior of IndexWriter and 
InstantiatedIndexWriter when a document containing multiple fields with the 
same name but different settings, such as:

{code}
 d.add(new Field("f", " All work and no play makes Jack a dull boy", 
Field.Store.YES, Field.Index.TOKENIZED, 
Field.TermVector.WITH_POSITIONS_OFFSETS));
 d.add(new Field("f", " All work and no play makes Jack a dull boy", 
Field.Store.NO));
 d.add(new Field("f", " All work and no play makes Jack a dull boy", 
Field.Store.YES, Field.Index.NO_NORMS, Field.TermVector.NO));
{code}

Would this be considered an invalid document? Should there be a term vector or 
not? Or perhaps just term vector for the tokens in the first field?

> InstantiatedIndex - faster but memory consuming index
> -----------------------------------------------------
>
>                 Key: LUCENE-550
>                 URL: https://issues.apache.org/jira/browse/LUCENE-550
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Store
>    Affects Versions: 2.0.0
>            Reporter: Karl Wettin
>            Assignee: Grant Ingersoll
>         Attachments: HitCollectionBench.jpg, lucene-550.jpg, 
> LUCENE-550_20070804_no_core_changes.txt, 
> LUCENE-550_20070808_no_core_changes.txt, 
> LUCENE-550_20070817_no_core_changes.txt, 
> LUCENE-550_20070928_no_core_changes.txt, test-reports.zip, trunk.diff.bz2, 
> trunk.diff.bz2, trunk.diff.bz2, trunk.diff.bz2, trunk.diff.bz2, 
> trunk.diff.bz2, trunk.diff.bz2, trunk.diff.bz2, trunk.diff.bz2, 
> trunk.diff.bz2, trunk.diff.bz2
>
>
> An non file centrinc all in memory index. Consumes some 2x the memory of a 
> RAMDirectory (in a term satured index) but is between 3x-60x faster depending 
> on application and how one counts. Average query is about 8x faster. 
> IndexWriter and IndexModifier have been realized in InterfaceIndexWriter and 
> InterfaceIndexModifier. 
> InstantiatedIndex is wrapped in a new top layer index facade (class Index) 
> that comes with factory methods for writers, readers and searchers for unison 
> index handeling. There are decorators with notification handling that can be 
> used for automatically syncronizing searchers on updates, et.c. 
> Index also comes with FS/RAMDirectory implementation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to