[ 
https://issues.apache.org/jira/browse/LUCENE-756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12462069
 ] 

Doron Cohen commented on LUCENE-756:
------------------------------------

I am updating the patch (nrm.patch.3.txt): 

- using a single constant for the norms file extension:
  static final String NORMS_EXTENSION = "nrm";
(This is more in line with existing extension constants in the code.)
(As a side comment, there are various extension names (e.g. ".cfs") in the code 
that are also candidate for factoring as a constant, but this is a separate 
issue.)

- adding a test - TestNorms
This test verifies that norm values assigned with field.setBoost() are 
preserved during the life cycle of an index, including adding documents, 
updating norms values (separate norms), addIndexes(), and optimize.

All tests pass.
On my side this is ready to go in.


> Maintain norms in a single file .nrm
> ------------------------------------
>
>                 Key: LUCENE-756
>                 URL: https://issues.apache.org/jira/browse/LUCENE-756
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>            Reporter: Doron Cohen
>         Assigned To: Doron Cohen
>            Priority: Minor
>         Attachments: nrm.patch.2.txt, nrm.patch.txt
>
>
> Non-compound indexes are ~10% faster at indexing, and perform 50% IO activity 
> comparing to compound indexes. But their file descriptors foot print is much 
> higher. 
> By maintaining all field norms in a single .nrm file, we can bound the number 
> of files used by non compound indexes, and possibly allow more applications 
> to use this format.
> More details on the motivation for this in: 
> http://www.nabble.com/potential-indexing-perormance-improvement-for-compound-index---cut-IO---have-more-files-though-tf2826909.html
>  (in particular 
> http://www.nabble.com/Re%3A-potential-indexing-perormance-improvement-for-compound-index---cut-IO---have-more-files-though-p7910403.html).

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to