[ 
https://issues.apache.org/jira/browse/LUCENE-9510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17196879#comment-17196879
 ] 

ASF subversion and git services commented on LUCENE-9510:
---------------------------------------------------------

Commit 93094ef7e4470dd9f0ade3a3d8403548729a4609 in lucene-solr's branch 
refs/heads/master from Adrien Grand
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=93094ef ]

LUCENE-9510: Don't compress temporary stored fields and term vectors when index 
sorting is enabled. (#1874)

When index sorting is enabled, stored fields and term vectors can't be
written on the fly like in the normal case, so they are written into
temporary files that then get resorted. For these temporary files,
disabling compression speeds up indexing significantly.

On a synthetic test that indexes stored fields and a doc value field
populated with random values that is used for index sorting, this
resulted in a 3x indexing speedup.

> SortingStoredFieldsConsumer should use a format that has better random-access
> -----------------------------------------------------------------------------
>
>                 Key: LUCENE-9510
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9510
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> We noticed some indexing rate regressions in Elasticsearch after upgrading to 
> a new Lucene snapshot. This is due to the fact that 
> SortingStoredFieldsConsumer is using the default codec to write stored fields 
> on flush. Compression doesn't matter much for this case since these are 
> temporary files that get removed on flush after the segment is sorted anyway 
> so we could switch to a format that has faster random access.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to