Rahul Goswami created SOLR-16838:
------------------------------------

             Summary: Atomic updates too slow in Solr 8 vs Solr 7
                 Key: SOLR-16838
                 URL: https://issues.apache.org/jira/browse/SOLR-16838
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: SearchComponents - other
    Affects Versions: 8.11.1
            Reporter: Rahul Goswami


We started seeing slow updates in production after upgrading from 
Solr 7.7.2 to 8.11.1. Comparing the two versions, indexing 20 million docs 
via atomic updates through the same client program (15 parallel threads, 
batches of 1000) takes:
 
Solr 7: 78 mins
Solr 8: 370 mins
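
For scale, these timings work out to roughly 4,270 docs/s on Solr 7 versus 
about 900 docs/s on Solr 8, a slowdown of about 4.7x. A minimal sketch of 
that arithmetic, using only the figures quoted above (the function and 
variable names are illustrative):

```python
# Figures taken from the report: 20 million docs,
# 78 minutes on Solr 7.7.2 and 370 minutes on Solr 8.11.1.
TOTAL_DOCS = 20_000_000
DURATION_MIN = {"Solr 7.7.2": 78, "Solr 8.11.1": 370}


def docs_per_second(total_docs, minutes):
    """Average indexing throughput over the whole run."""
    return total_docs / (minutes * 60)


rates = {version: docs_per_second(TOTAL_DOCS, minutes)
         for version, minutes in DURATION_MIN.items()}
slowdown = DURATION_MIN["Solr 8.11.1"] / DURATION_MIN["Solr 7.7.2"]

for version, rate in rates.items():
    print(f"{version}: {rate:,.0f} docs/s")
print(f"Slowdown: {slowdown:.1f}x")
```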
 
Environment details:
- Java 11 on Windows server
- JVM heap: -Xms1536m -Xmx3072m
- Indexing client code running 15 parallel threads indexing in batches of 1000
- Using SimpleFSDirectoryFactory (since MMap doesn't work well on Windows 
for our index sizes, which commonly run north of 1 TB)
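
For readers without access to the shared client code, the shape of the 
workload can be sketched as follows. This is not the actual indexing 
program: the field name `status_s`, the uniqueKey `id`, and the 
`send_batch` stub are assumptions for illustration. Only the batch size, 
the thread count, and the `{"set": value}` atomic-update modifier syntax 
come from the report and from Solr's JSON update format.

```python
import json
from concurrent.futures import ThreadPoolExecutor

BATCH_SIZE = 1000   # from the report
NUM_THREADS = 15    # from the report


def atomic_update(doc_id, fields):
    """Build one atomic-update doc: each field is wrapped in a 'set' modifier."""
    doc = {"id": doc_id}  # assumes the uniqueKey field is named "id"
    for name, value in fields.items():
        doc[name] = {"set": value}
    return doc


def batches(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def send_batch(batch):
    """Stub standing in for an HTTP POST to the collection's update handler.

    It only serializes the payload and returns the number of docs in it,
    so the sketch runs without a Solr instance.
    """
    payload = json.dumps(batch)
    return len(json.loads(payload))


def index_all(docs):
    """Send all batches using a fixed pool of worker threads."""
    with ThreadPoolExecutor(max_workers=NUM_THREADS) as pool:
        return sum(pool.map(send_batch, batches(docs, BATCH_SIZE)))


# Hypothetical data: 2500 docs, each updating a single made-up field.
docs = [atomic_update(f"doc-{i}", {"status_s": "processed"}) for i in range(2500)]
sent = index_all(docs)
print(f"sent {sent} docs in {len(list(batches(docs, BATCH_SIZE)))} batches")
```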
 
Looking at the thread dumps, the bottleneck seems to be RealTimeGet, and I 
can see that Solr 7 takes a different code path than Solr 8. Note that 
regular (non-atomic) updates still perform well on Solr 8, completing in 
< 1 hour for the same 20 million document data set.
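
As background on why RealTimeGet matters only for atomic updates: an 
atomic update carries just the changed fields, so the server has to look 
up the current version of the document, merge the changes, and reindex 
the full document. A regular update skips the lookup. The following is a 
conceptual model of that read-modify-write cycle, not Solr's 
implementation; the in-memory `store` dict and all field names are 
hypothetical.

```python
def apply_atomic_update(store, update, id_field="id"):
    """Merge an atomic update into the stored doc and return the full result."""
    doc_id = update[id_field]
    # The lookup step: fetch the latest version of the document.
    current = dict(store.get(doc_id, {id_field: doc_id}))

    for field, change in update.items():
        if field == id_field:
            continue
        if not isinstance(change, dict):
            current[field] = change  # plain value: overwrite the field
            continue
        for op, value in change.items():
            if op == "set":
                current[field] = value
            elif op == "inc":
                current[field] = current.get(field, 0) + value
            elif op == "add":
                existing = current.get(field, [])
                if not isinstance(existing, list):
                    existing = [existing]
                new_values = value if isinstance(value, list) else [value]
                current[field] = existing + new_values
            else:
                raise ValueError(f"unsupported modifier in this sketch: {op}")

    store[doc_id] = current  # the whole merged document is written back
    return current


store = {"doc-1": {"id": "doc-1", "title_s": "old", "views_i": 3}}
result = apply_atomic_update(
    store,
    {"id": "doc-1", "title_s": {"set": "new"},
     "views_i": {"inc": 2}, "tags_ss": {"add": "a"}},
)
print(result)
```

In this model every atomic update pays for one lookup per document, which 
is consistent with a slowdown that shows up for atomic updates but not for 
regular ones.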
 
Sharing the indexing code, solrconfig, schema and thread dumps in the link 
below:
[https://drive.google.com/drive/folders/1q2DPNTYQEU6fi3NeXIKJhaoq3KPnms0h?usp=sharing]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

