[ https://issues.apache.org/jira/browse/BEAM-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16417017#comment-16417017 ]
Tim Robertson edited comment on BEAM-3820 at 3/28/18 9:57 AM: -------------------------------------------------------------- Thanks [~echauchot] - understood and I would propose documenting it as such. I'll pick this up after the [BEAM-3848 pull request|https://github.com/apache/beam/pull/4905] completes as I believe it should be considered for inclusion (unless it is decided not to do it). My own setup currently uses batch sizes of {{1000 x #SOLR_SERVERS}} which yielded 26% increase in total throughout on production, and {{100}} on dev cluster which improved stability. was (Author: timrobertson100): Thanks [~echauchot] - understod and I would propose documenting it as such. I'll pick this up after the [BEAM-3848 pull request|https://github.com/apache/beam/pull/4905] completes as I believe it should be considered for inclusion. My own setup currently uses batch sizes of {{1000 x #SOLR_SERVERS}} which yielded 26% increase in total throughout on production, and {{100}} on dev cluster which improved stability. > SolrIO: Allow changing batchSize for writes > ------------------------------------------- > > Key: BEAM-3820 > URL: https://issues.apache.org/jira/browse/BEAM-3820 > Project: Beam > Issue Type: Improvement > Components: io-java-solr > Affects Versions: 2.2.0, 2.3.0 > Reporter: Tim Robertson > Assignee: Tim Robertson > Priority: Trivial > > The SolrIO hard codes the batchSize for writes at 1000. It would be a good > addition to allow the user to set the batchSize explicitly (similar to the > ElasticsearchIO) -- This message was sent by Atlassian JIRA (v7.6.3#76005)