EBernhardson has uploaded a new change for review.

  https://gerrit.wikimedia.org/r/266556

Change subject: Add field-name and noop-within parameters to transfer_to_es 
workflow
......................................................................

Add field-name and noop-within parameters to transfer_to_es workflow

These values will be needed to update the correct properties. Also
changed default executors back to 3, it turns out the bump in
latencies wasn't caused by indexing and was only a momentary blip.
Letting the indexing run overnight showed no appreciable increase
in search latency.

Change-Id: I26195de6acedb020a00c3691d5672c5385738e7f
---
M oozie/transfer_to_es/bundle.properties
M oozie/transfer_to_es/workflow.xml
2 files changed, 5 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.wikimedia.org:29418/wikimedia/discovery/analytics 
refs/changes/56/266556/1

diff --git a/oozie/transfer_to_es/bundle.properties 
b/oozie/transfer_to_es/bundle.properties
index 8fe3dc9..8c9286e 100644
--- a/oozie/transfer_to_es/bundle.properties
+++ b/oozie/transfer_to_es/bundle.properties
@@ -58,7 +58,7 @@
 # when spark alocates this resource. This partially controls the amount
 # of concurrency between the hadoop cluster and the elasticsearch cluster
 # during export.
-spark_number_executors            = 1
+spark_number_executors            = 3
 spark_executor_memory             = 1G
 spark_driver_memory               = 1G
 
diff --git a/oozie/transfer_to_es/workflow.xml 
b/oozie/transfer_to_es/workflow.xml
index 43e967f..66ded00 100644
--- a/oozie/transfer_to_es/workflow.xml
+++ b/oozie/transfer_to_es/workflow.xml
@@ -132,6 +132,10 @@
             <arg>${transfer_to_es_batch_size}</arg>
             <arg>--hostmap</arg>
             <arg>${transfer_to_es_hostmap_file}</arg>
+                       <arg>--noop-within</arg>
+                       <arg>20</arg>
+                       <arg>--field-name</arg>
+                       <arg>popularity_score</arg>
         </spark>
 
         <ok to="end"/>

-- 
To view, visit https://gerrit.wikimedia.org/r/266556
To unsubscribe, visit https://gerrit.wikimedia.org/r/settings

Gerrit-MessageType: newchange
Gerrit-Change-Id: I26195de6acedb020a00c3691d5672c5385738e7f
Gerrit-PatchSet: 1
Gerrit-Project: wikimedia/discovery/analytics
Gerrit-Branch: master
Gerrit-Owner: EBernhardson <ebernhard...@wikimedia.org>

_______________________________________________
MediaWiki-commits mailing list
MediaWiki-commits@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-commits

Reply via email to