[09/25] drill git commit: config options

tshiran Mon, 11 May 2015 22:57:12 -0700

config options


Project: http://git-wip-us.apache.org/repos/asf/drill/repo
Commit: http://git-wip-us.apache.org/repos/asf/drill/commit/1ad5dbde
Tree: http://git-wip-us.apache.org/repos/asf/drill/tree/1ad5dbde
Diff: http://git-wip-us.apache.org/repos/asf/drill/diff/1ad5dbde

Branch: refs/heads/gh-pages
Commit: 1ad5dbde38cfa53bddb1f753051357bd4afdcb71
Parents: b65acbb
Author: Kristine Hahn <kh...@maprtech.com>
Authored: Fri May 8 14:56:26 2015 -0700
Committer: Kristine Hahn <kh...@maprtech.com>
Committed: Fri May 8 14:56:26 2015 -0700

----------------------------------------------------------------------
 .../010-configuration-options-introduction.md   | 29 +++++++++++---------
 1 file changed, 16 insertions(+), 13 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/drill/blob/1ad5dbde/_docs/configure-drill/configuration-options/010-configuration-options-introduction.md
----------------------------------------------------------------------
diff --git 
a/_docs/configure-drill/configuration-options/010-configuration-options-introduction.md
 
b/_docs/configure-drill/configuration-options/010-configuration-options-introduction.md
index 0298006..e2b184c 100644
--- 
a/_docs/configure-drill/configuration-options/010-configuration-options-introduction.md
+++ 
b/_docs/configure-drill/configuration-options/010-configuration-options-introduction.md
@@ -144,7 +144,7 @@ The sys.options table lists the following options that you 
can set at the sessio
   <tr>
     <td>drill.exec.functions.cast_empty_string_to_null</td>
     <td>FALSE</td>
-    <td></td>
+    <td>Not supported in this release.</td>
   </tr>
   <tr>
     <td>drill.exec.storage.file.partition.column.label</td>
@@ -189,22 +189,23 @@ The sys.options table lists the following options that 
you can set at the sessio
   <tr>
     <td>exec.queue.large</td>
     <td>10</td>
-    <td>Range: 0-1000</td>
+    <td>Sets the number of large queries that can run concurrently in the 
cluster.
+Range: 0-1000</td>
   </tr>
   <tr>
     <td>exec.queue.small</td>
     <td>100</td>
-    <td>Range: 0-1001</td>
+    <td>Sets the number of small queries that can run concurrently in the 
cluster. Range: 0-1001</td>
   </tr>
   <tr>
     <td>exec.queue.threshold</td>
     <td>30000000</td>
-    <td>Range: 0-9223372036854775807</td>
+    <td>Sets the cost threshold, which depends on the complexity of the 
queries in queue, for determining whether query is large or small. Complex 
queries have higher thresholds. Range: 0-9223372036854775807</td>
   </tr>
   <tr>
     <td>exec.queue.timeout_millis</td>
     <td>300000</td>
-    <td>Range: 0-9223372036854775807</td>
+    <td>Indicates how long a query can wait in queue before the query fails. 
Range: 0-9223372036854775807</td>
   </tr>
   <tr>
     <td>planner.add_producer_consumer</td>
@@ -224,7 +225,7 @@ The sys.options table lists the following options that you 
can set at the sessio
   <tr>
     <td>planner.broadcast_threshold</td>
     <td>10000000</td>
-    <td>Threshold in number of rows that triggers a broadcast join for a query 
if the right side of the join contains fewer rows than the threshold. Avoids 
broadcasting too many rows to join. Range: 0-2147483647</td>
+    <td>The maximum number of records allowed to be broadcast as part of a 
query. After one million records, Drill reshuffles data rather than doing a 
broadcast to one side of the join. Range: 0-2147483647</td>
   </tr>
   <tr>
     <td>planner.disable_exchanges</td>
@@ -254,7 +255,7 @@ The sys.options table lists the following options that you 
can set at the sessio
   <tr>
     <td>planner.enable_hashjoin</td>
     <td>TRUE</td>
-    <td>Enable the memory hungry hash join. Does not write to disk.</td>
+    <td>Enable the memory hungry hash join. Drill assumes that a query with 
have adequate memory to complete and tries to use the fastest operations 
possible to complete the planned inner, left, right, or full outer joins using 
a hash table. Does not write to disk. Disabling hash join allows Drill to 
manage arbitrarily large data in a small memory footprint.</td>
   </tr>
   <tr>
     <td>planner.enable_hashjoin_swap</td>
@@ -264,12 +265,13 @@ The sys.options table lists the following options that 
you can set at the sessio
   <tr>
     <td>planner.enable_mergejoin</td>
     <td>TRUE</td>
-    <td>Sort-based operation. Writes to disk.</td>
+    <td>Sort-based operation. A merge join is used for inner join, left and 
right outer joins.  Inputs to the merge join must be sorted. It reads the 
sorted input streams from both sides and finds matching rows. Writes to 
disk.</td>
   </tr>
   <tr>
     <td>planner.enable_multiphase_agg</td>
     <td>TRUE</td>
-    <td></td>
+    <td>Each minor fragment does a local aggregation in phase 1, distributes 
on a hash basis using GROUP-BY keys partially aggregated results to other 
fragments, and all the fragments perform a total aggregation using this data.  
+ </td>
   </tr>
   <tr>
     <td>planner.enable_mux_exchange</td>
@@ -304,7 +306,8 @@ The sys.options table lists the following options that you 
can set at the sessio
   <tr>
     <td>planner.memory.enable_memory_estimation</td>
     <td>FALSE</td>
-    <td></td>
+    <td>Toggles the state of memory estimation and re-planning of the query.
+When enabled, Drill conservatively estimates memory requirements and typically 
excludes these operators from the plan and negatively impacts performance.</td>
   </tr>
   <tr>
     <td>planner.memory.hash_agg_table_factor</td>
@@ -319,7 +322,7 @@ The sys.options table lists the following options that you 
can set at the sessio
   <tr>
     <td>planner.memory.max_query_memory_per_node</td>
     <td>2147483648</td>
-    <td></td>
+    <td>Sets the maximum estimate of memory for a query per node. If the 
estimate is too low, Drill re-plans the query without memory-constrained 
operators.</td>
   </tr>
   <tr>
     <td>planner.memory.non_blocking_operators_memory</td>
@@ -329,7 +332,7 @@ The sys.options table lists the following options that you 
can set at the sessio
   <tr>
     <td>planner.partitioner_sender_max_threads</td>
     <td>8</td>
-    <td></td>
+    <td>Upper limit of threads for outbound queuing.</td>
   </tr>
   <tr>
     <td>planner.partitioner_sender_set_threads</td>
@@ -354,7 +357,7 @@ The sys.options table lists the following options that you 
can set at the sessio
   <tr>
     <td>planner.width.max_per_node</td>
     <td>3</td>
-    <td>The maximum degree of distribution of a query across cores and cluster 
nodes.</td>
+    <td>Maximum number of threads that can run in parallel for a query on a 
node. A slice is an individual thread. This number indicates the maximum number 
of slices per query for the queryâs major fragment on a node.</td>
   </tr>
   <tr>
     <td>planner.width.max_per_query</td>

[09/25] drill git commit: config options

Reply via email to