add option to let hive automatically run in local mode based on tunable
heuristics
----------------------------------------------------------------------------------
Key: HIVE-1408
URL: https://issues.apache.org/jira/browse/HIVE-1408
Project: Hadoop Hive
Issue Type: New Feature
Components: Query Processor
Reporter: Joydeep Sen Sarma
Assignee: Joydeep Sen Sarma
as a followup to HIVE-543 - we should have a simple option (enabled by default)
to let hive run in local mode if possible.
two levels of options are desirable:
1. hive.exec.mode.local.auto=true/false // control whether local mode is
automatically chosen
2. Options to control different heuristics, some naiive examples:
hive.exec.mode.local.auto.input.size.max=1G // don't choose local mode if
data > 1G
hive.exec.mode.local.auto.script.enable=true/false // choose if local mode
is enabled for queries with user scripts
this can be implemented as a pre/post execution hook. It makes sense to provide
this as a standard hook in the hive codebase since it's likely to improve
response time for many users (especially for test queries).
the initial proposal is to choose this at a query level and not at per
hive-task (ie. hadoop job) level. per job-level requires more changes to
compilation (to not pre-commit to hdfs or local scratch directories at compile
time).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.