[ https://issues.apache.org/jira/browse/IMPALA-2564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mostafa Mokhtar reassigned IMPALA-2564: --------------------------------------- Assignee: (was: Mostafa Mokhtar) > Introduce mechanism to limit query fan-out > ------------------------------------------ > > Key: IMPALA-2564 > URL: https://issues.apache.org/jira/browse/IMPALA-2564 > Project: IMPALA > Issue Type: New Feature > Components: Distributed Exec > Affects Versions: Impala 2.2 > Reporter: Mostafa Mokhtar > Priority: Minor > Labels: customer, performance, scalability > > The target use case is small queries on large clusters. > Today Impala schedules queries on all Impalad instances regardless of how > much data each Impalad would read, this results in spreading the work too > thin between nodes and exposes undesired scalability issues. > The proposal is to introduce a parameter that controls the Min/Max amount of > data read by a single Impala instance. > The SimpleScheduler would combine several splits together in order to satisfy > the Min size requirements for a single Impalad before moving on the to the > next node. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org