[ https://issues.apache.org/jira/browse/HIVE-474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12719368#action_12719368 ]
Min Zhou edited comment on HIVE-474 at 6/14/09 7:02 PM:
--------------------------------------------------------

I think there is another special case here. If the query has multiple distinct operations on the same column, we can push the evaluation of those expressions down into the reducers.

{code}
Query:
  select a,
         count(distinct if(condition, b, null)) as col1,
         count(distinct if(!condition, null, b)) as col2,
         count(distinct b) as col3

Plan:
  Job:
    Map side:
      Emit: distribution_key: a, sort_key: a, b, value: nothing
    Reduce side:
      Group By a, count col1, col2, col3 by evaluating their expressions
{code}

      was (Author: coderplay):
    I think there is another special case here. If the query has multiple distinct operations on the same column, we can push the evaluation of those expressions down into the reducers.

Query:
  select a,
         count(distinct if(condition, b, null)) as col1,
         count(distinct if(!condition, null, b)) as col2,
         count(distinct b) as col3

Plan:
  Job:
    Map side:
      Emit: distribution_key: a, sort_key: a, b, value: nothing
    Reduce side:
      Group By a, count col1, col2, col3 by evaluating their expressions

> Support for distinct selection on two or more columns
> -----------------------------------------------------
>
>                 Key: HIVE-474
>                 URL: https://issues.apache.org/jira/browse/HIVE-474
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Alexis Rondeau
>
> The ability to select distinct several, individual columns as by example:
> select count(distinct user), count(distinct session) from actions;
> Currently returns the following failure:
> FAILED: Error in semantic analysis: line 2:7 DISTINCT on Different Columns not Supported user
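
The single-job plan proposed in the comment can be illustrated with a small simulation. This is only a sketch in plain Python, not Hive code: the names `condition`, `EXPRESSIONS`, `map_side`, `shuffle`, and `reduce_side` are hypothetical, the predicate is assumed to depend only on `b`, and `b` is assumed to be non-null in the sample data. Because rows reach the reducer sorted on (a, b), each distinct `b` is contiguous within its group, so every expression can be evaluated once per distinct `b` without keeping a set in memory.

```python
from itertools import groupby
from operator import itemgetter


def condition(b):
    # Hypothetical predicate standing in for "condition"; assumed to depend only on b.
    return b % 2 == 0


# The three distinct expressions from the query, all over the same column b.
EXPRESSIONS = {
    "col1": lambda b: b if condition(b) else None,      # if(condition, b, null)
    "col2": lambda b: None if not condition(b) else b,  # if(!condition, null, b)
    "col3": lambda b: b,                                # b
}


def map_side(rows):
    # Emit the composite key (a, b) and no value, as in the plan.
    for a, b in rows:
        yield (a, b)


def shuffle(keys):
    # Stand-in for the shuffle: a full sort on (a, b).
    # In a real job, a alone would be the distribution key.
    return sorted(keys)


def reduce_side(sorted_keys):
    result = {}
    for a, group in groupby(sorted_keys, key=itemgetter(0)):
        counts = dict.fromkeys(EXPRESSIONS, 0)
        previous_b = object()  # sentinel that equals no real value
        for _, b in group:
            if b == previous_b:
                continue  # duplicate b; sorted input keeps duplicates adjacent
            previous_b = b
            for name, expr in EXPRESSIONS.items():
                # count(distinct ...) ignores nulls
                if expr(b) is not None:
                    counts[name] += 1
        result[a] = counts
    return result


rows = [("x", 1), ("x", 2), ("x", 2), ("x", 4), ("y", 3), ("y", 3)]
result = reduce_side(shuffle(map_side(rows)))
print(result)
```

With this sample input, group `x` yields col1 = 2, col2 = 2, col3 = 3 and group `y` yields col1 = 0, col2 = 0, col3 = 1. Note that the col1 and col2 expressions, as written in the comment, are logically equivalent, so they always produce the same count.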