In HIVE TEZ job 15 task ran for reducer. Due to data skew 2 tasks are running for very long compared to others.
Looking at the counters REDUCE_INPUT_GROUPS are almost approximately same across reducer tasks. But REDUCE_INPUT_RECORDS of the skewed tasks are like 180 times more than others. How to avoid skew to reducers. Thanks, Kiran
