Hello,
I need your help with one query. It always ends with a reducer
timeout in YARN.
I tried increasing the timeout to 30 min, but that is still not enough and
the progress does not move at all.
Here is the query:
INSERT INTO TABLE TMP2
SELECT a.rn
      ,MAX(a.date_report_end) OVER (
           PARTITION BY a.field1
           ORDER BY a.date_report_start DESC, a.date_report_end ASC
           ROWS BETWEEN 1 FOLLOWING AND UNBOUNDED FOLLOWING
       ) AS max_report_end_prev
FROM TMP1 a;
The table TMP1 contains 22180264 rows.
I tried setting the reduce task count to 128, but at the end only 4 reducers are still running.
Any ideas on how to rewrite this query so that it finishes in a reasonable time? I am using Hive v0.12.
Thanks
JV