Na Yang created HIVE-7651:
-----------------------------

Summary: Investigate why union two RDDs generated from two MapTrans does not get the right result
Key: HIVE-7651
URL: https://issues.apache.org/jira/browse/HIVE-7651
Project: Hive
Issue Type: Bug
Components: Spark
Reporter: Na Yang

If the SparkWork has two map works as roots, the current generate(basework) API is used to generate two MapTrans. The union of the RDDs processed by the two MapTrans does not produce the correct result.

If the two input RDDs come from different data tables, the union result is empty. If the two input RDDs come from the same data table, the union result is incorrect: the same row of data appears 4 times in the union result.

Need to investigate why this happens and how to fix it.
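
For reference while investigating, here is a minimal plain-Java sketch of the result one would expect, assuming the union is meant to behave like UNION ALL (simple concatenation that keeps duplicates). It does not use Spark or Hive classes; the class name UnionExpectation, the helper unionAll, and the sample rows are hypothetical and only model the expected row counts.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class UnionExpectation {

    // Hypothetical helper (not a Hive or Spark API): models UNION ALL as
    // plain concatenation of the two inputs, keeping duplicate rows.
    public static List<String> unionAll(List<String> left, List<String> right) {
        List<String> out = new ArrayList<>(left.size() + right.size());
        out.addAll(left);
        out.addAll(right);
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample rows standing in for the output of the two map works.
        List<String> tableA = Arrays.asList("1,foo", "2,bar");
        List<String> tableB = Arrays.asList("3,baz");

        // Different tables: the union should hold every row from both inputs,
        // so it should not be empty.
        List<String> differentTables = unionAll(tableA, tableB);
        System.out.println(differentTables.size()); // prints 3

        // Same table twice: each row should appear exactly twice, not 4 times.
        List<String> sameTable = unionAll(tableA, tableA);
        System.out.println(Collections.frequency(sameTable, "1,foo")); // prints 2
    }
}
```

Under that assumption, an empty result for different tables means rows are being lost, and 4 copies for the same table means rows are being duplicated somewhere before or during the union.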