Hello,

hive> FROM (
    > FROM subset
    > MAP subset.patient_mrn, subset.encounter_date
    > USING 'q1.txt'
    > AS mp1, mp2
    > CLUSTER BY mp1) map_output
    > INSERT OVERWRITE TABLE t3
    > REDUCE map_output.mp1
    > USING 'retrieve'
    > AS reducef1;
Total MapReduce jobs = 1
Launching Job 1 out of 1
Number of reduce tasks not specified. Estimated from input data size: 1
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapred.reduce.tasks=<number>
Starting Job = job_201112281627_0100, Tracking URL = http://localhost:50030/jobdetails.jsp?jobid=job_201112281627_0100
Kill Command = /home/hadoop/hadoop-0.20.2-cdh3u2//bin/hadoop job -Dmapred.job.tracker=localhost:54311 -kill job_201112281627_0100
2011-12-31 04:34:52,208 Stage-1 map = 0%, reduce = 0%
2011-12-31 04:35:52,939 Stage-1 map = 0%, reduce = 0%
2011-12-31 04:36:34,097 Stage-1 map = 100%, reduce = 100%
Ended Job = job_201112281627_0100 with errors
FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
hive>

In 'q1.txt' I have written a Hive query that returns 2 columns, and in 'retrieve' I have written Java code that takes 2 inputs and displays 1 column.

Is there any mistake in the query? Please suggest a solution.

--
Regards,
Bhavesh Shah