Hi,
I'm not sure weather this belongs in the hive-dev or hive-user.
I have a folder with many small files.
I would like to reduce the number of files the way hive merges output .
I tried to understand from the source of 
org.apache.hadoop.hive.ql.optimizer.GenMRFileSink1 how to leverage the API to 
submit a job 
that merges output files.
I think I was able to identify:  
  private void createMergeJob(FileSinkOperator fsOp, GenMRProcContext ctx, 
String finalName)
      throws SemanticException 
As the entry point to the logic that performs the operation, but I did not find 
documentation as to how to use it

Is there an example that simulates the use of this API call?



                                          

Reply via email to