Thank you for your reply.

Could you tell me why it is slower when both parameters are true, and how
much slower it is?

2010/8/10 Namit Jain <nj...@facebook.com>

> Yes, it will try to run another map-reduce job to merge the files
> ________________________________________
> From: lei liu [liulei...@gmail.com]
> Sent: Monday, August 09, 2010 8:57 AM
> To: hive-user@hadoop.apache.org
> Subject: Re: How to merge small files
>
> Could you tell me whether the query is slower if both parameters are
> true?
>
> 2010/8/9 Namit Jain <nj...@facebook.com>
> That's right
>
> ________________________________________
> From: lei liu [liulei...@gmail.com]
> Sent: Sunday, August 08, 2010 7:18 PM
> To: hive-user@hadoop.apache.org
> Subject: Re: How to merge small files
>
> Thank you for your reply.
>
> You mean I should execute the statements below:
>
> statement.execute("set hive.merge.mapfiles=true");
> statement.execute("set hive.merge.mapredfiles=true");
>
> Both parameters should be true, right?
>
> 2010/8/6 Namit Jain <nj...@facebook.com>
>   HIVEMERGEMAPFILES("hive.merge.mapfiles", true),
>   HIVEMERGEMAPREDFILES("hive.merge.mapredfiles", false),
>
>
> Set the above parameters to true before your query.
>
>
>
> ________________________________________
> From: lei liu [liulei...@gmail.com]
> Sent: Thursday, August 05, 2010 8:47 PM
> To: hive-user@hadoop.apache.org
> Subject: How to merge small files
>
> When I run the following SQL, INSERT OVERWRITE TABLE tablename1 select_statement1
> FROM from_statement, many zero-size files are written to
> Hadoop.
>
> How can I merge these small files?
>
> Thanks,
>
>
>
> LiuLei
>
>
>
>
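
The approach discussed in this thread could be sketched in Java roughly as follows. This is only an illustration, assuming a java.sql.Statement that is already connected to Hive; the class name MergeSmallFiles and the helpers mergeSettings and runWithMerge are hypothetical and do not come from Hive itself. Only the two property names are taken from the thread.

```java
import java.sql.SQLException;
import java.sql.Statement;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper class, for illustration only.
class MergeSmallFiles {

    // The two settings named in this thread, written as Hive "set" commands.
    static List<String> mergeSettings() {
        return Arrays.asList(
            "set hive.merge.mapfiles=true",
            "set hive.merge.mapredfiles=true");
    }

    // Issues the settings on the given statement first, then runs the query
    // on the same statement so the settings are in effect for it.
    static void runWithMerge(Statement statement, String query) throws SQLException {
        for (String setting : mergeSettings()) {
            statement.execute(setting);
        }
        statement.execute(query);
    }
}
```

As noted in the reply above, enabling the merge makes Hive try to run another map-reduce job to merge the files, so the query may take longer; how much longer is not stated in this thread.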
