[jira] Commented: (HIVE-1330) fatal error check omitted for reducer-side operators

2010-04-28 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12862106#action_12862106
 ] 

Namit Jain commented on HIVE-1330:
--

+1

looks good

> fatal error check omitted for reducer-side operators
> 
>
> Key: HIVE-1330
> URL: https://issues.apache.org/jira/browse/HIVE-1330
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1330.patch
>
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1330) fatal error check omitted for reducer-side operators

2010-04-28 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang updated HIVE-1330:
-

Attachment: HIVE-1330.patch

> fatal error check omitted for reducer-side operators
> 
>
> Key: HIVE-1330
> URL: https://issues.apache.org/jira/browse/HIVE-1330
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1330.patch
>
>





[jira] Updated: (HIVE-1330) fatal error check omitted for reducer-side operators

2010-04-28 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang updated HIVE-1330:
-

   Status: Patch Available  (was: Open)
Affects Version/s: 0.6.0
Fix Version/s: 0.6.0

> fatal error check omitted for reducer-side operators
> 
>
> Key: HIVE-1330
> URL: https://issues.apache.org/jira/browse/HIVE-1330
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1330.patch
>
>





[jira] Assigned: (HIVE-1330) fatal error check omitted for reducer-side operators

2010-04-28 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang reassigned HIVE-1330:


Assignee: Ning Zhang

> fatal error check omitted for reducer-side operators
> 
>
> Key: HIVE-1330
> URL: https://issues.apache.org/jira/browse/HIVE-1330
> Project: Hadoop Hive
>  Issue Type: Bug
>Reporter: Ning Zhang
>Assignee: Ning Zhang
>





[jira] Created: (HIVE-1330) fatal error check omitted for reducer-side operators

2010-04-28 Thread Ning Zhang (JIRA)
fatal error check omitted for reducer-side operators


 Key: HIVE-1330
 URL: https://issues.apache.org/jira/browse/HIVE-1330
 Project: Hadoop Hive
  Issue Type: Bug
Reporter: Ning Zhang







[jira] Commented: (HIVE-1328) make mapred.input.dir.recursive work for select *

2010-04-28 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12862077#action_12862077
 ] 

John Sichi commented on HIVE-1328:
--

Hi Ed,

This is not a new feature--this is an inconsistency in an existing feature when 
a particular Hadoop parameter is enabled (it should not matter whether you use 
select * vs a more complex select, you should get the same results).

In general, prioritization is driven by a number of factors such as the overall 
project roadmap, quality, and the use cases which the developer wants or needs 
to make work (this one happens to be important for Facebook, which is why I'm 
working on it at the moment); if the ones you mention are high priority for 
you, please submit patches for them so we can get them resolved.

Regardless of that, thanks for all the bug reports that you have 
submitted--they're very valuable in themselves, and we want to get them all 
fixed too.


> make mapred.input.dir.recursive work for select *
> -
>
> Key: HIVE-1328
> URL: https://issues.apache.org/jira/browse/HIVE-1328
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Affects Versions: 0.6.0
>Reporter: John Sichi
>Assignee: John Sichi
> Fix For: 0.6.0
>
>
> For the script below, we would like the behavior from MAPREDUCE-1501 to apply 
> so that the select * returns two rows instead of none.
> create table fact_daily(x int)
> partitioned by (ds string);
> create table fact_tz(x int)
> partitioned by (ds string, hr string, gmtoffset string);
> alter table fact_tz 
> add partition (ds='2010-01-03', hr='1', gmtoffset='-8');
> insert overwrite table fact_tz
> partition (ds='2010-01-03', hr='1', gmtoffset='-8')
> select key+11 from src where key=484;
> alter table fact_tz 
> add partition (ds='2010-01-03', hr='2', gmtoffset='-7');
> insert overwrite table fact_tz
> partition (ds='2010-01-03', hr='2', gmtoffset='-7')
> select key+12 from src where key=484;
> alter table fact_daily
> set tblproperties('EXTERNAL'='TRUE');
> alter table fact_daily
> add partition (ds='2010-01-03')
> location '/user/hive/warehouse/fact_tz/ds=2010-01-03';
> set mapred.input.dir.recursive=true;
> select * from fact_daily where ds='2010-01-03';




[jira] Commented: (HIVE-1328) make mapred.input.dir.recursive work for select *

2010-04-28 Thread Edward Capriolo (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12862074#action_12862074
 ] 

Edward Capriolo commented on HIVE-1328:
---

Can we look at HIVE-1318 and maybe HIVE-1303 first? The external partitions 
already seem to have bugs; can we get them working properly before more 
features are added?

> make mapred.input.dir.recursive work for select *
> -
>
> Key: HIVE-1328
> URL: https://issues.apache.org/jira/browse/HIVE-1328
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Affects Versions: 0.6.0
>Reporter: John Sichi
>Assignee: John Sichi
> Fix For: 0.6.0
>
>
> For the script below, we would like the behavior from MAPREDUCE-1501 to apply 
> so that the select * returns two rows instead of none.
> create table fact_daily(x int)
> partitioned by (ds string);
> create table fact_tz(x int)
> partitioned by (ds string, hr string, gmtoffset string);
> alter table fact_tz 
> add partition (ds='2010-01-03', hr='1', gmtoffset='-8');
> insert overwrite table fact_tz
> partition (ds='2010-01-03', hr='1', gmtoffset='-8')
> select key+11 from src where key=484;
> alter table fact_tz 
> add partition (ds='2010-01-03', hr='2', gmtoffset='-7');
> insert overwrite table fact_tz
> partition (ds='2010-01-03', hr='2', gmtoffset='-7')
> select key+12 from src where key=484;
> alter table fact_daily
> set tblproperties('EXTERNAL'='TRUE');
> alter table fact_daily
> add partition (ds='2010-01-03')
> location '/user/hive/warehouse/fact_tz/ds=2010-01-03';
> set mapred.input.dir.recursive=true;
> select * from fact_daily where ds='2010-01-03';




[jira] Commented: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread Carl Steinbach (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12862054#action_12862054
 ] 

Carl Steinbach commented on HIVE-1192:
--

@John: I attached a correctly formatted version of the md5 checksum file for 
the 0.20.1 tarball.

> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
> Attachments: hadoop-0.20.1.tar.gz.md5
>
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).
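
The major.minor mapping suggested in the second bullet could look roughly like the sketch below. This is only an illustration: the helper name tarball_name is hypothetical, and the hadoop-<major>.<minor>.tar.gz pattern is taken from the example in the issue text, not from the actual Hive build files.

```python
def tarball_name(hadoop_version: str) -> str:
    """Return a major.minor tarball name for a full Hadoop version string.

    Hypothetical helper: the naming pattern follows the example in the
    issue description ("0.20.0 and 0.20.1 would both map to
    hadoop-0.20.tar.gz"); it is not read from the Hive build.
    """
    parts = hadoop_version.strip().split(".")
    # Require at least major.minor and reject anything non-numeric.
    if len(parts) < 2 or not all(p.isdigit() for p in parts):
        raise ValueError(f"expected a dotted numeric version, got {hadoop_version!r}")
    major, minor = parts[0], parts[1]
    # Patch-level components (third and later fields) are deliberately dropped.
    return f"hadoop-{major}.{minor}.tar.gz"
```

With this mapping, any patch release within the same major.minor line resolves to the same archive, so a missing patch-level tarball on the mirror would no longer break the download step.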




[jira] Updated: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread Carl Steinbach (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-1192:
-

Attachment: hadoop-0.20.1.tar.gz.md5

> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
> Attachments: hadoop-0.20.1.tar.gz.md5
>
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).




[jira] Commented: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12862033#action_12862033
 ] 

John Sichi commented on HIVE-1192:
--

If someone regenerates the correctly formatted .md5 file and tests it, I can 
get the bad one replaced on mirror.facebook.net.


> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).




[jira] Commented: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread Carl Steinbach (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12862005#action_12862005
 ] 

Carl Steinbach commented on HIVE-1192:
--

John/Bill: Ivy can't download the 0.20.1 tarball off of archive.apache.org 
because the associated md5 checksum file does not adhere to convention, i.e. 
it was not created by running 'md5sum' on the tarball.

For example, compare the following two files:
http://mirror.facebook.net/facebook/hive-deps/hadoop/core/hadoop-0.20.0/hadoop-0.20.0.tar.gz.md5
http://mirror.facebook.net/facebook/hive-deps/hadoop/core/hadoop-0.20.1/hadoop-0.20.1.tar.gz.md5

The checksum file for 0.20.0 contains the output of md5sum. The checksum file 
for 0.20.1 contains a bunch of checksums generated using some other tool. Ivy 
barfs when it tries to locate the md5 checksum in this mess.
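
For anyone regenerating the checksum file, here is a minimal sketch of what an md5sum-style line looks like and how a strict reader might parse it. The function names are hypothetical, and the parser only models the conventional md5sum layout (32 hex digits, a space, then a space or '*', then the file name); it is not Ivy's actual parsing logic.

```python
import hashlib


def md5_hex(path: str) -> str:
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def md5sum_line(path: str, name: str) -> str:
    """Format a checksum the way md5sum prints it in text mode."""
    return f"{md5_hex(path)}  {name}"


def parse_md5sum_line(line: str):
    """Split an md5sum-style line into (digest, file name).

    Raises ValueError for anything that does not follow the layout,
    which is roughly how a strict consumer would react to a checksum
    file produced by a different tool.
    """
    digest, sep, rest = line.rstrip("\n").partition(" ")
    is_hex = len(digest) == 32 and all(c in "0123456789abcdefABCDEF" for c in digest)
    if not sep or not is_hex or not rest or rest[0] not in " *":
        raise ValueError(f"not an md5sum-style line: {line!r}")
    # rest[0] is the mode marker (' ' for text, '*' for binary); skip it.
    return digest.lower(), rest[1:]
```

A file whose lines carry labels or several digests in another layout fails the parse step immediately, which is consistent with the failure described above.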


> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).




[jira] Commented: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861996#action_12861996
 ] 

John Sichi commented on HIVE-1192:
--

0.20.1 is there now...Bill, can you try it?


> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).




[jira] Created: (HIVE-1329) for ALTER TABLE t SET TBLPROPERTIES ('EXTERNAL'='TRUE'), change TBL_TYPE attribute from MANAGED_TABLE to EXTERNAL_TABLE

2010-04-28 Thread John Sichi (JIRA)
for ALTER TABLE t SET TBLPROPERTIES ('EXTERNAL'='TRUE'), change TBL_TYPE 
attribute from MANAGED_TABLE to EXTERNAL_TABLE
---

 Key: HIVE-1329
 URL: https://issues.apache.org/jira/browse/HIVE-1329
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Metastore
Affects Versions: 0.6.0
Reporter: John Sichi
 Fix For: 0.6.0


Currently they are left inconsistent.





[jira] Commented: (HIVE-1329) for ALTER TABLE t SET TBLPROPERTIES ('EXTERNAL'='TRUE'), change TBL_TYPE attribute from MANAGED_TABLE to EXTERNAL_TABLE

2010-04-28 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861959#action_12861959
 ] 

John Sichi commented on HIVE-1329:
--

And vice versa for setting 'EXTERNAL' = 'FALSE' (or removing the 'EXTERNAL' 
property if we allow that).
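
The consistency rule being asked for can be sketched as a single function that derives the table type from the table properties. This is a hypothetical illustration, not metastore code: the function name is made up, and treating the property value case-insensitively is an assumption.

```python
def table_type_for(tbl_properties: dict) -> str:
    """Derive TBL_TYPE from the 'EXTERNAL' table property.

    Hypothetical sketch: a missing property or any value other than
    'TRUE' (compared case-insensitively, which is an assumption) is
    treated as a managed table.
    """
    value = str(tbl_properties.get("EXTERNAL", "")).strip().upper()
    return "EXTERNAL_TABLE" if value == "TRUE" else "MANAGED_TABLE"
```

Recomputing the type from the properties on every ALTER TABLE ... SET TBLPROPERTIES would cover both directions, including the case where the 'EXTERNAL' property is removed.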


> for ALTER TABLE t SET TBLPROPERTIES ('EXTERNAL'='TRUE'), change TBL_TYPE 
> attribute from MANAGED_TABLE to EXTERNAL_TABLE
> ---
>
> Key: HIVE-1329
> URL: https://issues.apache.org/jira/browse/HIVE-1329
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Metastore
>Affects Versions: 0.6.0
>Reporter: John Sichi
> Fix For: 0.6.0
>
>
> Currently they are left inconsistent.




[jira] Created: (HIVE-1328) make mapred.input.dir.recursive work for select *

2010-04-28 Thread John Sichi (JIRA)
make mapred.input.dir.recursive work for select *
-

 Key: HIVE-1328
 URL: https://issues.apache.org/jira/browse/HIVE-1328
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.6.0
Reporter: John Sichi
Assignee: John Sichi
 Fix For: 0.6.0


For the script below, we would like the behavior from MAPREDUCE-1501 to apply 
so that the select * returns two rows instead of none.

create table fact_daily(x int)
partitioned by (ds string);

create table fact_tz(x int)
partitioned by (ds string, hr string, gmtoffset string);

alter table fact_tz 
add partition (ds='2010-01-03', hr='1', gmtoffset='-8');
insert overwrite table fact_tz
partition (ds='2010-01-03', hr='1', gmtoffset='-8')
select key+11 from src where key=484;

alter table fact_tz 
add partition (ds='2010-01-03', hr='2', gmtoffset='-7');
insert overwrite table fact_tz
partition (ds='2010-01-03', hr='2', gmtoffset='-7')
select key+12 from src where key=484;

alter table fact_daily
set tblproperties('EXTERNAL'='TRUE');

alter table fact_daily
add partition (ds='2010-01-03')
location '/user/hive/warehouse/fact_tz/ds=2010-01-03';

set mapred.input.dir.recursive=true;
select * from fact_daily where ds='2010-01-03';





[jira] Commented: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861926#action_12861926
 ] 

John Sichi commented on HIVE-1192:
--

Meanwhile, I'm pinging Facebook ops to see if we can get 0.20.1 added to 
mirror.facebook.net/facebook/hive-deps.

> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).




[jira] Commented: (HIVE-1317) CombineHiveInputFormat throws exception when partition name contains special characters to URI

2010-04-28 Thread Ning Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861925#action_12861925
 ] 

Ning Zhang commented on HIVE-1317:
--

I found a bug in my local testing; I will upload a new patch once it is fixed.

> CombineHiveInputFormat throws exception when partition name contains special 
> characters to URI
> --
>
> Key: HIVE-1317
> URL: https://issues.apache.org/jira/browse/HIVE-1317
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1317.2.patch, HIVE-1317.patch
>
>
> If a partition name contains characters such as ':' and '|', which have 
> special meaning in a URI (HDFS uses URIs internally for Path), 
> CombineHiveInputFormat throws an exception. The URI was created in 
> CombineHiveInputFormat to check whether a path belongs to a partition in 
> partitionToPathInfo. We should bypass URI creation and use plain string 
> comparisons instead.
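
The string-comparison idea in the description can be sketched as follows. This is not the code from the attached patch: the function name and the '/' separator are assumptions made for illustration. The point is that a prefix test on plain strings never has to interpret ':' or '|'.

```python
def belongs_to_partition(file_path: str, partition_dir: str, sep: str = "/") -> bool:
    """Check whether file_path lies under partition_dir using string comparison only.

    Hypothetical sketch: no URI or Path object is built, so characters
    such as ':' and '|' in the partition name are treated as ordinary text.
    """
    base = partition_dir.rstrip(sep)
    # Appending the separator prevents 'ds=1' from matching 'ds=10/...'.
    return file_path.rstrip(sep) == base or file_path.startswith(base + sep)
```

Appending the separator before comparing matters: without it, a partition directory whose name is a prefix of a sibling's name would be reported as containing the sibling's files.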




[jira] Updated: (HIVE-1317) CombineHiveInputFormat throws exception when partition name contains special characters to URI

2010-04-28 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang updated HIVE-1317:
-

Status: Open  (was: Patch Available)

> CombineHiveInputFormat throws exception when partition name contains special 
> characters to URI
> --
>
> Key: HIVE-1317
> URL: https://issues.apache.org/jira/browse/HIVE-1317
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1317.2.patch, HIVE-1317.patch
>
>
> If a partition name contains characters such as ':' and '|', which have 
> special meaning in a URI (HDFS uses URIs internally for Path), 
> CombineHiveInputFormat throws an exception. The URI was created in 
> CombineHiveInputFormat to check whether a path belongs to a partition in 
> partitionToPathInfo. We should bypass URI creation and use plain string 
> comparisons instead.




[jira] Commented: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861922#action_12861922
 ] 

John Sichi commented on HIVE-1192:
--

Hmmm, looks like the mirrors have been updated to 0.20.2.  Facebook hasn't yet 
gotten around to provisioning more archival versions under 
mirror.facebook.net/facebook/hive-deps.

If you can't use 0.20.2, you could 
* retry with archive.apache.org (sometimes it is under too much load)
* or see if you can google for a mirror which didn't get refreshed
* or else try downloading it through some other means such as wget (ivy seems 
to be very sensitive to network problems)


> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).




[jira] Commented: (HIVE-1083) allow sub-directories for an external table/partition

2010-04-28 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861920#action_12861920
 ] 

John Sichi commented on HIVE-1083:
--

Correction:  local file system is probably OK; I just realized that when I 
tested, I was using the stock hadoop 0.20 version which does not have 
MAPREDUCE-1501 in it.


> allow sub-directories for an external table/partition
> -
>
> Key: HIVE-1083
> URL: https://issues.apache.org/jira/browse/HIVE-1083
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Affects Versions: 0.6.0
>Reporter: Namit Jain
>Assignee: Zheng Shao
> Fix For: 0.6.0
>
>
> Sometimes users want to define an external table/partition based on all files 
> (recursively) inside a directory.
> Currently most of the Hadoop InputFormat classes do not support that. We 
> should extract all files recursively in the directory, and add them to the 
> input path of the job.
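
The recursive extraction described above can be illustrated with a small sketch. It uses the local file system as a stand-in for HDFS, and both function names are hypothetical; it only shows the difference between listing a directory's direct files and walking all of its sub-directories.

```python
import os
from typing import List


def direct_files_only(root: str) -> List[str]:
    """List only the files directly inside root (no descent into sub-directories)."""
    return [
        os.path.join(root, name)
        for name in sorted(os.listdir(root))
        if os.path.isfile(os.path.join(root, name))
    ]


def list_files_recursively(root: str) -> List[str]:
    """Collect every file under root, descending into all sub-directories."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            found.append(os.path.join(dirpath, name))
    return sorted(found)
```

For a partition directory whose data sits only in nested sub-directories, the first function returns an empty list while the second returns every data file, which would then be the paths added to the job's input.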




[jira] Created: (HIVE-1327) Group by partition column returns wrong results

2010-04-28 Thread Ning Zhang (JIRA)
Group by partition column returns wrong results
---

 Key: HIVE-1327
 URL: https://issues.apache.org/jira/browse/HIVE-1327
 Project: Hadoop Hive
  Issue Type: Bug
Affects Versions: 0.6.0
Reporter: Ning Zhang
 Fix For: 0.6.0


hive> show partitions nzhang_part7;
show partitions nzhang_part7;
OK
ds=2010-01-11
ds=2010-01-23
ds=2010-04-03
ds=2010-04-19
ds=2010-04-22
Time taken: 0.431 seconds

[nzh...@dev303 /tmp] dfs -ls /user/facebook/warehouse/nzhang_part7/*
-rw-r--r--   3 nzhang supergroup 1756123 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-01-11/attempt_201004162336_176893_r_00_0.gz
-rw-r--r--   3 nzhang supergroup 1758227 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-01-11/attempt_201004162336_176893_r_01_0.gz
-rw-r--r--   3 nzhang supergroup 1915969 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-01-23/attempt_201004162336_176893_r_00_0.gz
-rw-r--r--   3 nzhang supergroup 1943830 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-01-23/attempt_201004162336_176893_r_01_0.gz
-rw-r--r--   3 nzhang supergroup 1646739 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-04-03/attempt_201004162336_176893_r_00_0.gz
-rw-r--r--   3 nzhang supergroup 1641052 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-04-03/attempt_201004162336_176893_r_01_0.gz
-rw-r--r--   3 nzhang supergroup   58601 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-04-19/attempt_201004162336_176893_r_00_0.gz
-rw-r--r--   3 nzhang supergroup   57465 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-04-19/attempt_201004162336_176893_r_01_0.gz
-rw-r--r--   3 nzhang supergroup 1064491 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-04-22/attempt_201004162336_176893_r_00_0.gz
-rw-r--r--   3 nzhang supergroup 1070580 2010-04-28 11:54 /user/facebook/warehouse/nzhang_part7/ds=2010-04-22/attempt_201004162336_176893_r_01_0.gz

hive> select ds, count(1) from nzhang_part7 where ds is not null group by ds;
2010-04-03  1761129
Time taken: 187.692 seconds




[jira] Commented: (HIVE-1192) Build fails when hadoop.version=0.20.1

2010-04-28 Thread Bill Au (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861889#action_12861889
 ] 

Bill Au commented on HIVE-1192:
---

I am trying to build 0.5.0 for Hadoop 0.20.1.  It looks like 0.20.1 is only 
available on http://archive.apache.org/dist.  So I set hadoop.mirror to that 
but I am getting the same error as in HIVE-984:

https://issues.apache.org/jira/browse/HIVE-984

I can connect and download the file but there is a problem with the checksum.

What do I need to do to proceed?

> Build fails when hadoop.version=0.20.1
> --
>
> Key: HIVE-1192
> URL: https://issues.apache.org/jira/browse/HIVE-1192
> Project: Hadoop Hive
>  Issue Type: Bug
>  Components: Build Infrastructure
>Reporter: Carl Steinbach
>
> Setting hadoop.version=0.20.1 causes the build to fail since
> mirror.facebook.net/facebook/hive-deps does not have 0.20.1
> (only 0.17.2.1, 0.18.3, 0.19.0, 0.20.0).
> Suggested fix:
> * remove/ignore the hadoop.version configuration parameter
> or
> * Remove the patch numbers from these archives and use only the major.minor 
> numbers specified by the user to locate the appropriate tarball to download, 
> so 0.20.0 and 0.20.1 would both map to hadoop-0.20.tar.gz.
> * Optionally create new tarballs that only contain the components that are 
> actually needed for the build (Hadoop jars), and remove things that aren't 
> needed (all of the source files).




[jira] Updated: (HIVE-1317) CombineHiveInputFormat throws exception when partition name contains special characters to URI

2010-04-28 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang updated HIVE-1317:
-

Attachment: HIVE-1317.2.patch

Updated HIVE-1317.2.patch with the fix. 

> CombineHiveInputFormat throws exception when partition name contains special 
> characters to URI
> --
>
> Key: HIVE-1317
> URL: https://issues.apache.org/jira/browse/HIVE-1317
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1317.2.patch, HIVE-1317.patch
>
>
> If a partition name contains characters such as ':' and '|', which have 
> special meaning in a URI (HDFS uses URIs internally for Path), 
> CombineHiveInputFormat throws an exception. The URI was created in 
> CombineHiveInputFormat to check whether a path belongs to a partition in 
> partitionToPathInfo. We should bypass URI creation and use plain string 
> comparisons instead.




[jira] Updated: (HIVE-1317) CombineHiveInputFormat throws exception when partition name contains special characters to URI

2010-04-28 Thread Ning Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ning Zhang updated HIVE-1317:
-

Attachment: (was: HIVE-1317.2.patch)

> CombineHiveInputFormat throws exception when partition name contains special 
> characters to URI
> --
>
> Key: HIVE-1317
> URL: https://issues.apache.org/jira/browse/HIVE-1317
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1317.patch
>
>
> If a partition name contains characters such as ':' and '|', which have 
> special meaning in a URI (HDFS uses URIs internally for Path), 
> CombineHiveInputFormat throws an exception. The URI was created in 
> CombineHiveInputFormat to check whether a path belongs to a partition in 
> partitionToPathInfo. We should bypass URI creation and use plain string 
> comparisons instead.




[jira] Commented: (HIVE-1317) CombineHiveInputFormat throws exception when partition name contains special characters to URI

2010-04-28 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861824#action_12861824
 ] 

Namit Jain commented on HIVE-1317:
--

Ning, I am getting a compilation error after applying the patch -

[javac] symbol  : variable File
[javac] location: class org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.CombineFilter
[javac]   pString = p.toString() + File.separator;
[javac]^
[javac] Note: Some input files use or override a deprecated API.
[javac] Note: Recompile with -Xlint:deprecation for details.
[javac] Note: Some input files use unchecked or unsafe operations.
[javac] Note: Recompile with -Xlint:unchecked for details.
[javac] 1 error

BUILD FAILED


> CombineHiveInputFormat throws exception when partition name contains special 
> characters to URI
> --
>
> Key: HIVE-1317
> URL: https://issues.apache.org/jira/browse/HIVE-1317
> Project: Hadoop Hive
>  Issue Type: Bug
>Affects Versions: 0.6.0
>Reporter: Ning Zhang
>Assignee: Ning Zhang
> Fix For: 0.6.0
>
> Attachments: HIVE-1317.2.patch, HIVE-1317.patch
>
>
> If a partition name contains characters such as ':' and '|', which have 
> special meaning in a URI (HDFS uses URIs internally for Path), 
> CombineHiveInputFormat throws an exception. The URI was created in 
> CombineHiveInputFormat to check whether a path belongs to a partition in 
> partitionToPathInfo. We should bypass URI creation and use plain string 
> comparisons instead.
