date:20121010

[jira] [Commented] (HIVE-3392) Hive unnecessarily validates table SerDes when dropping a table

2012-10-10 Thread Ajesh Kumar (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473103#comment-13473103
 ] 

Ajesh Kumar commented on HIVE-3392:
---

Hi Edward Capriolo,
So can we conclude this as not an issue. 
or I can make a change to display a message like below when the class is not 
found.

hive drop table temptbl;
FAILED: Hive Internal Error: 
java.lang.RuntimeException(MetaException(message:org.apache.hadoop.hive.serde2.SerDeException
 SerDe org.apache.hadoop.hive.serde2.TestDelimitedJSONSerDe does not exist))
hive

 Hive unnecessarily validates table SerDes when dropping a table
 ---

 Key: HIVE-3392
 URL: https://issues.apache.org/jira/browse/HIVE-3392
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.9.0
Reporter: Jonathan Natkins
Assignee: Ajesh Kumar
  Labels: patch
 Attachments: HIVE-3392.2.patch.txt, HIVE-3392.Test Case - After 
 Patch.txt, HIVE-3392.Test Case - Before Patch.txt


 natty@hadoop1:~$ hive
 hive add jar 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar;
 Added 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
  to class path
 Added resource: 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
 hive create table test (a int) row format serde 'hive.serde.JSONSerDe';  
   
 OK
 Time taken: 2.399 seconds
 natty@hadoop1:~$ hive
 hive drop table test;

 FAILED: Hive Internal Error: 
 java.lang.RuntimeException(MetaException(message:org.apache.hadoop.hive.serde2.SerDeException
  SerDe hive.serde.JSONSerDe does not exist))
 java.lang.RuntimeException: 
 MetaException(message:org.apache.hadoop.hive.serde2.SerDeException SerDe 
 hive.serde.JSONSerDe does not exist)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.getDeserializerFromMetaStore(Table.java:262)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.getDeserializer(Table.java:253)
   at org.apache.hadoop.hive.ql.metadata.Table.getCols(Table.java:490)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.checkValidity(Table.java:162)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:943)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeDropTable(DDLSemanticAnalyzer.java:700)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeInternal(DDLSemanticAnalyzer.java:210)
   at 
 org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:243)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:430)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:337)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:889)
   at 
 org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:255)
   at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:212)
   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:403)
   at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:671)
   at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:554)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:208)
 Caused by: MetaException(message:org.apache.hadoop.hive.serde2.SerDeException 
 SerDe com.cloudera.hive.serde.JSONSerDe does not exist)
   at 
 org.apache.hadoop.hive.metastore.MetaStoreUtils.getDeserializer(MetaStoreUtils.java:211)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.getDeserializerFromMetaStore(Table.java:260)
   ... 20 more
 hive add jar 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar;
 Added 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
  to class path
 Added resource: 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
 hive drop table test;
 OK
 Time taken: 0.658 seconds
 hive 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3561) Build a full SQL-compliant parser for Hive

2012-10-10 Thread Lianhui Wang (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-3561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473167#comment-13473167
]

Lianhui Wang commented on HIVE-3561:

for the first approach,there is a problem. standardSQL can not support the
HiveQL writting in historical.
because there is a big difference in some operators. example:join.
so that maybe spent a lot of time to transfering using hivesql to standardSQL.
in my opinion,in short time,both maybe co-exist.

Build a full SQL-compliant parser for Hive
--

Key: HIVE-3561
URL: https://issues.apache.org/jira/browse/HIVE-3561
Project: Hive
Issue Type: Sub-task
Components: Query Processor
Affects Versions: 0.10.0
Reporter: Shengsheng Huang

To build a full SQL compliant engine on Hive, we'll need a full SQL complant
parser. The current Hive parser missed a lot of grammar units from standard
SQL. To support full SQL there're possibly four approaches:
1.Extend the existing Hive parser to support full SQL constructs. We need to
modify the current Hive.g and add any missing grammar units and resolve
conflicts.
2.Reuse an existing open source SQL compliant parser and extend it to support
Hive extensions. We may need to adapt Semantic Analyzers to the new AST
structure.
3.Reuse an existing SQL compliant parser and make it co-exist with the
existing Hive parser. Both parsers share the same CliDriver interface. Use a
query mode configuration to switch the query mode between SQL and HQL (this
is the approach we're now using in the 0.9.0 demo project)
4.Reuse an existing SQL compliant parser and make it co-exist with the
existing Hive parser. Use a separate xxxCliDriver interface for standard SQL.

Let's discuss which is the best approach.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3276) optimize union sub-queries

2012-10-10 Thread Namit Jain (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-3276:
-

Status: Open  (was: Patch Available)

will address comments on phabricator

 optimize union sub-queries
 --

 Key: HIVE-3276
 URL: https://issues.apache.org/jira/browse/HIVE-3276
 Project: Hive
  Issue Type: Bug
Reporter: Namit Jain
Assignee: Namit Jain
 Attachments: hive.3276.10.patch, HIVE-3276.1.patch, 
 hive.3276.2.patch, hive.3276.3.patch, hive.3276.4.patch, hive.3276.5.patch, 
 hive.3276.6.patch, hive.3276.7.patch, hive.3276.8.patch, hive.3276.9.patch


 It might be a good idea to optimize simple union queries containing 
 map-reduce jobs in at least one of the sub-qeuries.
 For eg:
 a query like:
 insert overwrite table T1 partition P1
 select * from 
 (
   subq1
 union all
   subq2
 ) u;
 today creates 3 map-reduce jobs, one for subq1, another for subq2 and 
 the final one for the union. 
 It might be a good idea to optimize this. Instead of creating the union 
 task, it might be simpler to create a move task (or something like a move
 task), where the outputs of the two sub-queries will be moved to the final 
 directory. This can easily extend to more than 2 sub-queries in the union.
 This is very useful if there is a select * followed by filesink after the
 union. This can be independently useful, and also be used to optimize the
 skewed joins https://cwiki.apache.org/Hive/skewed-join-optimization.html.
 If there is a select, filter between the union and the filesink, the select
 and the filter can be moved before the union, and the follow-up job can
 still be removed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Build failed in Jenkins: Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false #164

2012-10-10 Thread Apache Jenkins Server

See 
https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/164/

--
[...truncated 10125 lines...]
 [echo] Project: odbc
 [copy] Warning: 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/odbc/src/conf
 does not exist.

ivy-resolve-test:
 [echo] Project: odbc

ivy-retrieve-test:
 [echo] Project: odbc

compile-test:
 [echo] Project: odbc

create-dirs:
 [echo] Project: serde
 [copy] Warning: 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/serde/src/test/resources
 does not exist.

init:
 [echo] Project: serde

ivy-init-settings:
 [echo] Project: serde

ivy-resolve:
 [echo] Project: serde
[ivy:resolve] :: loading settings :: file = 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/ivy/ivysettings.xml
[ivy:report] Processing 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/ivy/resolution-cache/org.apache.hive-hive-serde-default.xml
 to 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/ivy/report/org.apache.hive-hive-serde-default.html

ivy-retrieve:
 [echo] Project: serde

dynamic-serde:

compile:
 [echo] Project: serde

ivy-resolve-test:
 [echo] Project: serde

ivy-retrieve-test:
 [echo] Project: serde

compile-test:
 [echo] Project: serde
[javac] Compiling 26 source files to 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/serde/test/classes
[javac] Note: Some input files use or override a deprecated API.
[javac] Note: Recompile with -Xlint:deprecation for details.
[javac] Note: Some input files use unchecked or unsafe operations.
[javac] Note: Recompile with -Xlint:unchecked for details.

create-dirs:
 [echo] Project: service
 [copy] Warning: 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/service/src/test/resources
 does not exist.

init:
 [echo] Project: service

ivy-init-settings:
 [echo] Project: service

ivy-resolve:
 [echo] Project: service
[ivy:resolve] :: loading settings :: file = 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/ivy/ivysettings.xml
[ivy:report] Processing 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/ivy/resolution-cache/org.apache.hive-hive-service-default.xml
 to 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/ivy/report/org.apache.hive-hive-service-default.html

ivy-retrieve:
 [echo] Project: service

compile:
 [echo] Project: service

ivy-resolve-test:
 [echo] Project: service

ivy-retrieve-test:
 [echo] Project: service

compile-test:
 [echo] Project: service
[javac] Compiling 2 source files to 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/service/test/classes

test:
 [echo] Project: hive

test-shims:
 [echo] Project: hive

test-conditions:
 [echo] Project: shims

gen-test:
 [echo] Project: shims

create-dirs:
 [echo] Project: shims
 [copy] Warning: 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/shims/src/test/resources
 does not exist.

init:
 [echo] Project: shims

ivy-init-settings:
 [echo] Project: shims

ivy-resolve:
 [echo] Project: shims
[ivy:resolve] :: loading settings :: file = 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/ivy/ivysettings.xml
[ivy:report] Processing 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/ivy/resolution-cache/org.apache.hive-hive-shims-default.xml
 to 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/ivy/report/org.apache.hive-hive-shims-default.html

ivy-retrieve:
 [echo] Project: shims

compile:
 [echo] Project: shims
 [echo] Building shims 0.20

build_shims:
 [echo] Project: shims
 [echo] Compiling 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/shims/src/common/java;/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/shims/src/0.20/java
 against hadoop 0.20.2 
(/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/build/hadoopcore/hadoop-0.20.2)

ivy-init-settings:
 [echo] Project: shims

ivy-resolve-hadoop-shim:
 [echo] Project: shims
[ivy:resolve] :: loading settings :: file = 
/x1/jenkins/jenkins-slave/workspace/Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false/hive/ivy/ivysettings.xml

ivy-retrieve-hadoop-shim:
 [echo] Project: shims
 [echo] Building shims 0.20S

build_shims:
 [echo] Project: shims
 [echo] Compiling

[jira] [Updated] (HIVE-3276) optimize union sub-queries

2012-10-10 Thread Namit Jain (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-3276:
-

Status: Patch Available  (was: Open)

 optimize union sub-queries
 --

 Key: HIVE-3276
 URL: https://issues.apache.org/jira/browse/HIVE-3276
 Project: Hive
  Issue Type: Bug
Reporter: Namit Jain
Assignee: Namit Jain
 Attachments: hive.3276.10.patch, hive.3276.11.patch, 
 HIVE-3276.1.patch, hive.3276.2.patch, hive.3276.3.patch, hive.3276.4.patch, 
 hive.3276.5.patch, hive.3276.6.patch, hive.3276.7.patch, hive.3276.8.patch, 
 hive.3276.9.patch


 It might be a good idea to optimize simple union queries containing 
 map-reduce jobs in at least one of the sub-qeuries.
 For eg:
 a query like:
 insert overwrite table T1 partition P1
 select * from 
 (
   subq1
 union all
   subq2
 ) u;
 today creates 3 map-reduce jobs, one for subq1, another for subq2 and 
 the final one for the union. 
 It might be a good idea to optimize this. Instead of creating the union 
 task, it might be simpler to create a move task (or something like a move
 task), where the outputs of the two sub-queries will be moved to the final 
 directory. This can easily extend to more than 2 sub-queries in the union.
 This is very useful if there is a select * followed by filesink after the
 union. This can be independently useful, and also be used to optimize the
 skewed joins https://cwiki.apache.org/Hive/skewed-join-optimization.html.
 If there is a select, filter between the union and the filesink, the select
 and the filter can be moved before the union, and the follow-up job can
 still be removed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3554) Hive List Bucketing - Query logic

2012-10-10 Thread Gang Tim Liu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3554:
---

Attachment: HIVE-3554.patch.2

 Hive List Bucketing - Query logic
 -

 Key: HIVE-3554
 URL: https://issues.apache.org/jira/browse/HIVE-3554
 Project: Hive
  Issue Type: New Feature
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.1, HIVE-3554.patch.2


 This is part of efforts for list bucketing feature: 
 https://cwiki.apache.org/Hive/listbucketing.html
 This patch includes:
 1. Query logic: hive chooses right sub-directory instead of partition 
 directory.
 2. alter table grammar which is required to support query logic
 This patch doesn't include list bucketing DML. Main reasons:
 1. risk. w/o DML, this patch won't impact any existing hive regression 
 features since no touch on any data manipulation so that very low risk.
 2. manageability. w/ DML, patch is getting bigger and hard to review. 
 Removing DML, it's easy to review.
 We still disable hive feature by default since DML is not in yet.
 DML will be in follow-up patch. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-1915) authorization on database level is broken.

2012-10-10 Thread Carl Steinbach (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-1915:
-

Component/s: Authorization

 authorization on database level is broken.
 --

 Key: HIVE-1915
 URL: https://issues.apache.org/jira/browse/HIVE-1915
 Project: Hive
  Issue Type: Bug
  Components: Authorization, Metastore, Security
Reporter: He Yongqiang
Assignee: He Yongqiang
 Fix For: 0.7.0

 Attachments: HIVE-1915.1.patch, HIVE-1915-2.patch, HIVE-1915-3.patch


 CREATE DATABASE IF NOT EXISTS test_db COMMENT 'Hive test database';
 SHOW DATABASES;
 grant `drop` on DATABASE test_db to user hive_test_user;
 grant `select` on DATABASE test_db to user hive_test_user;
 show grant user hive_test_user on DATABASE test_db;
 DROP DATABASE IF EXISTS test_db;
 will fail.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

2012-10-10 Thread Prasad Mujumdar (JIRA)

Prasad Mujumdar created HIVE-3563:
-

 Summary: Drop database cascade fails when there are indexes on any 
tables
 Key: HIVE-3563
 URL: https://issues.apache.org/jira/browse/HIVE-3563
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Metastore
Affects Versions: 0.9.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar


Drop database cascade fails if any of the table has index. 

create database db2;
use db2;

create table tab1 (id int, name string);
create index idx1 on table tab1(id) as  'COMPACT' with DEFERRED REBUILD;

drop database db2 cascade;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

2012-10-10 Thread Prasad Mujumdar (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasad Mujumdar updated HIVE-3563:
--

Attachment: HIVE-3563-1.patch

 Drop database cascade fails when there are indexes on any tables
 

 Key: HIVE-3563
 URL: https://issues.apache.org/jira/browse/HIVE-3563
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Metastore
Affects Versions: 0.9.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-3563-1.patch


 Drop database cascade fails if any of the table has index. 
 create database db2;
 use db2;
 create table tab1 (id int, name string);
 create index idx1 on table tab1(id) as  'COMPACT' with DEFERRED REBUILD;
 drop database db2 cascade;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

2012-10-10 Thread Prasad Mujumdar (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasad Mujumdar updated HIVE-3563:
--

Status: Patch Available  (was: Open)

 Drop database cascade fails when there are indexes on any tables
 

 Key: HIVE-3563
 URL: https://issues.apache.org/jira/browse/HIVE-3563
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Metastore
Affects Versions: 0.9.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-3563-1.patch


 Drop database cascade fails if any of the table has index. 
 create database db2;
 use db2;
 create table tab1 (id int, name string);
 create index idx1 on table tab1(id) as  'COMPACT' with DEFERRED REBUILD;
 drop database db2 cascade;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3564) hivetest.py: revision number and applied patch

2012-10-10 Thread Ivan Gorbachev (JIRA)

Ivan Gorbachev created HIVE-3564:


 Summary: hivetest.py: revision number and applied patch
 Key: HIVE-3564
 URL: https://issues.apache.org/jira/browse/HIVE-3564
 Project: Hive
  Issue Type: Improvement
  Components: Testing Infrastructure
Reporter: Ivan Gorbachev
Assignee: Ivan Gorbachev


It's required to add new option for hivetest.py which will allow to show base 
revision number and applied patch.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

2012-10-10 Thread Prasad Mujumdar (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473572#comment-13473572
 ] 

Prasad Mujumdar commented on HIVE-3563:
---

Review request on https://reviews.apache.org/r/7517/

 Drop database cascade fails when there are indexes on any tables
 

 Key: HIVE-3563
 URL: https://issues.apache.org/jira/browse/HIVE-3563
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Metastore
Affects Versions: 0.9.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-3563-1.patch


 Drop database cascade fails if any of the table has index. 
 create database db2;
 use db2;
 create table tab1 (id int, name string);
 create index idx1 on table tab1(id) as  'COMPACT' with DEFERRED REBUILD;
 drop database db2 cascade;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Work started] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Gang Tim Liu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3556 started by Gang Tim Liu.

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Gang Tim Liu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3556:
---

Attachment: HIVE-3554.patch.3

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.3


 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Gang Tim Liu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3556:
---

Status: Patch Available  (was: In Progress)

patch is available https://reviews.facebook.net/D5961

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.3


 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Build failed in Jenkins: Hive-0.9.1-SNAPSHOT-h0.21 #164

2012-10-10 Thread Apache Jenkins Server

See https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/164/

--
[...truncated 36580 lines...]
[junit] POSTHOOK: query: select count(1) as cnt from testhivedrivertable
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Input: default@testhivedrivertable
[junit] POSTHOOK: Output: 
file:/tmp/hudson/hive_2012-10-10_15-03-03_489_31864290084643955/-mr-1
[junit] OK
[junit] PREHOOK: query: drop table testhivedrivertable
[junit] PREHOOK: type: DROPTABLE
[junit] PREHOOK: Input: default@testhivedrivertable
[junit] PREHOOK: Output: default@testhivedrivertable
[junit] POSTHOOK: query: drop table testhivedrivertable
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Input: default@testhivedrivertable
[junit] POSTHOOK: Output: default@testhivedrivertable
[junit] OK
[junit] Hive history 
file=https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/164/artifact/hive/build/service/tmp/hive_job_log_hudson_201210101503_1427267392.txt
[junit] PREHOOK: query: drop table testhivedrivertable
[junit] PREHOOK: type: DROPTABLE
[junit] POSTHOOK: query: drop table testhivedrivertable
[junit] POSTHOOK: type: DROPTABLE
[junit] OK
[junit] PREHOOK: query: create table testhivedrivertable (num int)
[junit] PREHOOK: type: DROPTABLE
[junit] POSTHOOK: query: create table testhivedrivertable (num int)
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Output: default@testhivedrivertable
[junit] OK
[junit] PREHOOK: query: load data local inpath 
'https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/ws/hive/data/files/kv1.txt'
 into table testhivedrivertable
[junit] PREHOOK: type: DROPTABLE
[junit] PREHOOK: Output: default@testhivedrivertable
[junit] Copying file: 
https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/ws/hive/data/files/kv1.txt
[junit] Copying data from 
https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/ws/hive/data/files/kv1.txt
[junit] Loading data to table default.testhivedrivertable
[junit] POSTHOOK: query: load data local inpath 
'https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/ws/hive/data/files/kv1.txt'
 into table testhivedrivertable
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Output: default@testhivedrivertable
[junit] OK
[junit] PREHOOK: query: select * from testhivedrivertable limit 10
[junit] PREHOOK: type: DROPTABLE
[junit] PREHOOK: Input: default@testhivedrivertable
[junit] PREHOOK: Output: 
file:/tmp/hudson/hive_2012-10-10_15-03-07_806_1628162782103306798/-mr-1
[junit] POSTHOOK: query: select * from testhivedrivertable limit 10
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Input: default@testhivedrivertable
[junit] POSTHOOK: Output: 
file:/tmp/hudson/hive_2012-10-10_15-03-07_806_1628162782103306798/-mr-1
[junit] OK
[junit] PREHOOK: query: drop table testhivedrivertable
[junit] PREHOOK: type: DROPTABLE
[junit] PREHOOK: Input: default@testhivedrivertable
[junit] PREHOOK: Output: default@testhivedrivertable
[junit] POSTHOOK: query: drop table testhivedrivertable
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Input: default@testhivedrivertable
[junit] POSTHOOK: Output: default@testhivedrivertable
[junit] OK
[junit] Hive history 
file=https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/164/artifact/hive/build/service/tmp/hive_job_log_hudson_201210101503_1899548591.txt
[junit] PREHOOK: query: drop table testhivedrivertable
[junit] PREHOOK: type: DROPTABLE
[junit] POSTHOOK: query: drop table testhivedrivertable
[junit] POSTHOOK: type: DROPTABLE
[junit] OK
[junit] PREHOOK: query: create table testhivedrivertable (num int)
[junit] PREHOOK: type: DROPTABLE
[junit] POSTHOOK: query: create table testhivedrivertable (num int)
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Output: default@testhivedrivertable
[junit] OK
[junit] PREHOOK: query: drop table testhivedrivertable
[junit] PREHOOK: type: DROPTABLE
[junit] PREHOOK: Input: default@testhivedrivertable
[junit] PREHOOK: Output: default@testhivedrivertable
[junit] POSTHOOK: query: drop table testhivedrivertable
[junit] POSTHOOK: type: DROPTABLE
[junit] POSTHOOK: Input: default@testhivedrivertable
[junit] POSTHOOK: Output: default@testhivedrivertable
[junit] OK
[junit] Hive history 
file=https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/164/artifact/hive/build/service/tmp/hive_job_log_hudson_201210101503_213735700.txt
[junit] Hive history 
file=https://builds.apache.org/job/Hive-0.9.1-SNAPSHOT-h0.21/164/artifact/hive/build/service/tmp/hive_job_log_hudson_201210101503_1747212389.txt
[junit] PREHOOK: query: drop table testhivedrivertable
[junit] PREHOOK: type: DROPTABLE
[junit] POSTHOOK: query: drop table testhivedrivertable
[junit] POSTHOOK: type:

[jira] [Commented] (HIVE-3392) Hive unnecessarily validates table SerDes when dropping a table

2012-10-10 Thread Edward Capriolo (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473612#comment-13473612
 ] 

Edward Capriolo commented on HIVE-3392:
---

Yes. I think a nice clean exception / error message is the way to go. 

 Hive unnecessarily validates table SerDes when dropping a table
 ---

 Key: HIVE-3392
 URL: https://issues.apache.org/jira/browse/HIVE-3392
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.9.0
Reporter: Jonathan Natkins
Assignee: Ajesh Kumar
  Labels: patch
 Attachments: HIVE-3392.2.patch.txt, HIVE-3392.Test Case - After 
 Patch.txt, HIVE-3392.Test Case - Before Patch.txt


 natty@hadoop1:~$ hive
 hive add jar 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar;
 Added 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
  to class path
 Added resource: 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
 hive create table test (a int) row format serde 'hive.serde.JSONSerDe';  
   
 OK
 Time taken: 2.399 seconds
 natty@hadoop1:~$ hive
 hive drop table test;

 FAILED: Hive Internal Error: 
 java.lang.RuntimeException(MetaException(message:org.apache.hadoop.hive.serde2.SerDeException
  SerDe hive.serde.JSONSerDe does not exist))
 java.lang.RuntimeException: 
 MetaException(message:org.apache.hadoop.hive.serde2.SerDeException SerDe 
 hive.serde.JSONSerDe does not exist)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.getDeserializerFromMetaStore(Table.java:262)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.getDeserializer(Table.java:253)
   at org.apache.hadoop.hive.ql.metadata.Table.getCols(Table.java:490)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.checkValidity(Table.java:162)
   at org.apache.hadoop.hive.ql.metadata.Hive.getTable(Hive.java:943)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeDropTable(DDLSemanticAnalyzer.java:700)
   at 
 org.apache.hadoop.hive.ql.parse.DDLSemanticAnalyzer.analyzeInternal(DDLSemanticAnalyzer.java:210)
   at 
 org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:243)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:430)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:337)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:889)
   at 
 org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:255)
   at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:212)
   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:403)
   at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:671)
   at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:554)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.apache.hadoop.util.RunJar.main(RunJar.java:208)
 Caused by: MetaException(message:org.apache.hadoop.hive.serde2.SerDeException 
 SerDe com.cloudera.hive.serde.JSONSerDe does not exist)
   at 
 org.apache.hadoop.hive.metastore.MetaStoreUtils.getDeserializer(MetaStoreUtils.java:211)
   at 
 org.apache.hadoop.hive.ql.metadata.Table.getDeserializerFromMetaStore(Table.java:260)
   ... 20 more
 hive add jar 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar;
 Added 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
  to class path
 Added resource: 
 /home/natty/source/sample-code/custom-serdes/target/custom-serdes-1.0-SNAPSHOT.jar
 hive drop table test;
 OK
 Time taken: 0.658 seconds
 hive 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Hive-trunk-h0.21 - Build # 1732 - Still Failing

2012-10-10 Thread Apache Jenkins Server

Changes for Build #1728

Changes for Build #1729

Changes for Build #1730
[namit] HIVE-3550 Early skipping for limit operator at reduce stage
(Navis via namit)

[namit] HIVE-3536 Output of sort merge join is no longer bucketed
(Kevin Wilfong) 

For some reason, I missed merging the test in my previous merge

[kevinwilfong] HIVE-3522. Make separator for Entity name configurable. 
(Raghotham Murthy via kevinwilfong)


Changes for Build #1731

Changes for Build #1732



1 tests failed.
REGRESSION:  
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_stats_aggregator_error_1

Error Message:
Forked Java VM exited abnormally. Please note the time in the report does not 
reflect the time until the VM exit.

Stack Trace:
junit.framework.AssertionFailedError: Forked Java VM exited abnormally. Please 
note the time in the report does not reflect the time until the VM exit.
at 
net.sf.antcontrib.logic.ForTask.doSequentialIteration(ForTask.java:259)
at net.sf.antcontrib.logic.ForTask.doToken(ForTask.java:268)
at net.sf.antcontrib.logic.ForTask.doTheTasks(ForTask.java:324)
at net.sf.antcontrib.logic.ForTask.execute(ForTask.java:244)




The Apache Jenkins build system has built Hive-trunk-h0.21 (build #1732)

Status: Still Failing

Check console output at https://builds.apache.org/job/Hive-trunk-h0.21/1732/ to 
view the results.

[jira] [Updated] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

2012-10-10 Thread Carl Steinbach (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-3563:
-

Status: Open  (was: Patch Available)

@Prasad: Please see my comments on RB. Thanks.

 Drop database cascade fails when there are indexes on any tables
 

 Key: HIVE-3563
 URL: https://issues.apache.org/jira/browse/HIVE-3563
 Project: Hive
  Issue Type: Bug
  Components: Indexing, Metastore
Affects Versions: 0.9.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
 Attachments: HIVE-3563-1.patch


 Drop database cascade fails if any of the table has index. 
 create database db2;
 use db2;
 create table tab1 (id int, name string);
 create index idx1 on table tab1(id) as  'COMPACT' with DEFERRED REBUILD;
 drop database db2 cascade;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3554) Hive List Bucketing - Query logic

2012-10-10 Thread Carl Steinbach (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473740#comment-13473740
 ] 

Carl Steinbach commented on HIVE-3554:
--

@Tim: I added some comments on phabricator. Also, can you please link this 
ticket to the other list bucketing tickets? Thanks.

 Hive List Bucketing - Query logic
 -

 Key: HIVE-3554
 URL: https://issues.apache.org/jira/browse/HIVE-3554
 Project: Hive
  Issue Type: New Feature
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.1, HIVE-3554.patch.2


 This is part of efforts for list bucketing feature: 
 https://cwiki.apache.org/Hive/listbucketing.html
 This patch includes:
 1. Query logic: hive chooses right sub-directory instead of partition 
 directory.
 2. alter table grammar which is required to support query logic
 This patch doesn't include list bucketing DML. Main reasons:
 1. risk. w/o DML, this patch won't impact any existing hive regression 
 features since no touch on any data manipulation so that very low risk.
 2. manageability. w/ DML, patch is getting bigger and hard to review. 
 Removing DML, it's easy to review.
 We still disable hive feature by default since DML is not in yet.
 DML will be in follow-up patch. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3554) Hive List Bucketing - Query logic

2012-10-10 Thread Carl Steinbach (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Carl Steinbach updated HIVE-3554:
-

Status: Open  (was: Patch Available)

 Hive List Bucketing - Query logic
 -

 Key: HIVE-3554
 URL: https://issues.apache.org/jira/browse/HIVE-3554
 Project: Hive
  Issue Type: New Feature
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.1, HIVE-3554.patch.2


 This is part of efforts for list bucketing feature: 
 https://cwiki.apache.org/Hive/listbucketing.html
 This patch includes:
 1. Query logic: hive chooses right sub-directory instead of partition 
 directory.
 2. alter table grammar which is required to support query logic
 This patch doesn't include list bucketing DML. Main reasons:
 1. risk. w/o DML, this patch won't impact any existing hive regression 
 features since no touch on any data manipulation so that very low risk.
 2. manageability. w/ DML, patch is getting bigger and hard to review. 
 Removing DML, it's easy to review.
 We still disable hive feature by default since DML is not in yet.
 DML will be in follow-up patch. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Namit Jain (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-3556:
-

Status: Open  (was: Patch Available)

comments on phabricator

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.3


 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3565) use hbase tables for writing intermediate directories across map-reduce boundaries

2012-10-10 Thread Namit Jain (JIRA)

Namit Jain created HIVE-3565:


 Summary: use hbase tables for writing intermediate directories 
across map-reduce boundaries
 Key: HIVE-3565
 URL: https://issues.apache.org/jira/browse/HIVE-3565
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Namit Jain
Assignee: Namit Jain




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3565) use hbase tables for writing intermediate directories across map-reduce boundaries

2012-10-10 Thread Namit Jain (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473788#comment-13473788
 ] 

Namit Jain commented on HIVE-3565:
--

Consider a query like:

select B.y, count(1) from
A join B on A.x=B.x
group by B.y;

This will require  2 MR jobs. The first MR job will perform the join, and the 
second MR job will perform the group by (note that the 2nd MR job would have a 
identity mapper). If the first MR job could write the output of the join to a 
HBase table (which is keyed by B.y), the 2nd MR can be a map-only job which can
simply scan the HBase table. This idea can be extended to joins as well.

 use hbase tables for writing intermediate directories across map-reduce 
 boundaries
 --

 Key: HIVE-3565
 URL: https://issues.apache.org/jira/browse/HIVE-3565
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Namit Jain
Assignee: Namit Jain



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Navis (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13473810#comment-13473810
 ] 

Navis commented on HIVE-3556:
-

It's better than HIVE-3209

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.3


 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Work started] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Gang Tim Liu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3556 started by Gang Tim Liu.

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3554.patch.3


 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Gang Tim Liu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3556:
---

Attachment: (was: HIVE-3554.patch.3)

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3556.patch.1


 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

2012-10-10 Thread Gang Tim Liu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3556:
---

Attachment: HIVE-3556.patch.1

 Test Path - Alias for explain extended
 -

 Key: HIVE-3556
 URL: https://issues.apache.org/jira/browse/HIVE-3556
 Project: Hive
  Issue Type: Bug
  Components: Testing Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3556.patch.1


 Test framework masks output of Path - Alias for explain extended. This 
 makes it impossible to verify the output is right. 
 Design is to add a new entry Truncated Path - Alias to MapredWork. It has 
 the same content as Path - Alias except the prefix including file schema 
 and temp dir is removed. The following config will be used for prefix-removal:
 METASTOREWAREHOUSE(hive.metastore.warehouse.dir, /user/hive/warehouse),
 This will keep Path - Alias intact and also test it's result is right.
 The first use case is to verify list bucketing query's result is right.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3392) Hive unnecessarily validates table SerDes when dropping a table

[jira] [Commented] (HIVE-3561) Build a full SQL-compliant parser for Hive

[jira] [Updated] (HIVE-3276) optimize union sub-queries

Build failed in Jenkins: Hive-0.9.1-SNAPSHOT-h0.21-keepgoing=false #164

[jira] [Updated] (HIVE-3276) optimize union sub-queries

[jira] [Updated] (HIVE-3554) Hive List Bucketing - Query logic

[jira] [Updated] (HIVE-1915) authorization on database level is broken.

[jira] [Created] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

[jira] [Updated] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

[jira] [Updated] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

[jira] [Created] (HIVE-3564) hivetest.py: revision number and applied patch

[jira] [Commented] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

[jira] [Work started] (HIVE-3556) Test Path - Alias for explain extended

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

Build failed in Jenkins: Hive-0.9.1-SNAPSHOT-h0.21 #164

[jira] [Commented] (HIVE-3392) Hive unnecessarily validates table SerDes when dropping a table

Hive-trunk-h0.21 - Build # 1732 - Still Failing

[jira] [Updated] (HIVE-3563) Drop database cascade fails when there are indexes on any tables

[jira] [Commented] (HIVE-3554) Hive List Bucketing - Query logic

[jira] [Updated] (HIVE-3554) Hive List Bucketing - Query logic

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

[jira] [Created] (HIVE-3565) use hbase tables for writing intermediate directories across map-reduce boundaries

[jira] [Commented] (HIVE-3565) use hbase tables for writing intermediate directories across map-reduce boundaries

[jira] [Commented] (HIVE-3556) Test Path - Alias for explain extended

[jira] [Work started] (HIVE-3556) Test Path - Alias for explain extended

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

[jira] [Updated] (HIVE-3556) Test Path - Alias for explain extended

28 matches

Site Navigation

Mail list logo

Footer information