[jira] Updated: (HIVE-547) NullPointerException in ExecDriver

2009-06-10 Thread Zheng Shao (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zheng Shao updated HIVE-547:


Attachment: HIVE-547.2.patch

This patch makes sure we don't output one line per second. The behavior is that 
we will output at least one line per minute if no status change is detected.
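
The throttling described above can be sketched as follows. This is only an illustration and not the contents of HIVE-547.2.patch: the class name ProgressThrottle, the method shouldPrint, and the configurable silence threshold are hypothetical names chosen for this sketch.

```java
/**
 * Hypothetical helper (not taken from HIVE-547.2.patch): decides whether a
 * progress report should be printed. A report is printed when it differs from
 * the last printed one, or when nothing has been printed for maxSilenceMillis.
 */
final class ProgressThrottle {
    private final long maxSilenceMillis;
    private String lastReport = "";
    private long lastPrintMillis;
    private boolean printedOnce = false;

    ProgressThrottle(long maxSilenceMillis) {
        this.maxSilenceMillis = maxSilenceMillis;
    }

    /** Returns true when the caller should print the given report now. */
    boolean shouldPrint(String report, long nowMillis) {
        boolean changed = !report.equals(lastReport);
        boolean silentTooLong = nowMillis - lastPrintMillis >= maxSilenceMillis;
        if (!printedOnce || changed || silentTooLong) {
            // Remember what was printed and when, so later calls can compare.
            lastReport = report;
            lastPrintMillis = nowMillis;
            printedOnce = true;
            return true;
        }
        return false;
    }
}
```

A polling loop that wakes up once per second would then print the report only when shouldPrint(report, System.currentTimeMillis()) returns true, with the threshold set to 60000 for the one-line-per-minute behavior.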


 NullPointerException in ExecDriver
 --

 Key: HIVE-547
 URL: https://issues.apache.org/jira/browse/HIVE-547
 Project: Hadoop Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.4.0
Reporter: Zheng Shao
Assignee: Zheng Shao
 Fix For: 0.4.0

 Attachments: HIVE-547.1.patch, HIVE-547.2.patch


 We saw a job fail with the following message in hive.log.
 {noformat}
 2009-06-06 22:50:55,275 ERROR exec.ExecDriver (SessionState.java:printError(279)) - Ended Job = job_200905211352_145363 with exception 'java.lang.NullPointerException(null)'
 java.lang.NullPointerException
     at org.apache.hadoop.hive.ql.exec.ExecDriver.jobProgress(ExecDriver.java:193)
     at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:395)
     at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:307)
     at org.apache.hadoop.hive.ql.Driver.run(Driver.java:213)
     at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:176)
     at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:216)
     at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:273)
     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
     at java.lang.reflect.Method.invoke(Method.java:597)
     at org.apache.hadoop.util.RunJar.main(RunJar.java:166)
     at org.apache.hadoop.mapred.JobShell.run(JobShell.java:194)
     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
     at org.apache.hadoop.mapred.JobShell.main(JobShell.java:220)
 {noformat}
 The corresponding code is:
 {code:java}
   public RunningJob jobProgress(JobClient jc, RunningJob rj) throws IOException {
     String lastReport = "";
     while (!rj.isComplete()) {
       try {
         Thread.sleep(1000);
       } catch (InterruptedException e) {
       }
       rj = jc.getJob(rj.getJobID());
       String report = null;
       // ExecDriver.java line 193, the top frame of the stack trace above:
       report = " map = " + Math.round(rj.mapProgress() * 100) + "%,  reduce ="
           + Math.round(rj.reduceProgress() * 100) + "%";
 {code}
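
 The top stack frame is line 193 of jobProgress itself, and the only object reference dereferenced on that line is rj, so the value returned by jc.getJob(...) was presumably null. Below is a minimal sketch of a guard under that assumption. The JobStatus interface and the formatReport helper are hypothetical stand-ins so that the example compiles without Hadoop on the classpath, and this is not necessarily how the attached patches address the problem.

```java
import java.io.IOException;

final class NullSafeProgress {
    /** Hypothetical stand-in for the two RunningJob methods used on line 193. */
    interface JobStatus {
        float mapProgress() throws IOException;
        float reduceProgress() throws IOException;
    }

    /**
     * Builds the progress report, failing with a descriptive IOException
     * instead of a NullPointerException when the job lookup returned null.
     */
    static String formatReport(JobStatus rj) throws IOException {
        if (rj == null) {
            throw new IOException("Could not find status of job (lookup returned null)");
        }
        return " map = " + Math.round(rj.mapProgress() * 100) + "%,  reduce ="
            + Math.round(rj.reduceProgress() * 100) + "%";
    }
}
```

 Whether the right reaction to a null lookup is to fail, to retry, or to treat the job as finished is a design decision this sketch does not settle.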

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Reopened: (HIVE-547) NullPointerException in ExecDriver

2009-06-10 Thread Zheng Shao (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zheng Shao reopened HIVE-547:
-


The current behavior prints one log line per second, which may not be good 
if the job runs for a long time.




[jira] Updated: (HIVE-547) NullPointerException in ExecDriver

2009-06-10 Thread Zheng Shao (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zheng Shao updated HIVE-547:


Status: Patch Available  (was: Reopened)




JIRA_HIVE-547.2.patch_UNIT_TEST_SUCCEEDED

2009-06-10 Thread Murli Varadachari

SUCCESS: BUILD AND UNIT TEST using PATCH HIVE-547.2.patch PASSED!!



[jira] Created: (HIVE-555) create temporary function support not only udf, but also udaf, genericudf, etc.

2009-06-10 Thread Min Zhou (JIRA)
create temporary function support not only udf, but also udaf,  genericudf, etc.


 Key: HIVE-555
 URL: https://issues.apache.org/jira/browse/HIVE-555
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.4.0
Reporter: Min Zhou
Assignee: Min Zhou
 Fix For: 0.4.0


Right now, the 'create temporary function' command only supports UDFs. 
We could also let users register their own UDAFs and generic UDFs and, in 
the future, generic UDAFs.




[jira] Updated: (HIVE-460) Improve ColumnPruner to prune more aggressively and keep column information for input tables

2009-06-10 Thread He Yongqiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

He Yongqiang updated HIVE-460:
--

Attachment: hive-460-2009-06-10-2.patch

A new patch against trunk (hive-460-2009-06-10-2.patch). It passes all 
TestCliDriver and TestParse tests in my local environment.

 Improve ColumnPruner to prune more aggressively and keep column information 
 for input tables
 

 Key: HIVE-460
 URL: https://issues.apache.org/jira/browse/HIVE-460
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.4.0
Reporter: Zheng Shao
Assignee: He Yongqiang
 Attachments: hive-460-2009-05-29-noaliasallowed.patch, 
 hive-460-2009-06-04-2-195mapjoin.patch, 
 hive-460-2009-06-04-testcli-passed.patch, hive-460-2009-06-06-2.patch, 
 hive-460-2009-06-10-2.patch, HIVE-460.1.patch


 This is required for column-based table format.




[jira] Updated: (HIVE-555) create temporary function support not only udf, but also udaf, genericudf, etc.

2009-06-10 Thread Min Zhou (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Min Zhou updated HIVE-555:
--

Attachment: HIVE-555-1.patch

Patch without a test case.

 create temporary function support not only udf, but also udaf,  genericudf, 
 etc.
 

 Key: HIVE-555
 URL: https://issues.apache.org/jira/browse/HIVE-555
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.4.0
Reporter: Min Zhou
Assignee: Min Zhou
 Fix For: 0.4.0

 Attachments: HIVE-555-1.patch


 Right now, the 'create temporary function' command only supports UDFs. 
 We could also let users register their own UDAFs and generic UDFs and, in 
 the future, generic UDAFs.




[jira] Updated: (HIVE-521) Move size, if, isnull, isnotnull to GenericUDF

2009-06-10 Thread Min Zhou (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Min Zhou updated HIVE-521:
--

Attachment: HIVE-521-all-v2.patch

Fixed the issues Zheng commented on; added UDFArgumentException and 
UDFArgumentLengthException.


 Move size, if, isnull, isnotnull to GenericUDF
 --

 Key: HIVE-521
 URL: https://issues.apache.org/jira/browse/HIVE-521
 Project: Hadoop Hive
  Issue Type: Improvement
Affects Versions: 0.4.0
Reporter: Zheng Shao
Assignee: Min Zhou
 Fix For: 0.4.0

 Attachments: HIVE-521-all-v1.patch, HIVE-521-all-v2.patch, 
 HIVE-521-IF-2.patch, HIVE-521-IF-3.patch, HIVE-521-IF-4.patch, 
 HIVE-521-IF-5.patch, HIVE-521-IF.patch


 See HIVE-511 for an example of the move.
 size, if, isnull, isnotnull are all implemented with UDF but they are 
 actually working on variable types of objects. We should move them to 
 GenericUDF for better type handling.
 This also helps to clean up the hack in doing type matching/type conversion 
 in UDF.




[jira] Updated: (HIVE-521) Move size, if, isnull, isnotnull to GenericUDF

2009-06-10 Thread Min Zhou (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Min Zhou updated HIVE-521:
--

Attachment: HIVE-521-all-v3.patch

Catch UDFArgumentLengthException.

 Move size, if, isnull, isnotnull to GenericUDF
 --

 Key: HIVE-521
 URL: https://issues.apache.org/jira/browse/HIVE-521
 Project: Hadoop Hive
  Issue Type: Improvement
Affects Versions: 0.4.0
Reporter: Zheng Shao
Assignee: Min Zhou
 Fix For: 0.4.0

 Attachments: HIVE-521-all-v1.patch, HIVE-521-all-v2.patch, 
 HIVE-521-all-v3.patch, HIVE-521-IF-2.patch, HIVE-521-IF-3.patch, 
 HIVE-521-IF-4.patch, HIVE-521-IF-5.patch, HIVE-521-IF.patch


 See HIVE-511 for an example of the move.
 size, if, isnull, isnotnull are all implemented with UDF but they are 
 actually working on variable types of objects. We should move them to 
 GenericUDF for better type handling.
 This also helps to clean up the hack in doing type matching/type conversion 
 in UDF.




[jira] Created: (HIVE-556) let hive support theta join

2009-06-10 Thread Min Zhou (JIRA)
let hive support theta join
---

 Key: HIVE-556
 URL: https://issues.apache.org/jira/browse/HIVE-556
 Project: Hadoop Hive
  Issue Type: New Feature
Affects Versions: 0.4.0
Reporter: Min Zhou
 Fix For: 0.4.0


Right now, Hive only supports equality joins. Sometimes that is not enough, so 
we should consider implementing theta joins such as:

{code:sql}
SELECT
  a.subid, a.id, t.url
FROM
  tbl t JOIN aux_tbl a ON t.url rlike a.url_pattern
WHERE
  t.dt='20090609'
  AND a.dt='20090609';
{code}

Any condition expression should be allowed after 'ON'.
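
To illustrate the semantics being requested, here is a minimal nested-loop sketch of a theta join in plain Java. It is not a proposal for how Hive would execute such a join: the class ThetaJoinSketch and its methods are hypothetical, rows are simplified to strings, and the rlike helper assumes "matches anywhere in the string" semantics purely for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiPredicate;
import java.util.regex.Pattern;

final class ThetaJoinSketch {
    /**
     * Pairs every left row with every right row for which the arbitrary
     * join condition holds. An equality join is the special case where the
     * condition compares two keys for equality.
     */
    static <L, R> List<String> nestedLoopJoin(List<L> left, List<R> right,
                                              BiPredicate<L, R> on) {
        List<String> out = new ArrayList<>();
        for (L l : left) {
            for (R r : right) {
                if (on.test(l, r)) {
                    out.add(l + " | " + r);
                }
            }
        }
        return out;
    }

    /** Hypothetical stand-in for the condition: t.url rlike a.url_pattern */
    static boolean rlike(String url, String pattern) {
        return Pattern.compile(pattern).matcher(url).find();
    }
}
```

Calling ThetaJoinSketch.nestedLoopJoin(urls, patterns, ThetaJoinSketch::rlike) compares every URL with every pattern. That quadratic cost is the main reason a non-equality condition is harder to support than an equality join, which can be partitioned by key.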


