[jira] Issue Comment Edited: (HIVE-338) Executing cli commands into thrift server

2009-06-10 Thread Min Zhou (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717651#action_12717651
 ] 

Min Zhou edited comment on HIVE-338 at 6/10/09 11:37 PM:
-

* exec/FunctionTask.java: is it necessary to specify the loader in the 
Class.forName call? I thought that that the current thread context loader was 
the always the first loader to be tried anyway during name resolution.
Yes, of course. the class loader holding by HiveConf is older than that of 
current thread.

this pacth support dfs, add/delete file/jar, set now.  

btw, Joydeep, would you do me a favor writing some test code that I am not 
familiar with?  you know, ' add jar'  need a separate jar, and i not quite sure 
how to organize them.

  was (Author: coderplay):

* exec/FunctionTask.java: is it necessary to specify the loader in the 
Class.forName call? I thought that that the current thread context loader was 
the always the first loader to be tried anyway during name resolution.
Yes, of course. the class loader holding by HiveConf is older than that of 
current thread.

this pacth support dfs, add/delete file/jar, set now.  

btw, Joydeep, would you do me a favor writing some test code that I' am not 
familiar with it ?  you know, ' add jar'  need a separate jar, and i not quite 
sure how to organize them.
  
> Executing cli commands into thrift server
> -
>
> Key: HIVE-338
> URL: https://issues.apache.org/jira/browse/HIVE-338
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Server Infrastructure
>Affects Versions: 0.3.0
>Reporter: Min Zhou
> Attachments: hiveserver-v1.patch, hiveserver-v2.patch
>
>
> Let thrift server support set, add/delete file/jar and normal HSQL query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-556) let hive support theta join

2009-06-10 Thread Min Zhou (JIRA)
let hive support theta join
---

 Key: HIVE-556
 URL: https://issues.apache.org/jira/browse/HIVE-556
 Project: Hadoop Hive
  Issue Type: New Feature
Affects Versions: 0.4.0
Reporter: Min Zhou
 Fix For: 0.4.0


Right now , hive only support equal joins .  Somethings it's not enough, we 
must consider implementing theta joins like

{code:sql}
SELECT
  a.subid, a.id, t.url
FROM
  tbl t JOIN aux_tbl a ON t.url rlike a.url_pattern
WHERE
  t.dt='20090609'
  AND a.dt='20090609';
{code}

any condition expression following 'ON' is  appropriate.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-521) Move size, if, isnull, isnotnull to GenericUDF

2009-06-10 Thread Min Zhou (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Min Zhou updated HIVE-521:
--

Attachment: HIVE-521-all-v3.patch

catch UDFArgumentLengthException.

> Move size, if, isnull, isnotnull to GenericUDF
> --
>
> Key: HIVE-521
> URL: https://issues.apache.org/jira/browse/HIVE-521
> Project: Hadoop Hive
>  Issue Type: Improvement
>Affects Versions: 0.4.0
>Reporter: Zheng Shao
>Assignee: Min Zhou
> Fix For: 0.4.0
>
> Attachments: HIVE-521-all-v1.patch, HIVE-521-all-v2.patch, 
> HIVE-521-all-v3.patch, HIVE-521-IF-2.patch, HIVE-521-IF-3.patch, 
> HIVE-521-IF-4.patch, HIVE-521-IF-5.patch, HIVE-521-IF.patch
>
>
> See HIVE-511 for an example of the move.
> size, if, isnull, isnotnull are all implemented with UDF but they are 
> actually working on variable types of objects. We should move them to 
> GenericUDF for better type handling.
> This also helps to clean up the hack in doing type matching/type conversion 
> in UDF.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-521) Move size, if, isnull, isnotnull to GenericUDF

2009-06-10 Thread Min Zhou (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Min Zhou updated HIVE-521:
--

Attachment: HIVE-521-all-v2.patch

fixed issues commented by Zheng,  UDFArgumentException and 
UDFArgumentLengthException added.


> Move size, if, isnull, isnotnull to GenericUDF
> --
>
> Key: HIVE-521
> URL: https://issues.apache.org/jira/browse/HIVE-521
> Project: Hadoop Hive
>  Issue Type: Improvement
>Affects Versions: 0.4.0
>Reporter: Zheng Shao
>Assignee: Min Zhou
> Fix For: 0.4.0
>
> Attachments: HIVE-521-all-v1.patch, HIVE-521-all-v2.patch, 
> HIVE-521-IF-2.patch, HIVE-521-IF-3.patch, HIVE-521-IF-4.patch, 
> HIVE-521-IF-5.patch, HIVE-521-IF.patch
>
>
> See HIVE-511 for an example of the move.
> size, if, isnull, isnotnull are all implemented with UDF but they are 
> actually working on variable types of objects. We should move them to 
> GenericUDF for better type handling.
> This also helps to clean up the hack in doing type matching/type conversion 
> in UDF.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-549) UNION ALL statements should be run in parallel

2009-06-10 Thread He Yongqiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718074#action_12718074
 ] 

He Yongqiang commented on HIVE-549:
---

Shoud we edit this issue to "parallel execution mechanism"? or we can open a 
new jira and link this to that.

> UNION ALL statements should be run in parallel
> --
>
> Key: HIVE-549
> URL: https://issues.apache.org/jira/browse/HIVE-549
> Project: Hadoop Hive
>  Issue Type: Wish
>  Components: Query Processor
>Reporter: Adam Kramer
>
> In a massively parallel database system, it would be awesome to also 
> parallelize some of the mapreduce phases that our data needs to go through.
> One example that just occurred to me is UNION ALL: when you union two SELECT 
> statements, effectively you could run those statements in parallel. There's 
> no situation (that I can think of, but I don't have a formal proof) in which 
> the left statement would rely on the right statement, or vice versa. So, they 
> could be run at the same time...and perhaps they should be. Or, perhaps there 
> should be a way to make this happen...PARALLEL UNION ALL? PUNION ALL?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-555) create temporary function support not only udf, but also udaf, genericudf, etc.

2009-06-10 Thread Min Zhou (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Min Zhou updated HIVE-555:
--

Attachment: HIVE-555-1.patch

patch w/o testcase

> create temporary function support not only udf, but also udaf,  genericudf, 
> etc.
> 
>
> Key: HIVE-555
> URL: https://issues.apache.org/jira/browse/HIVE-555
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Affects Versions: 0.4.0
>Reporter: Min Zhou
>Assignee: Min Zhou
> Fix For: 0.4.0
>
> Attachments: HIVE-555-1.patch
>
>
> Right now, command 'create temporary function' only support  udf. 
> we can also let user write their udaf, generic udf, and write generic udaf in 
> the future. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-460) Improve ColumnPruner to prune more aggressively and keep column information for input tables

2009-06-10 Thread He Yongqiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

He Yongqiang updated HIVE-460:
--

Attachment: hive-460-2009-06-10-2.patch

A new patch against trunk code(hive-460-2009-06-10-2.patch) and it passed all 
TestCliDriver and TestParse in my local.

> Improve ColumnPruner to prune more aggressively and keep column information 
> for input tables
> 
>
> Key: HIVE-460
> URL: https://issues.apache.org/jira/browse/HIVE-460
> Project: Hadoop Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Affects Versions: 0.4.0
>Reporter: Zheng Shao
>Assignee: He Yongqiang
> Attachments: hive-460-2009-05-29-noaliasallowed.patch, 
> hive-460-2009-06-04-2-195mapjoin.patch, 
> hive-460-2009-06-04-testcli-passed.patch, hive-460-2009-06-06-2.patch, 
> hive-460-2009-06-10-2.patch, HIVE-460.1.patch
>
>
> This is required for column-based table format.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-555) create temporary function support not only udf, but also udaf, genericudf, etc.

2009-06-10 Thread Min Zhou (JIRA)
create temporary function support not only udf, but also udaf,  genericudf, etc.


 Key: HIVE-555
 URL: https://issues.apache.org/jira/browse/HIVE-555
 Project: Hadoop Hive
  Issue Type: Improvement
  Components: Query Processor
Affects Versions: 0.4.0
Reporter: Min Zhou
Assignee: Min Zhou
 Fix For: 0.4.0


Right now, command 'create temporary function' only support  udf. 
we can also let user write their udaf, generic udf, and write generic udaf in 
the future. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



JIRA_HIVE-547.2.patch_UNIT_TEST_SUCCEEDED

2009-06-10 Thread Murli Varadachari

SUCCESS: BUILD AND UNIT TEST using PATCH HIVE-547.2.patch PASSED!!