[jira] Updated: (HIVE-1497) support COMMENT clause on CREATE INDEX, and add new command for SHOW INDEXES

2010-11-12 Thread John Sichi (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Sichi updated HIVE-1497:
-

  Resolution: Fixed
Hadoop Flags: [Reviewed]
  Status: Resolved  (was: Patch Available)

Committed.  Thanks Russell!


> support COMMENT clause on CREATE INDEX, and add new command for SHOW INDEXES
> 
>
> Key: HIVE-1497
> URL: https://issues.apache.org/jira/browse/HIVE-1497
> Project: Hive
>  Issue Type: Improvement
>  Components: Indexing
>Affects Versions: 0.7.0
>Reporter: John Sichi
>Assignee: Russell Melick
> Fix For: 0.7.0
>
> Attachments: HIVE-1497.4.patch, HIVE-1497.5.patch, HIVE-1497.6.patch, 
> HIVE-1497.7.patch, HIVE-1497.8.patch, hive-1497.p1.patch, hive-1497.p2.patch, 
> hive-1497.p3.patch
>
>
> We need to work out the syntax for SHOW/DESCRIBE, taking partitioning into 
> account.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1764) Add command for DESCRIBE indexes

2010-11-12 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931628#action_12931628
 ] 

John Sichi commented on HIVE-1764:
--

Also, one other followup from HIVE-1497:

* Use MetaStoreUtils.getColumnNamesFromFieldSchema in 
getAllColumnsInformation(Index)

> Add command for DESCRIBE indexes
> 
>
> Key: HIVE-1764
> URL: https://issues.apache.org/jira/browse/HIVE-1764
> Project: Hive
>  Issue Type: Improvement
>  Components: Indexing
>Affects Versions: 0.7.0
>Reporter: Russell Melick
>Assignee: Jeffrey Lym
> Fix For: 0.7.0
>
>
> Followup for HIVE-1497
>  * Work out the syntax for DESCRIBE, taking partitioning into account.
>  * Allow regular expressions for selecting tables (like SHOW TABLES )
>  * Enable showing indexes which are only on specific partitions

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1642) Convert join queries to map-join based on size of table/row

2010-11-12 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931620#action_12931620
 ] 

Namit Jain commented on HIVE-1642:
--

great work Liyin - I will take a look

> Convert join queries to map-join based on size of table/row
> ---
>
> Key: HIVE-1642
> URL: https://issues.apache.org/jira/browse/HIVE-1642
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Liyin Tang
> Fix For: 0.7.0
>
> Attachments: hive_1642_1.patch
>
>
> Based on the number of rows and size of each table, Hive should automatically 
> be able to convert a join into map-join.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1790) Patch to support HAVING clause in Hive

2010-11-12 Thread Vaibhav Aggarwal (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vaibhav Aggarwal updated HIVE-1790:
---

Attachment: HIVE-1790.patch

> Patch to support HAVING clause in Hive
> --
>
> Key: HIVE-1790
> URL: https://issues.apache.org/jira/browse/HIVE-1790
> Project: Hive
>  Issue Type: Improvement
>Reporter: Vaibhav Aggarwal
>Assignee: Vaibhav Aggarwal
> Attachments: HIVE-1790.patch
>
>
> Currently Hive users have to do nested queries in order to apply filter on 
> group by expressions.
> This patch allows users to directly apply filter on group by expressions by 
> using HAVING clause.
> This patch also helps us integrate Hive with other data analysis tools which 
> rely on HAVING expression.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-1790) Patch to support HAVING clause in Hive

2010-11-12 Thread Vaibhav Aggarwal (JIRA)
Patch to support HAVING clause in Hive
--

 Key: HIVE-1790
 URL: https://issues.apache.org/jira/browse/HIVE-1790
 Project: Hive
  Issue Type: Improvement
Reporter: Vaibhav Aggarwal
Assignee: Vaibhav Aggarwal


Currently Hive users have to do nested queries in order to apply filter on 
group by expressions.
This patch allows users to directly apply filter on group by expressions by 
using HAVING clause.
This patch also helps us integrate Hive with other data analysis tools which 
rely on HAVING expression.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1642) Convert join queries to map-join based on size of table/row

2010-11-12 Thread Liyin Tang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liyin Tang updated HIVE-1642:
-

Attachment: hive_1642_1.patch

> Convert join queries to map-join based on size of table/row
> ---
>
> Key: HIVE-1642
> URL: https://issues.apache.org/jira/browse/HIVE-1642
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Liyin Tang
> Fix For: 0.7.0
>
> Attachments: hive_1642_1.patch
>
>
> Based on the number of rows and size of each table, Hive should automatically 
> be able to convert a join into map-join.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1642) Convert join queries to map-join based on size of table/row

2010-11-12 Thread Liyin Tang (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931599#action_12931599
 ] 

Liyin Tang commented on HIVE-1642:
--

I just finished converting common join into map join based on the file size.  
There are 2 flags to control this optimization.
1)  set hive.auto.convert.join = true; It means this optimization is 
enabled. By default right now, this flag is disabled in order not to break any 
existing test cases. Also I put 25 additional test cases, auto_join0.q - 
auto_join25.q, which covers this optimization code.
2)  Set hive.hashtable.max.memory.usage = 0.9;  It means if the memory 
usage of local task is more than 90% of its heap size, then the local task will 
abort by itself. The Driver will know the local work fails and it won't submit 
the MapJoinTask (a Map Only MapRedTask)  to Hadoop, but instead, it will submit 
the originally CommonJoinTask to Hadoop to run.
3)  Set hive.smalltable.filesize = 2500L;  It means if the summary of 
the small table file size is less than 25M, then it will run the map join task. 
If not, just run the originally common join task.
 The following is the basic flow how it works. For each common join, create a 
conditional task.
1)  For each join table, generate a mapjoin task by assuming this table is 
big table. 
a.  The left side of right outer join must be small table.
b.  The right side of left outer join must be small table.
c.  No full outer join can be optimized. 
d.  Eg. A left outer join B right outer join C. Only C can be big table 
table.
e.  Eg. A right outer join B left outer join C. Only B can be big table 
table.
f.  Eg. A left outer join B left outer join C. Only A can be big table 
table.
g.  Eg. A right outer join B right outer join C. Both B and C can be big 
table table.
2)  Put all these generated map join tasks into conditional task and set 
the mapping between big table's alias with the corresponding map join task.
3)  During the execution time, the resolver will read the input file size. 
If the input file size of small table is less than a threshold, than run the 
converted map join task. 
4)  Set each map join task with a backup task. The backup task is the 
originally common join task.
This mapping relationship is set during execution time.
5)  If the map join task return abnormally, launch the backup task.



> Convert join queries to map-join based on size of table/row
> ---
>
> Key: HIVE-1642
> URL: https://issues.apache.org/jira/browse/HIVE-1642
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Liyin Tang
> Fix For: 0.7.0
>
>
> Based on the number of rows and size of each table, Hive should automatically 
> be able to convert a join into map-join.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-1789) Extend metastore schema to store column properties

2010-11-12 Thread Ashish Thusoo (JIRA)
Extend metastore schema to store column properties
--

 Key: HIVE-1789
 URL: https://issues.apache.org/jira/browse/HIVE-1789
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Reporter: Ashish Thusoo
Assignee: Ashish Thusoo


A common use case in data warehousing deployments is to set column level 
policies, e.g. certain columns in the table need to be retained for a longer 
period than others. Without column level properties, you would have to encode 
this information into table properties and maintain those properties as columns 
change. A better solution is to have properties associated with columns.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1788) Add more calls to the metastore thrift interface

2010-11-12 Thread Ashish Thusoo (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931533#action_12931533
 ] 

Ashish Thusoo commented on HIVE-1788:
-

3 can be addressed by the current thrift interface so only an iterator over 
tables owned by particular users is needed.

> Add more calls to the metastore thrift interface
> 
>
> Key: HIVE-1788
> URL: https://issues.apache.org/jira/browse/HIVE-1788
> Project: Hive
>  Issue Type: New Feature
>Reporter: Ashish Thusoo
>Assignee: Ashish Thusoo
>
> For administrative purposes the following calls to the metastore thrift 
> interface would be very useful:
> 1. Get the table metadata for all the tables owned by a particular users
> 2. Ability to iterate over this set of tables
> 3. Ability to change a particular key value property of the table

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-78) Authorization infrastructure for Hive

2010-11-12 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-78?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931518#action_12931518
 ] 

Namit Jain commented on HIVE-78:


Also, can you refresh and re-apply the patch ? It does not apply cleanly and is 
therefore not possible to actually compile/test and understand.

> Authorization infrastructure for Hive
> -
>
> Key: HIVE-78
> URL: https://issues.apache.org/jira/browse/HIVE-78
> Project: Hive
>  Issue Type: New Feature
>  Components: Metastore, Query Processor, Server Infrastructure
>Reporter: Ashish Thusoo
>Assignee: He Yongqiang
> Attachments: createuser-v1.patch, hive-78-metadata-v1.patch, 
> hive-78-syntax-v1.patch, HIVE-78.1.nothrift.patch, HIVE-78.1.thrift.patch, 
> HIVE-78.2.nothrift.patch, HIVE-78.2.thrift.patch, hive-78.diff
>
>
> Allow hive to integrate with existing user repositories for authentication 
> and authorization infromation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (HIVE-1788) Add more calls to the metastore thrift interface

2010-11-12 Thread Ashish Thusoo (JIRA)
Add more calls to the metastore thrift interface


 Key: HIVE-1788
 URL: https://issues.apache.org/jira/browse/HIVE-1788
 Project: Hive
  Issue Type: New Feature
Reporter: Ashish Thusoo
Assignee: Ashish Thusoo


For administrative purposes the following calls to the metastore thrift 
interface would be very useful:

1. Get the table metadata for all the tables owned by a particular users
2. Ability to iterate over this set of tables
3. Ability to change a particular key value property of the table


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1497) support COMMENT clause on CREATE INDEX, and add new command for SHOW INDEXES

2010-11-12 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931512#action_12931512
 ] 

John Sichi commented on HIVE-1497:
--

+1.  Will commit when tests pass.

You could have used MetaStoreUtils.getColumnNamesFromFieldSchema, but I'll put 
a cleanup note for that on HIVE-1764 so we can get this one committed as is 
(assuming tests pass).


> support COMMENT clause on CREATE INDEX, and add new command for SHOW INDEXES
> 
>
> Key: HIVE-1497
> URL: https://issues.apache.org/jira/browse/HIVE-1497
> Project: Hive
>  Issue Type: Improvement
>  Components: Indexing
>Affects Versions: 0.7.0
>Reporter: John Sichi
>Assignee: Russell Melick
> Fix For: 0.7.0
>
> Attachments: HIVE-1497.4.patch, HIVE-1497.5.patch, HIVE-1497.6.patch, 
> HIVE-1497.7.patch, HIVE-1497.8.patch, hive-1497.p1.patch, hive-1497.p2.patch, 
> hive-1497.p3.patch
>
>
> We need to work out the syntax for SHOW/DESCRIBE, taking partitioning into 
> account.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-78) Authorization infrastructure for Hive

2010-11-12 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-78?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931504#action_12931504
 ] 

Namit Jain commented on HIVE-78:


Can you add the tests in the non-thrift patch ? It becomes easier to review

> Authorization infrastructure for Hive
> -
>
> Key: HIVE-78
> URL: https://issues.apache.org/jira/browse/HIVE-78
> Project: Hive
>  Issue Type: New Feature
>  Components: Metastore, Query Processor, Server Infrastructure
>Reporter: Ashish Thusoo
>Assignee: He Yongqiang
> Attachments: createuser-v1.patch, hive-78-metadata-v1.patch, 
> hive-78-syntax-v1.patch, HIVE-78.1.nothrift.patch, HIVE-78.1.thrift.patch, 
> HIVE-78.2.nothrift.patch, HIVE-78.2.thrift.patch, hive-78.diff
>
>
> Allow hive to integrate with existing user repositories for authentication 
> and authorization infromation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1786) better documentation for str_to_map

2010-11-12 Thread Namit Jain (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-1786:
-

   Resolution: Fixed
Fix Version/s: 0.7.0
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

Committed. Thanks Siying

> better documentation for str_to_map
> ---
>
> Key: HIVE-1786
> URL: https://issues.apache.org/jira/browse/HIVE-1786
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Siying Dong
> Fix For: 0.7.0
>
> Attachments: HIVE.1786.1.patch
>
>
> Currently, desc and desc extended return the same info.
> There is no mention of defaults anywhere

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1786) better documentation for str_to_map

2010-11-12 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931448#action_12931448
 ] 

Namit Jain commented on HIVE-1786:
--

+1

> better documentation for str_to_map
> ---
>
> Key: HIVE-1786
> URL: https://issues.apache.org/jira/browse/HIVE-1786
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Siying Dong
> Attachments: HIVE.1786.1.patch
>
>
> Currently, desc and desc extended return the same info.
> There is no mention of defaults anywhere

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (HIVE-1538) FilterOperator is applied twice with ppd on.

2010-11-12 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931447#action_12931447
 ] 

Namit Jain commented on HIVE-1538:
--

Talked to Amareshwari offline - the proposal looks good.
As a follow-up, we can merge consecutive filters in a follow-up task

> FilterOperator is applied twice with ppd on.
> 
>
> Key: HIVE-1538
> URL: https://issues.apache.org/jira/browse/HIVE-1538
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Reporter: Amareshwari Sriramadasu
>Assignee: Amareshwari Sriramadasu
>
> With hive.optimize.ppd set to true, FilterOperator is applied twice. And it 
> seems second operator is always filtering zero rows.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1786) better documentation for str_to_map

2010-11-12 Thread Siying Dong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siying Dong updated HIVE-1786:
--

Status: Patch Available  (was: Open)

> better documentation for str_to_map
> ---
>
> Key: HIVE-1786
> URL: https://issues.apache.org/jira/browse/HIVE-1786
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Siying Dong
> Attachments: HIVE.1786.1.patch
>
>
> Currently, desc and desc extended return the same info.
> There is no mention of defaults anywhere

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1786) better documentation for str_to_map

2010-11-12 Thread Siying Dong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siying Dong updated HIVE-1786:
--

Attachment: HIVE.1786.1.patch

Added "describe function extend" to function str_to_map(), mentioning the 
default values.

> better documentation for str_to_map
> ---
>
> Key: HIVE-1786
> URL: https://issues.apache.org/jira/browse/HIVE-1786
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Siying Dong
> Attachments: HIVE.1786.1.patch
>
>
> Currently, desc and desc extended return the same info.
> There is no mention of defaults anywhere

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.