[jira] [Commented] (HIVE-4044) Add URL type

2013-06-12 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13681573#comment-13681573
 ] 

Samuel Yuan commented on HIVE-4044:
---

I tried breaking the URL into parts and encoding them as individual columns; 
the dictionary shrunk, but the overhead of the other ORC columns introduced 
(mostly the column of indices) made a bigger impact, so compression was 
actually worse overall. I also tried storing the query string as a map and 
putting common keys into separate columns; this improved compression somewhat, 
but still not enough to offset the overhead of new columns for the query string.

> Add URL type
> 
>
> Key: HIVE-4044
> URL: https://issues.apache.org/jira/browse/HIVE-4044
> Project: Hive
>  Issue Type: Improvement
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4044.HIVE-4044.HIVE-4044.D8799.1.patch
>
>
> Having a separate type for URLs would enable improvements in storage 
> efficiency based on breaking up a URL into its components. The new type will 
> be named "URL" and made a non-reserved keyword (see HIVE-701).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4492) Revert HIVE-4322

2013-05-03 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4492:
--

Status: Patch Available  (was: Open)

> Revert HIVE-4322
> 
>
> Key: HIVE-4492
> URL: https://issues.apache.org/jira/browse/HIVE-4492
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore, Thrift API
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4492.1.patch.txt
>
>
> See HIVE-4432 and HIVE-4433. It's possible to work around these issues but a 
> better solution is probably to roll back the "fix" and change the API to use 
> a primitive type as the map key (in a backwards-compatible manner).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4492) Revert HIVE-4322

2013-05-03 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4492:
--

Attachment: HIVE-4492.1.patch.txt

> Revert HIVE-4322
> 
>
> Key: HIVE-4492
> URL: https://issues.apache.org/jira/browse/HIVE-4492
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore, Thrift API
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4492.1.patch.txt
>
>
> See HIVE-4432 and HIVE-4433. It's possible to work around these issues but a 
> better solution is probably to roll back the "fix" and change the API to use 
> a primitive type as the map key (in a backwards-compatible manner).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4433) Fix C++ Thrift bindings broken in HIVE-4322

2013-05-03 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13648683#comment-13648683
 ] 

Samuel Yuan commented on HIVE-4433:
---

I'm thinking it's possible to work around this by defining '<' since it's 
present in the auto-generated header file. Given that other language bindings 
might also have been broken by HIVE-4322 though it's probably better to change 
the map key to a primitive type instead. I have filed HIVE-4492 to revert the 
original change.

> Fix C++ Thrift bindings broken in HIVE-4322
> ---
>
> Key: HIVE-4433
> URL: https://issues.apache.org/jira/browse/HIVE-4433
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore, Thrift API
>Affects Versions: 0.12.0
>Reporter: Carl Steinbach
>Assignee: Samuel Yuan
>Priority: Blocker
> Fix For: 0.12.0
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4492) Revert HIVE-4322

2013-05-03 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4492:
-

 Summary: Revert HIVE-4322
 Key: HIVE-4492
 URL: https://issues.apache.org/jira/browse/HIVE-4492
 Project: Hive
  Issue Type: Bug
  Components: Metastore, Thrift API
Reporter: Samuel Yuan
Assignee: Samuel Yuan


See HIVE-4432 and HIVE-4433. It's possible to work around these issues but a 
better solution is probably to roll back the "fix" and change the API to use a 
primitive type as the map key (in a backwards-compatible manner).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4474) Column access not tracked properly for partitioned tables

2013-05-01 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4474:
--

Status: Patch Available  (was: Open)

> Column access not tracked properly for partitioned tables
> -
>
> Key: HIVE-4474
> URL: https://issues.apache.org/jira/browse/HIVE-4474
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4474.1.patch.txt
>
>
> The columns recorded as being accessed is incorrect for partitioned tables. 
> The index of accessed columns is a position in the list of non-partition 
> columns, but a list of all columns is being used right now to do the lookup.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4474) Column access not tracked properly for partitioned tables

2013-05-01 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4474:
--

Attachment: HIVE-4474.1.patch.txt

> Column access not tracked properly for partitioned tables
> -
>
> Key: HIVE-4474
> URL: https://issues.apache.org/jira/browse/HIVE-4474
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4474.1.patch.txt
>
>
> The columns recorded as being accessed is incorrect for partitioned tables. 
> The index of accessed columns is a position in the list of non-partition 
> columns, but a list of all columns is being used right now to do the lookup.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4474) Column access not tracked properly for partitioned tables

2013-05-01 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4474:
-

 Summary: Column access not tracked properly for partitioned tables
 Key: HIVE-4474
 URL: https://issues.apache.org/jira/browse/HIVE-4474
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Samuel Yuan
Assignee: Samuel Yuan


The columns recorded as being accessed is incorrect for partitioned tables. The 
index of accessed columns is a position in the list of non-partition columns, 
but a list of all columns is being used right now to do the lookup.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4221) Stripe-level merge for ORC files

2013-04-30 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4221:
--

Status: Patch Available  (was: Open)

> Stripe-level merge for ORC files
> 
>
> Key: HIVE-4221
> URL: https://issues.apache.org/jira/browse/HIVE-4221
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4221.HIVE-4221.HIVE-4221.HIVE-4221.D9759.1.patch, 
> HIVE-4221.HIVE-4221.HIVE-4221.HIVE-4221.D9759.2.patch
>
>
> As with RC files, we would like to be able to merge ORC files efficiently by 
> reading/writing stripes without decompressing/recompressing them. This will 
> be similar to the RC file merge, except that footers will have to be updated 
> with the stripe positions in the new file.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4433) Fix C++ Thrift bindings broken in HIVE-4322

2013-04-26 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643497#comment-13643497
 ] 

Samuel Yuan commented on HIVE-4433:
---

The problem is that we were originally using a list as the map key, which is 
worse (at least for Python). Other alternatives were considered, but since the 
list could contain arbitrary strings, replacing it with a struct wrapper seemed 
to be the best solution.

Would you mind posting a log of what's breaking in C++?

> Fix C++ Thrift bindings broken in HIVE-4322
> ---
>
> Key: HIVE-4433
> URL: https://issues.apache.org/jira/browse/HIVE-4433
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore, Thrift API
>Affects Versions: 0.11.0
>Reporter: Carl Steinbach
>Assignee: Samuel Yuan
>Priority: Blocker
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4432) Follow-up to HIVE-4322 - make metastore API changes backwards compatible

2013-04-26 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4432:
--

Description: Right now the fix for HIVE-4322 makes different versions of 
the metastore server and client incompatible with each other. This can make 
deployment very painful.

> Follow-up to HIVE-4322 - make metastore API changes backwards compatible
> 
>
> Key: HIVE-4432
> URL: https://issues.apache.org/jira/browse/HIVE-4432
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore, Thrift API
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>
> Right now the fix for HIVE-4322 makes different versions of the metastore 
> server and client incompatible with each other. This can make deployment very 
> painful.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4432) Follow-up to HIVE-4322 - make metastore API changes backwards compatible

2013-04-26 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4432:
-

 Summary: Follow-up to HIVE-4322 - make metastore API changes 
backwards compatible
 Key: HIVE-4432
 URL: https://issues.apache.org/jira/browse/HIVE-4432
 Project: Hive
  Issue Type: Bug
  Components: Metastore, Thrift API
Reporter: Samuel Yuan
Assignee: Samuel Yuan




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4344) CREATE VIEW fails when redundant casts are rewritten

2013-04-12 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4344:
--

Status: Patch Available  (was: Open)

> CREATE VIEW fails when redundant casts are rewritten
> 
>
> Key: HIVE-4344
> URL: https://issues.apache.org/jira/browse/HIVE-4344
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4344.HIVE-4344.HIVE-4344.HIVE-4344.D10221.1.patch
>
>
> e.g. create view v as select cast(key as string) from src;
> The rewriter tries to replace both cast(key as string) and key as 
> `src`.`key`, because cast(key as string) is a no-op.
> There may be other cases like this one.
> See HIVE-2439 for context.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4344) CREATE VIEW fails when redundant casts are rewritten

2013-04-12 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4344:
--

Description: 
e.g. create view v as select cast(key as string) from src;

The rewriter tries to replace both cast(key as string) and key as `src`.`key`, 
because cast(key as string) is a no-op.

There may be other cases like this one.

See HIVE-2439 for context.

  was:
e.g. create view v as select cast(key as string) from src;

The rewriter tries to replace both cast(key as string) and key as `src`.`key`, 
because cast(key as string) is a no-op.

There may be other cases like this one.


> CREATE VIEW fails when redundant casts are rewritten
> 
>
> Key: HIVE-4344
> URL: https://issues.apache.org/jira/browse/HIVE-4344
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>
> e.g. create view v as select cast(key as string) from src;
> The rewriter tries to replace both cast(key as string) and key as 
> `src`.`key`, because cast(key as string) is a no-op.
> There may be other cases like this one.
> See HIVE-2439 for context.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4322) SkewedInfo in Metastore Thrift API cannot be deserialized in Python

2013-04-11 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4322:
--

Status: Patch Available  (was: Open)

> SkewedInfo in Metastore Thrift API cannot be deserialized in Python
> ---
>
> Key: HIVE-4322
> URL: https://issues.apache.org/jira/browse/HIVE-4322
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore, Thrift API
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-4322.HIVE-4322.HIVE-4322.HIVE-4322.D10203.1.patch
>
>
> The Thrift-generated Python code that deserializes Thrift objects fails 
> whenever a complex type is used as a map key, because by default mutable 
> Python objects such as lists do not have a hash function. See 
> https://issues.apache.org/jira/browse/THRIFT-162 for related discussion.
> The SkewedInfo struct contains a map which uses a list as a key, breaking the 
> Python Thrift interface. It is not possible to specify the mapping from 
> Thrift types to Python types, or otherwise we could map Thrift lists to 
> Python tuples. Instead, the proposed workaround wraps the list inside a new 
> struct. This alone does not accomplish anything, but allows Python clients to 
> define a hash function for the struct class, e.g.:
> def f(object):
> return hash(tuple(object.skewedValueList))
> SkewedValueList.__hash__ = f
> In practice a more efficient hash might be defined that does not involve 
> copying the list. The advantage of wrapping the list inside a struct is that 
> the client does not have to define the hash on the list itself, which would 
> change the behaviour of lists everywhere else in the code.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4344) CREATE VIEW fails when redundant casts are rewritten

2013-04-11 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4344:
-

 Summary: CREATE VIEW fails when redundant casts are rewritten
 Key: HIVE-4344
 URL: https://issues.apache.org/jira/browse/HIVE-4344
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Samuel Yuan
Assignee: Samuel Yuan


e.g. create view v as select cast(key as string) from src;

The rewriter tries to replace both cast(key as string) and key as `src`.`key`, 
because cast(key as string) is a no-op.

There may be other cases like this one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4337) Update list bucketing test results

2013-04-10 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4337:
--

Status: Patch Available  (was: Open)

> Update list bucketing test results
> --
>
> Key: HIVE-4337
> URL: https://issues.apache.org/jira/browse/HIVE-4337
> Project: Hive
>  Issue Type: Test
>  Components: Tests
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Trivial
> Attachments: HIVE-4337.HIVE-4337.HIVE-4337.D10131.1.patch
>
>
> A recent change resulted in different output for the list bucketing tests, 
> which run for Hadoop23. The output files were not updated to reflect this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4337) Update list bucketing test results

2013-04-10 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4337:
--

Status: Open  (was: Patch Available)

> Update list bucketing test results
> --
>
> Key: HIVE-4337
> URL: https://issues.apache.org/jira/browse/HIVE-4337
> Project: Hive
>  Issue Type: Test
>  Components: Tests
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Trivial
> Attachments: HIVE-4337.HIVE-4337.HIVE-4337.D10131.1.patch
>
>
> A recent change resulted in different output for the list bucketing tests, 
> which run for Hadoop23. The output files were not updated to reflect this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4337) Update list bucketing test results

2013-04-10 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4337:
-

 Summary: Update list bucketing test results
 Key: HIVE-4337
 URL: https://issues.apache.org/jira/browse/HIVE-4337
 Project: Hive
  Issue Type: Test
  Components: Tests
Affects Versions: 0.11.0
Reporter: Samuel Yuan
Assignee: Samuel Yuan
Priority: Trivial


A recent change resulted in different output for the list bucketing tests, 
which run for Hadoop23. The output files were not updated to reflect this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4322) SkewedInfo in Metastore Thrift API cannot be deserialized in Python

2013-04-09 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4322:
-

 Summary: SkewedInfo in Metastore Thrift API cannot be deserialized 
in Python
 Key: HIVE-4322
 URL: https://issues.apache.org/jira/browse/HIVE-4322
 Project: Hive
  Issue Type: Bug
  Components: Metastore, Thrift API
Affects Versions: 0.11.0
Reporter: Samuel Yuan
Assignee: Samuel Yuan
Priority: Minor


The Thrift-generated Python code that deserializes Thrift objects fails 
whenever a complex type is used as a map key, because by default mutable Python 
objects such as lists do not have a hash function. See 
https://issues.apache.org/jira/browse/THRIFT-162 for related discussion.

The SkewedInfo struct contains a map which uses a list as a key, breaking the 
Python Thrift interface. It is not possible to specify the mapping from Thrift 
types to Python types, or otherwise we could map Thrift lists to Python tuples. 
Instead, the proposed workaround wraps the list inside a new struct. This alone 
does not accomplish anything, but allows Python clients to define a hash 
function for the struct class, e.g.:

def f(object):
return hash(tuple(object.skewedValueList))

SkewedValueList.__hash__ = f

In practice a more efficient hash might be defined that does not involve 
copying the list. The advantage of wrapping the list inside a struct is that 
the client does not have to define the hash on the list itself, which would 
change the behaviour of lists everywhere else in the code.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4067) Followup to HIVE-701: reduce ambiguity in grammar

2013-04-03 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4067:
--

Status: Patch Available  (was: Open)

> Followup to HIVE-701: reduce ambiguity in grammar
> -
>
> Key: HIVE-4067
> URL: https://issues.apache.org/jira/browse/HIVE-4067
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-4067.D8883.1.patch, 
> HIVE-4067.HIVE-4067.HIVE-4067.HIVE-4067.D8883.2.patch
>
>
> After HIVE-701 the grammar has become much more ambiguous, and the 
> compilation generates a large number of warnings. Making FROM, DISTINCT, 
> PRESERVE, COLUMN, ALL, AND, OR, and NOT reserved keywords again reduces the 
> number of warnings to 134, up from the original 81 warnings but down from the 
> 565 after HIVE-701. Most of the remaining ambiguity is trivial, an example 
> being "KW_ELEM_TYPE | KW_KEY_TYPE | KW_VALUE_TYPE | identifier", and they are 
> all correctly handled by ANTLR.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4289) HCatalog build fails when behind a firewall

2013-04-03 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4289:
--

Status: Patch Available  (was: Open)

> HCatalog build fails when behind a firewall
> ---
>
> Key: HIVE-4289
> URL: https://issues.apache.org/jira/browse/HIVE-4289
> Project: Hive
>  Issue Type: Bug
>  Components: Build Infrastructure, HCatalog
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4289.HIVE-4289.HIVE-4289.HIVE-4289.D9921.1.patch
>
>
> A bug in Maven makes it impossible to set a proxy for a Maven Ant POM task 
> (see https://jira.codehaus.org/browse/MANTTASKS-216). Building behind a 
> firewall results in the following error:
> [artifact:pom] Downloading: org/apache/apache/11/apache-11.pom from 
> repository central at http://repo1.maven.org/maven2
> [artifact:pom] Transferring 14K from central
> [artifact:pom] [WARNING] Unable to get resource 'org.apache:apache:pom:11' 
> from repository central (http://repo1.maven.org/maven2): Error transferring 
> file: No route to host
> [artifact:pom] An error has occurred while processing the Maven artifact 
> tasks.
> [artifact:pom]  Diagnosis:
> [artifact:pom]
> [artifact:pom] Unable to initialize POM pom.xml: Cannot find parent: 
> org.apache:apache for project: 
> org.apache.hcatalog:hcatalog:pom:0.11.0-SNAPSHOT for project 
> org.apache.hcatalog:hcatalog:pom:0.11.0-SNAPSHOT
> [artifact:pom] Unable to download the artifact from any repository
> Despite the error message, Ant/Maven is actually able to retrieve the POM 
> file by using the proxy set for Ant. However, it mysteriously fails when 
> trying to retrieve the checksum, which causes the entire operation to fail. 
> Regardless, a proxy should be set through Maven's settings.xml file. Since 
> this is not possible, the only way to build HCat behind a firewall right now 
> is to manually fetch the POM file and have Maven read it from the cache.
> Ideally we would fix this in Maven, but given that this issue has been 
> reported for a long time in a number of separate places I think it is more 
> practical to modify the HCatalog build to specify the POM as a dependency, 
> fetching it into the cache so that the artifact:pom task can succeed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4289) HCatalog build fails when behind a firewall

2013-04-02 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4289:
-

 Summary: HCatalog build fails when behind a firewall
 Key: HIVE-4289
 URL: https://issues.apache.org/jira/browse/HIVE-4289
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure, HCatalog
Affects Versions: 0.11.0
Reporter: Samuel Yuan
Assignee: Samuel Yuan


A bug in Maven makes it impossible to set a proxy for a Maven Ant POM task (see 
https://jira.codehaus.org/browse/MANTTASKS-216). Building behind a firewall 
results in the following error:

[artifact:pom] Downloading: org/apache/apache/11/apache-11.pom from repository 
central at http://repo1.maven.org/maven2
[artifact:pom] Transferring 14K from central
[artifact:pom] [WARNING] Unable to get resource 'org.apache:apache:pom:11' from 
repository central (http://repo1.maven.org/maven2): Error transferring file: No 
route to host
[artifact:pom] An error has occurred while processing the Maven artifact tasks.
[artifact:pom]  Diagnosis:
[artifact:pom]
[artifact:pom] Unable to initialize POM pom.xml: Cannot find parent: 
org.apache:apache for project: org.apache.hcatalog:hcatalog:pom:0.11.0-SNAPSHOT 
for project org.apache.hcatalog:hcatalog:pom:0.11.0-SNAPSHOT
[artifact:pom] Unable to download the artifact from any repository

Despite the error message, Ant/Maven is actually able to retrieve the POM file 
by using the proxy set for Ant. However, it mysteriously fails when trying to 
retrieve the checksum, which causes the entire operation to fail. Regardless, a 
proxy should be set through Maven's settings.xml file. Since this is not 
possible, the only way to build HCat behind a firewall right now is to manually 
fetch the POM file and have Maven read it from the cache.

Ideally we would fix this in Maven, but given that this issue has been reported 
for a long time in a number of separate places I think it is more practical to 
modify the HCatalog build to specify the POM as a dependency, fetching it into 
the cache so that the artifact:pom task can succeed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4221) Stripe-level merge for ORC files

2013-03-28 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4221:
--

Status: Patch Available  (was: Open)

> Stripe-level merge for ORC files
> 
>
> Key: HIVE-4221
> URL: https://issues.apache.org/jira/browse/HIVE-4221
> Project: Hive
>  Issue Type: Improvement
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4221.HIVE-4221.HIVE-4221.HIVE-4221.D9759.1.patch
>
>
> As with RC files, we would like to be able to merge ORC files efficiently by 
> reading/writing stripes without decompressing/recompressing them. This will 
> be similar to the RC file merge, except that footers will have to be updated 
> with the stripe positions in the new file.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4221) Stripe-level merge for ORC files

2013-03-22 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4221:
-

 Summary: Stripe-level merge for ORC files
 Key: HIVE-4221
 URL: https://issues.apache.org/jira/browse/HIVE-4221
 Project: Hive
  Issue Type: Improvement
  Components: Query Processor
Reporter: Samuel Yuan
Assignee: Samuel Yuan


As with RC files, we would like to be able to merge ORC files efficiently by 
reading/writing stripes without decompressing/recompressing them. This will be 
similar to the RC file merge, except that footers will have to be updated with 
the stripe positions in the new file.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4199) ORC writer doesn't handle non-UTF8 encoded Text properly

2013-03-18 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4199:
--

Status: Patch Available  (was: Open)

> ORC writer doesn't handle non-UTF8 encoded Text properly
> 
>
> Key: HIVE-4199
> URL: https://issues.apache.org/jira/browse/HIVE-4199
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-4199.HIVE-4199.HIVE-4199.D9501.1.patch, 
> HIVE-4199.HIVE-4199.HIVE-4199.D9501.2.patch, 
> HIVE-4199.HIVE-4199.HIVE-4199.D9501.3.patch
>
>
> StringTreeWriter currently converts fields stored as Text objects into 
> Strings. This can lose information (see 
> http://en.wikipedia.org/wiki/Replacement_character#Replacement_character), 
> and is also unnecessary since the dictionary stores Text objects.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4199) ORC writer doesn't handle non-UTF8 encoded Text properly

2013-03-18 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4199:
-

 Summary: ORC writer doesn't handle non-UTF8 encoded Text properly
 Key: HIVE-4199
 URL: https://issues.apache.org/jira/browse/HIVE-4199
 Project: Hive
  Issue Type: Bug
  Components: Serializers/Deserializers
Reporter: Samuel Yuan
Assignee: Samuel Yuan
Priority: Minor


StringTreeWriter currently converts fields stored as Text objects into Strings. 
This can lose information (see 
http://en.wikipedia.org/wiki/Replacement_character#Replacement_character), and 
is also unnecessary since the dictionary stores Text objects.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4067) Followup to HIVE-701: reduce ambiguity in grammar

2013-03-15 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13604068#comment-13604068
 ] 

Samuel Yuan commented on HIVE-4067:
---

The primary motivation was that it was tricky to add new keywords, which are 
almost always reserved words by default in Hive, because doing so could easily 
break existing queries. The changes for HIVE-701 make it easy to add 
non-reserved keywords in the future. HIVE-701 also removes the reserved status 
of most keywords, to prevent recently introduced keywords from breaking queries.

I can undo the changes for the Hive keywords which are reserved in SQL 2003, 
but would there be any reason to do so besides to adhere to the standard, given 
that the grammar can support leaving them non-reserved?

> Followup to HIVE-701: reduce ambiguity in grammar
> -
>
> Key: HIVE-4067
> URL: https://issues.apache.org/jira/browse/HIVE-4067
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-4067.D8883.1.patch
>
>
> After HIVE-701 the grammar has become much more ambiguous, and the 
> compilation generates a large number of warnings. Making FROM, DISTINCT, 
> PRESERVE, COLUMN, ALL, AND, OR, and NOT reserved keywords again reduces the 
> number of warnings to 134, up from the original 81 warnings but down from the 
> 565 after HIVE-701. Most of the remaining ambiguity is trivial, an example 
> being "KW_ELEM_TYPE | KW_KEY_TYPE | KW_VALUE_TYPE | identifier", and they are 
> all correctly handled by ANTLR.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4125) Expose metastore JMX metrics

2013-03-11 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4125:
--

Status: Patch Available  (was: Open)

> Expose metastore JMX metrics
> 
>
> Key: HIVE-4125
> URL: https://issues.apache.org/jira/browse/HIVE-4125
> Project: Hive
>  Issue Type: Improvement
>  Components: Metastore
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Trivial
> Attachments: HIVE-4125.HIVE-4125.HIVE-4125.D9123.1.patch, 
> HIVE-4125.HIVE-4125.HIVE-4125.D9123.2.patch
>
>
> Add a safe way to access the metrics stored for each MetricsScope, so that 
> they can be used outside of JMX.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4067) Followup to HIVE-701: reduce ambiguity in grammar

2013-03-10 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13598566#comment-13598566
 ] 

Samuel Yuan commented on HIVE-4067:
---

This is referring to the Java code in the @members section. The section is 
still being properly inserted into the generated HiveParser.java file, which 
then compiles fine, so I don't know why ANTLR is complaining. I'll look into it.

> Followup to HIVE-701: reduce ambiguity in grammar
> -
>
> Key: HIVE-4067
> URL: https://issues.apache.org/jira/browse/HIVE-4067
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-4067.D8883.1.patch
>
>
> After HIVE-701 the grammar has become much more ambiguous, and the 
> compilation generates a large number of warnings. Making FROM, DISTINCT, 
> PRESERVE, COLUMN, ALL, AND, OR, and NOT reserved keywords again reduces the 
> number of warnings to 134, up from the original 81 warnings but down from the 
> 565 after HIVE-701. Most of the remaining ambiguity is trivial, an example 
> being "KW_ELEM_TYPE | KW_KEY_TYPE | KW_VALUE_TYPE | identifier", and they are 
> all correctly handled by ANTLR.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4067) Followup to HIVE-701: reduce ambiguity in grammar

2013-03-08 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4067:
--

Affects Version/s: 0.11.0
   Status: Patch Available  (was: Open)

> Followup to HIVE-701: reduce ambiguity in grammar
> -
>
> Key: HIVE-4067
> URL: https://issues.apache.org/jira/browse/HIVE-4067
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
>
> After HIVE-701 the grammar has become much more ambiguous, and the 
> compilation generates a large number of warnings. Making FROM, DISTINCT, 
> PRESERVE, COLUMN, ALL, AND, OR, and NOT reserved keywords again reduces the 
> number of warnings to 134, up from the original 81 warnings but down from the 
> 565 after HIVE-701. Most of the remaining ambiguity is trivial, an example 
> being "KW_ELEM_TYPE | KW_KEY_TYPE | KW_VALUE_TYPE | identifier", and they are 
> all correctly handled by ANTLR.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4125) Expose metastore JMX metrics

2013-03-05 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4125:
-

 Summary: Expose metastore JMX metrics
 Key: HIVE-4125
 URL: https://issues.apache.org/jira/browse/HIVE-4125
 Project: Hive
  Issue Type: Improvement
  Components: Metastore
Affects Versions: 0.11.0
Reporter: Samuel Yuan
Assignee: Samuel Yuan
Priority: Trivial


Add a safe way to access the metrics stored for each MetricsScope, so that they 
can be used outside of JMX.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4092) Store complete names of tables in column access analyzer

2013-02-28 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4092:
--

Status: Patch Available  (was: Open)

> Store complete names of tables in column access analyzer
> 
>
> Key: HIVE-4092
> URL: https://issues.apache.org/jira/browse/HIVE-4092
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Trivial
> Fix For: 0.11.0
>
> Attachments: HIVE-4092.HIVE-4092.HIVE-4092.D8985.1.patch
>
>
> Right now the db name is not being stored. We should store the complete name, 
> which includes the db name, as the table access analyzer does.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4092) Store complete names of tables in column access analyzer

2013-02-28 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4092:
--

Description: Right now the db name is not being stored. We should store the 
complete name, which includes the db name, as the table access analyzer does.

> Store complete names of tables in column access analyzer
> 
>
> Key: HIVE-4092
> URL: https://issues.apache.org/jira/browse/HIVE-4092
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Trivial
> Fix For: 0.11.0
>
>
> Right now the db name is not being stored. We should store the complete name, 
> which includes the db name, as the table access analyzer does.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4092) Store complete names of tables in column access analyzer

2013-02-28 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4092:
-

 Summary: Store complete names of tables in column access analyzer
 Key: HIVE-4092
 URL: https://issues.apache.org/jira/browse/HIVE-4092
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Samuel Yuan
Assignee: Samuel Yuan
Priority: Trivial
 Fix For: 0.11.0




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-4044) Add URL type

2013-02-27 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1357#comment-1357
 ] 

Samuel Yuan commented on HIVE-4044:
---

You're right, the idea is that it will enable better encoding of URLs. Kevin 
found that breaking up the URL into its components and storing them as separate 
columns results in significant space savings. The original plan was to 
implement this idea with RCFile, but with the new ORC file format I decided to 
wait for that instead, and to submit this part separately.

However, it looks like the improvements of the ORC file have erased any gains 
we would have gotten by breaking up URLs into the individual components, so 
this won't be needed any more.

> Add URL type
> 
>
> Key: HIVE-4044
> URL: https://issues.apache.org/jira/browse/HIVE-4044
> Project: Hive
>  Issue Type: Improvement
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4044.HIVE-4044.HIVE-4044.D8799.1.patch
>
>
> Having a separate type for URLs would enable improvements in storage 
> efficiency based on breaking up a URL into its components. The new type will 
> be named "URL" and made a non-reserved keyword (see HIVE-701).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-701) lots of reserved keywords in hive

2013-02-22 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13584786#comment-13584786
 ] 

Samuel Yuan commented on HIVE-701:
--

Removing FROM, DISTINCT, PRESERVE, COLUMN, ALL, AND, OR, and NOT brings the 
number of warnings down to 134. Before this change it was at 81. Most of these 
other cases are trivial, an example being "KW_ELEM_TYPE | KW_KEY_TYPE | 
KW_VALUE_TYPE | identifier", and they are all correctly handled by ANTLR.

I have created a followup JIRA: HIVE-4067.

> lots of reserved keywords in hive
> -
>
> Key: HIVE-701
> URL: https://issues.apache.org/jira/browse/HIVE-701
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Samuel Yuan
> Fix For: 0.11.0
>
> Attachments: HIVE-701.1.patch.txt, HIVE-701.2.patch.txt, 
> HIVE-701.D8397.1.patch, HIVE-701.HIVE-701.D8397.2.patch, 
> HIVE-701.HIVE-701.D8397.3.patch
>
>
> There is a problem if we want to use some reserved keywords:
> for example, creating a function of name left/right ? left/right is already a 
> reserved keyword.
> The other way around should also be possible - if we want to add a 'show 
> tables status' and some applications already use status as a column name, 
> they should not break

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4067) Followup to HIVE-701: reduce ambiguity in grammar

2013-02-22 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4067:
-

 Summary: Followup to HIVE-701: reduce ambiguity in grammar
 Key: HIVE-4067
 URL: https://issues.apache.org/jira/browse/HIVE-4067
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Samuel Yuan
Assignee: Samuel Yuan
Priority: Minor


After HIVE-701 the grammar has become much more ambiguous, and the compilation 
generates a large number of warnings. Making FROM, DISTINCT, PRESERVE, COLUMN, 
ALL, AND, OR, and NOT reserved keywords again reduces the number of warnings to 
134, up from the original 81 warnings but down from the 565 after HIVE-701. 
Most of the remaining ambiguity is trivial, an example being "KW_ELEM_TYPE | 
KW_KEY_TYPE | KW_VALUE_TYPE | identifier", and they are all correctly handled 
by ANTLR.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-701) lots of reserved keywords in hive

2013-02-21 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13584073#comment-13584073
 ] 

Samuel Yuan commented on HIVE-701:
--

Taking out DISTINCT and ALL takes the lines of warnings down from about 2900 to 
around 2300. We were originally at around 1000. I can try taking out more 
keywords to reduce the ambiguity.

By the way, what is the concern about the ambiguity? There is no impact to the 
user, and I don't think there's much of a performance hit. Is it just that the 
warnings are clogging up the build log, or is there something else?

> lots of reserved keywords in hive
> -
>
> Key: HIVE-701
> URL: https://issues.apache.org/jira/browse/HIVE-701
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Samuel Yuan
> Fix For: 0.11.0
>
> Attachments: HIVE-701.1.patch.txt, HIVE-701.2.patch.txt, 
> HIVE-701.D8397.1.patch, HIVE-701.HIVE-701.D8397.2.patch, 
> HIVE-701.HIVE-701.D8397.3.patch
>
>
> There is a problem if we want to use some reserved keywords:
> for example, creating a function of name left/right ? left/right is already a 
> reserved keyword.
> The other way around should also be possible - if we want to add a 'show 
> tables status' and some applications already use status as a column name, 
> they should not break

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-4044) Add URL type

2013-02-21 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-4044:
--

Status: Patch Available  (was: Open)

> Add URL type
> 
>
> Key: HIVE-4044
> URL: https://issues.apache.org/jira/browse/HIVE-4044
> Project: Hive
>  Issue Type: Improvement
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
> Attachments: HIVE-4044.HIVE-4044.HIVE-4044.D8799.1.patch
>
>
> Having a separate type for URLs would enable improvements in storage 
> efficiency based on breaking up a URL into its components. The new type will 
> be named "URL" and made a non-reserved keyword (see HIVE-701).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4046) Column masking

2013-02-20 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4046:
-

 Summary: Column masking
 Key: HIVE-4046
 URL: https://issues.apache.org/jira/browse/HIVE-4046
 Project: Hive
  Issue Type: New Feature
  Components: CLI, Metastore, Query Processor
Affects Versions: 0.11.0
Reporter: Samuel Yuan
Assignee: Samuel Yuan


Sometimes data in a table needs to be kept around but made inaccessible. Right 
now it is possible to offline a table or a partition, but not a specific column 
of a partition. Also, accessing an offlined table results in an error. With 
this change, it will be possible to mask a column at the partition level, 
causing all further queries to that column to return null.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-4044) Add URL type

2013-02-20 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-4044:
-

 Summary: Add URL type
 Key: HIVE-4044
 URL: https://issues.apache.org/jira/browse/HIVE-4044
 Project: Hive
  Issue Type: Improvement
Reporter: Samuel Yuan
Assignee: Samuel Yuan


Having a separate type for URLs would enable improvements in storage efficiency 
based on breaking up a URL into its components. The new type will be named 
"URL" and made a non-reserved keyword (see HIVE-701).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-701) lots of reserved keywords in hive

2013-02-05 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-701:
-

Attachment: HIVE-701.2.patch.txt

Updated, see Phabricator.

> lots of reserved keywords in hive
> -
>
> Key: HIVE-701
> URL: https://issues.apache.org/jira/browse/HIVE-701
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Samuel Yuan
> Attachments: HIVE-701.1.patch.txt, HIVE-701.2.patch.txt, 
> HIVE-701.D8397.1.patch, HIVE-701.HIVE-701.D8397.2.patch
>
>
> There is a problem if we want to use some reserved keywords:
> for example, creating a function of name left/right ? left/right is already a 
> reserved keyword.
> The other way around should also be possible - if we want to add a 'show 
> tables status' and some applications already use status as a column name, 
> they should not break

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-701) lots of reserved keywords in hive

2013-02-05 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-701:
-

Status: Patch Available  (was: Open)

> lots of reserved keywords in hive
> -
>
> Key: HIVE-701
> URL: https://issues.apache.org/jira/browse/HIVE-701
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Samuel Yuan
> Attachments: HIVE-701.1.patch.txt, HIVE-701.D8397.1.patch
>
>
> There is a problem if we want to use some reserved keywords:
> for example, creating a function of name left/right ? left/right is already a 
> reserved keyword.
> The other way around should also be possible - if we want to add a 'show 
> tables status' and some applications already use status as a column name, 
> they should not break

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-701) lots of reserved keywords in hive

2013-02-05 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-701:
-

Attachment: HIVE-701.1.patch.txt

https://reviews.facebook.net/D8397

> lots of reserved keywords in hive
> -
>
> Key: HIVE-701
> URL: https://issues.apache.org/jira/browse/HIVE-701
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Samuel Yuan
> Attachments: HIVE-701.1.patch.txt, HIVE-701.D8397.1.patch
>
>
> There is a problem if we want to use some reserved keywords:
> for example, creating a function of name left/right ? left/right is already a 
> reserved keyword.
> The other way around should also be possible - if we want to add a 'show 
> tables status' and some applications already use status as a column name, 
> they should not break

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Assigned] (HIVE-701) lots of reserved keywords in hive

2013-01-31 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan reassigned HIVE-701:


Assignee: Samuel Yuan

> lots of reserved keywords in hive
> -
>
> Key: HIVE-701
> URL: https://issues.apache.org/jira/browse/HIVE-701
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Reporter: Namit Jain
>Assignee: Samuel Yuan
>
> There is a problem if we want to use some reserved keywords:
> for example, creating a function of name left/right ? left/right is already a 
> reserved keyword.
> The other way around should also be possible - if we want to add a 'show 
> tables status' and some applications already use status as a column name, 
> they should not break

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3940) Track columns accessed in each table in a query

2013-01-30 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3940:
--

Attachment: HIVE-3940.3.patch.txt

Updated.

> Track columns accessed in each table in a query
> ---
>
> Key: HIVE-3940
> URL: https://issues.apache.org/jira/browse/HIVE-3940
> Project: Hive
>  Issue Type: Task
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-3940.1.patch.txt, HIVE-3940.2.patch.txt, 
> HIVE-3940.3.patch.txt
>
>
> Similar to partition access logs, we need to have columns access logs, so 
> later we can build tools/reports to inform users if there are wasted columns 
> in a table to be trimmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3252) Add environment context to metastore Thrift calls

2013-01-29 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3252:
--

Attachment: HIVE-3252.2.patch.txt

Updated, see comments in Phabricator

> Add environment context to metastore Thrift calls
> -
>
> Key: HIVE-3252
> URL: https://issues.apache.org/jira/browse/HIVE-3252
> Project: Hive
>  Issue Type: Improvement
>  Components: Metastore
>Reporter: John Reese
>Assignee: John Reese
>Priority: Minor
> Attachments: HIVE-3252.1.patch.txt, HIVE-3252.2.patch.txt
>
>
> Currently in the Hive Thrift metastore API create_table, add_partition, 
> alter_table, alter_partition have with_environment_context analogs.  It would 
> be really useful to add similar methods from drop_partition, drop_table, and 
> append_partition.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3252) Add environment context to metastore Thrift calls

2013-01-28 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3252:
--

Attachment: HIVE-3252.1.patch.txt

https://reviews.facebook.net/D8241

> Add environment context to metastore Thrift calls
> -
>
> Key: HIVE-3252
> URL: https://issues.apache.org/jira/browse/HIVE-3252
> Project: Hive
>  Issue Type: Improvement
>  Components: Metastore
>Reporter: John Reese
>Assignee: John Reese
>Priority: Minor
> Attachments: HIVE-3252.1.patch.txt
>
>
> Currently in the Hive Thrift metastore API create_table, add_partition, 
> alter_table, alter_partition have with_environment_context analogs.  It would 
> be really useful to add similar methods from drop_partition, drop_table, and 
> append_partition.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3940) Track columns accessed in each table in a query

2013-01-28 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3940:
--

Attachment: HIVE-3940.2.patch.txt

> Track columns accessed in each table in a query
> ---
>
> Key: HIVE-3940
> URL: https://issues.apache.org/jira/browse/HIVE-3940
> Project: Hive
>  Issue Type: Task
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-3940.1.patch.txt, HIVE-3940.2.patch.txt
>
>
> Similar to partition access logs, we need to have columns access logs, so 
> later we can build tools/reports to inform users if there are wasted columns 
> in a table to be trimmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3940) Track columns accessed in each table in a query

2013-01-28 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3940:
--

Status: Patch Available  (was: Open)

> Track columns accessed in each table in a query
> ---
>
> Key: HIVE-3940
> URL: https://issues.apache.org/jira/browse/HIVE-3940
> Project: Hive
>  Issue Type: Task
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-3940.1.patch.txt, HIVE-3940.2.patch.txt
>
>
> Similar to partition access logs, we need to have columns access logs, so 
> later we can build tools/reports to inform users if there are wasted columns 
> in a table to be trimmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3940) Track columns accessed in each table in a query

2013-01-25 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3940:
--

Status: Patch Available  (was: Open)

> Track columns accessed in each table in a query
> ---
>
> Key: HIVE-3940
> URL: https://issues.apache.org/jira/browse/HIVE-3940
> Project: Hive
>  Issue Type: Task
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-3940.1.patch.txt
>
>
> Similar to partition access logs, we need to have columns access logs, so 
> later we can build tools/reports to inform users if there are wasted columns 
> in a table to be trimmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-3940) Track columns accessed in each table in a query

2013-01-25 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13562940#comment-13562940
 ] 

Samuel Yuan commented on HIVE-3940:
---

https://reviews.facebook.net/D8193

> Track columns accessed in each table in a query
> ---
>
> Key: HIVE-3940
> URL: https://issues.apache.org/jira/browse/HIVE-3940
> Project: Hive
>  Issue Type: Task
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-3940.1.patch.txt
>
>
> Similar to partition access logs, we need to have columns access logs, so 
> later we can build tools/reports to inform users if there are wasted columns 
> in a table to be trimmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3940) Track columns accessed in each table in a query

2013-01-25 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3940:
--

Attachment: HIVE-3940.1.patch.txt

> Track columns accessed in each table in a query
> ---
>
> Key: HIVE-3940
> URL: https://issues.apache.org/jira/browse/HIVE-3940
> Project: Hive
>  Issue Type: Task
>  Components: Query Processor
>Reporter: Samuel Yuan
>Assignee: Samuel Yuan
>Priority: Minor
> Attachments: HIVE-3940.1.patch.txt
>
>
> Similar to partition access logs, we need to have columns access logs, so 
> later we can build tools/reports to inform users if there are wasted columns 
> in a table to be trimmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HIVE-3940) Track columns accessed in each table in a query

2013-01-24 Thread Samuel Yuan (JIRA)
Samuel Yuan created HIVE-3940:
-

 Summary: Track columns accessed in each table in a query
 Key: HIVE-3940
 URL: https://issues.apache.org/jira/browse/HIVE-3940
 Project: Hive
  Issue Type: Task
  Components: Query Processor
Reporter: Samuel Yuan
Assignee: Samuel Yuan
Priority: Minor


Similar to partition access logs, we need to have columns access logs, so later 
we can build tools/reports to inform users if there are wasted columns in a 
table to be trimmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3903) Allow updating bucketing/sorting metadata of a partition through the CLI

2013-01-23 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3903:
--

Status: Patch Available  (was: Open)

> Allow updating bucketing/sorting metadata of a partition through the CLI
> 
>
> Key: HIVE-3903
> URL: https://issues.apache.org/jira/browse/HIVE-3903
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Kevin Wilfong
>Assignee: Samuel Yuan
> Attachments: HIVE-3903.1.patch.txt, HIVE-3903.2.patch.txt
>
>
> Right now users can update the bucketing/sorting metadata of a table through 
> the CLI, but not a partition.  
> Use case:
> Need to merge a partition's files, but it's bucketed/sorted, so want to mark 
> the partition as unbucketed/unsorted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3903) Allow updating bucketing/sorting metadata of a partition through the CLI

2013-01-23 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3903:
--

Attachment: HIVE-3903.2.patch.txt

> Allow updating bucketing/sorting metadata of a partition through the CLI
> 
>
> Key: HIVE-3903
> URL: https://issues.apache.org/jira/browse/HIVE-3903
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Kevin Wilfong
>Assignee: Samuel Yuan
> Attachments: HIVE-3903.1.patch.txt, HIVE-3903.2.patch.txt
>
>
> Right now users can update the bucketing/sorting metadata of a table through 
> the CLI, but not a partition.  
> Use case:
> Need to merge a partition's files, but it's bucketed/sorted, so want to mark 
> the partition as unbucketed/unsorted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3903) Allow updating bucketing/sorting metadata of a partition through the CLI

2013-01-23 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3903:
--

Status: Patch Available  (was: Open)

> Allow updating bucketing/sorting metadata of a partition through the CLI
> 
>
> Key: HIVE-3903
> URL: https://issues.apache.org/jira/browse/HIVE-3903
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Kevin Wilfong
>Assignee: Samuel Yuan
> Attachments: HIVE-3903.1.patch.txt
>
>
> Right now users can update the bucketing/sorting metadata of a table through 
> the CLI, but not a partition.  
> Use case:
> Need to merge a partition's files, but it's bucketed/sorted, so want to mark 
> the partition as unbucketed/unsorted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HIVE-3903) Allow updating bucketing/sorting metadata of a partition through the CLI

2013-01-23 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan updated HIVE-3903:
--

Attachment: HIVE-3903.1.patch.txt

> Allow updating bucketing/sorting metadata of a partition through the CLI
> 
>
> Key: HIVE-3903
> URL: https://issues.apache.org/jira/browse/HIVE-3903
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Kevin Wilfong
>Assignee: Samuel Yuan
> Attachments: HIVE-3903.1.patch.txt
>
>
> Right now users can update the bucketing/sorting metadata of a table through 
> the CLI, but not a partition.  
> Use case:
> Need to merge a partition's files, but it's bucketed/sorted, so want to mark 
> the partition as unbucketed/unsorted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-3903) Allow updating bucketing/sorting metadata of a partition through the CLI

2013-01-23 Thread Samuel Yuan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13561078#comment-13561078
 ] 

Samuel Yuan commented on HIVE-3903:
---

https://reviews.facebook.net/D8127

> Allow updating bucketing/sorting metadata of a partition through the CLI
> 
>
> Key: HIVE-3903
> URL: https://issues.apache.org/jira/browse/HIVE-3903
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Kevin Wilfong
>Assignee: Samuel Yuan
>
> Right now users can update the bucketing/sorting metadata of a table through 
> the CLI, but not a partition.  
> Use case:
> Need to merge a partition's files, but it's bucketed/sorted, so want to mark 
> the partition as unbucketed/unsorted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Assigned] (HIVE-3903) Allow updating bucketing/sorting metadata of a partition through the CLI

2013-01-17 Thread Samuel Yuan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Samuel Yuan reassigned HIVE-3903:
-

Assignee: Samuel Yuan

> Allow updating bucketing/sorting metadata of a partition through the CLI
> 
>
> Key: HIVE-3903
> URL: https://issues.apache.org/jira/browse/HIVE-3903
> Project: Hive
>  Issue Type: New Feature
>  Components: Query Processor
>Affects Versions: 0.11.0
>Reporter: Kevin Wilfong
>Assignee: Samuel Yuan
>
> Right now users can update the bucketing/sorting metadata of a table through 
> the CLI, but not a partition.  
> Use case:
> Need to merge a partition's files, but it's bucketed/sorted, so want to mark 
> the partition as unbucketed/unsorted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira