[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13

2014-06-26 Thread Sumit Kumar (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14045065#comment-14045065
 ] 

Sumit Kumar commented on HIVE-7097:
---

[~sunrui] I hit this today and found following references useful:

# 
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-AlterColumn
# 
https://issues.apache.org/jira/secure/attachment/12618321/QuotedIdentifier.html

In short the functionality is still there but you need to set 
hive.support.quoted.identifiers to none to get the pre-0.13 behavior. I was 
able to run my query after
{code:actionscript}
hive set hive.support.quoted.identifiers=none;
{code}

My query was something like:
{code:actionscript}
hive select `(col1|col2|col3)?+.+` from testTable1;
{code}


 The Support for REGEX Column Broken in HIVE 0.13
 

 Key: HIVE-7097
 URL: https://issues.apache.org/jira/browse/HIVE-7097
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.13.0
Reporter: Sun Rui

 The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13.
 For example:
 {code:sql}
 select `key.*` from src limit 1;
 {code}
 will fail in HIVE 0.13 with the following error from SemanticAnalyzer:
 {noformat}
 FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or 
 column reference 'key.*': (possible column names are: key, value)
 {noformat}
 This issue is related to HIVE-6037. When set 
 hive.support.quoted.identifiers=none, the issue will be gone.
 I am not sure the configuration was intended to break regex column. But at 
 least the documentation needs to be updated: 
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification
 I would argue backward compatibility is more important.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13

2014-06-26 Thread Sumit Kumar (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14045084#comment-14045084
 ] 

Sumit Kumar commented on HIVE-7097:
---

Basically this doesn't seem to be an issue but it would help if we clarify this 
in [Select 
documentation|https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select]
 as well . 

 The Support for REGEX Column Broken in HIVE 0.13
 

 Key: HIVE-7097
 URL: https://issues.apache.org/jira/browse/HIVE-7097
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.13.0
Reporter: Sun Rui

 The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13.
 For example:
 {code:sql}
 select `key.*` from src limit 1;
 {code}
 will fail in HIVE 0.13 with the following error from SemanticAnalyzer:
 {noformat}
 FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or 
 column reference 'key.*': (possible column names are: key, value)
 {noformat}
 This issue is related to HIVE-6037. When set 
 hive.support.quoted.identifiers=none, the issue will be gone.
 I am not sure the configuration was intended to break regex column. But at 
 least the documentation needs to be updated: 
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification
 I would argue backward compatibility is more important.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13

2014-06-26 Thread Lefty Leverenz (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14045562#comment-14045562
 ] 

Lefty Leverenz commented on HIVE-7097:
--

I added information about this to the bullet list after the SELECT syntax (same 
as for Create Table) and gave version information in the section REGEX Column 
Specification:

* [Select Syntax | 
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-SelectSyntax]
* [REGEX Column Specification | 
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification]

 The Support for REGEX Column Broken in HIVE 0.13
 

 Key: HIVE-7097
 URL: https://issues.apache.org/jira/browse/HIVE-7097
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.13.0
Reporter: Sun Rui

 The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13.
 For example:
 {code:sql}
 select `key.*` from src limit 1;
 {code}
 will fail in HIVE 0.13 with the following error from SemanticAnalyzer:
 {noformat}
 FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or 
 column reference 'key.*': (possible column names are: key, value)
 {noformat}
 This issue is related to HIVE-6037. When set 
 hive.support.quoted.identifiers=none, the issue will be gone.
 I am not sure the configuration was intended to break regex column. But at 
 least the documentation needs to be updated: 
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification
 I would argue backward compatibility is more important.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13

2014-05-20 Thread Carter Shanklin (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14003492#comment-14003492
 ] 

Carter Shanklin commented on HIVE-7097:
---

Sun,

The issue is related to HIVE-6013. Harish and I debated about this change so 
you can put all the blame on me.

We decided to go this path because:
1. We had a user who was trying to import about 50,000 tables from existing 
databases that contained all kinds of strange characters in column names.
2. The new behavior is consistent with SQL standards
3. Most Hive users did not know about the regex feature and did not use it.
4. Other databases allow .* as part of the column name.

It's worthwhile for others to give their opinion on this. Personally I think 
the breaking change is better in the long run. Can you give more detail about 
why you favor the old path? Is it because of Shark compatibility? Something 
else?

 The Support for REGEX Column Broken in HIVE 0.13
 

 Key: HIVE-7097
 URL: https://issues.apache.org/jira/browse/HIVE-7097
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.13.0
Reporter: Sun Rui

 The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13.
 For example:
 {code:sql}
 select `key.*` from src limit 1;
 {code}
 will fail in HIVE 0.13 with the following error from SemanticAnalyzer:
 {noformat}
 FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or 
 column reference 'key.*': (possible column names are: key, value)
 {noformat}
 This issue is related to HIVE-6037. When set 
 hive.support.quoted.identifiers=none, the issue will be gone.
 I am not sure the configuration was intended to break regex column. But at 
 least the documentation needs to be updated: 
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification
 I would argue backward compatibility is more important.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13

2014-05-20 Thread Sun Rui (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14004168#comment-14004168
 ] 

Sun Rui commented on HIVE-7097:
---

[~cartershanklin] Thanks for your explanation. I happened to use REGEX columns 
to shorten several queries. I can do a one-shot change to not use it, since the 
community have not opposed to this breaking of backward compatibility. Could 
you update the documentation to reflect the new behavior?

 The Support for REGEX Column Broken in HIVE 0.13
 

 Key: HIVE-7097
 URL: https://issues.apache.org/jira/browse/HIVE-7097
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.13.0
Reporter: Sun Rui

 The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13.
 For example:
 {code:sql}
 select `key.*` from src limit 1;
 {code}
 will fail in HIVE 0.13 with the following error from SemanticAnalyzer:
 {noformat}
 FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or 
 column reference 'key.*': (possible column names are: key, value)
 {noformat}
 This issue is related to HIVE-6037. When set 
 hive.support.quoted.identifiers=none, the issue will be gone.
 I am not sure the configuration was intended to break regex column. But at 
 least the documentation needs to be updated: 
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification
 I would argue backward compatibility is more important.



--
This message was sent by Atlassian JIRA
(v6.2#6252)