[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13
[ https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14045065#comment-14045065 ] Sumit Kumar commented on HIVE-7097: --- [~sunrui] I hit this today and found following references useful: # https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-AlterColumn # https://issues.apache.org/jira/secure/attachment/12618321/QuotedIdentifier.html In short the functionality is still there but you need to set hive.support.quoted.identifiers to none to get the pre-0.13 behavior. I was able to run my query after {code:actionscript} hive set hive.support.quoted.identifiers=none; {code} My query was something like: {code:actionscript} hive select `(col1|col2|col3)?+.+` from testTable1; {code} The Support for REGEX Column Broken in HIVE 0.13 Key: HIVE-7097 URL: https://issues.apache.org/jira/browse/HIVE-7097 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.13.0 Reporter: Sun Rui The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13. For example: {code:sql} select `key.*` from src limit 1; {code} will fail in HIVE 0.13 with the following error from SemanticAnalyzer: {noformat} FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or column reference 'key.*': (possible column names are: key, value) {noformat} This issue is related to HIVE-6037. When set hive.support.quoted.identifiers=none, the issue will be gone. I am not sure the configuration was intended to break regex column. But at least the documentation needs to be updated: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification I would argue backward compatibility is more important. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13
[ https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14045084#comment-14045084 ] Sumit Kumar commented on HIVE-7097: --- Basically this doesn't seem to be an issue but it would help if we clarify this in [Select documentation|https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select] as well . The Support for REGEX Column Broken in HIVE 0.13 Key: HIVE-7097 URL: https://issues.apache.org/jira/browse/HIVE-7097 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.13.0 Reporter: Sun Rui The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13. For example: {code:sql} select `key.*` from src limit 1; {code} will fail in HIVE 0.13 with the following error from SemanticAnalyzer: {noformat} FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or column reference 'key.*': (possible column names are: key, value) {noformat} This issue is related to HIVE-6037. When set hive.support.quoted.identifiers=none, the issue will be gone. I am not sure the configuration was intended to break regex column. But at least the documentation needs to be updated: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification I would argue backward compatibility is more important. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13
[ https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14045562#comment-14045562 ] Lefty Leverenz commented on HIVE-7097: -- I added information about this to the bullet list after the SELECT syntax (same as for Create Table) and gave version information in the section REGEX Column Specification: * [Select Syntax | https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-SelectSyntax] * [REGEX Column Specification | https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification] The Support for REGEX Column Broken in HIVE 0.13 Key: HIVE-7097 URL: https://issues.apache.org/jira/browse/HIVE-7097 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.13.0 Reporter: Sun Rui The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13. For example: {code:sql} select `key.*` from src limit 1; {code} will fail in HIVE 0.13 with the following error from SemanticAnalyzer: {noformat} FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or column reference 'key.*': (possible column names are: key, value) {noformat} This issue is related to HIVE-6037. When set hive.support.quoted.identifiers=none, the issue will be gone. I am not sure the configuration was intended to break regex column. But at least the documentation needs to be updated: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification I would argue backward compatibility is more important. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13
[ https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14003492#comment-14003492 ] Carter Shanklin commented on HIVE-7097: --- Sun, The issue is related to HIVE-6013. Harish and I debated about this change so you can put all the blame on me. We decided to go this path because: 1. We had a user who was trying to import about 50,000 tables from existing databases that contained all kinds of strange characters in column names. 2. The new behavior is consistent with SQL standards 3. Most Hive users did not know about the regex feature and did not use it. 4. Other databases allow .* as part of the column name. It's worthwhile for others to give their opinion on this. Personally I think the breaking change is better in the long run. Can you give more detail about why you favor the old path? Is it because of Shark compatibility? Something else? The Support for REGEX Column Broken in HIVE 0.13 Key: HIVE-7097 URL: https://issues.apache.org/jira/browse/HIVE-7097 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.13.0 Reporter: Sun Rui The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13. For example: {code:sql} select `key.*` from src limit 1; {code} will fail in HIVE 0.13 with the following error from SemanticAnalyzer: {noformat} FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or column reference 'key.*': (possible column names are: key, value) {noformat} This issue is related to HIVE-6037. When set hive.support.quoted.identifiers=none, the issue will be gone. I am not sure the configuration was intended to break regex column. But at least the documentation needs to be updated: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification I would argue backward compatibility is more important. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7097) The Support for REGEX Column Broken in HIVE 0.13
[ https://issues.apache.org/jira/browse/HIVE-7097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14004168#comment-14004168 ] Sun Rui commented on HIVE-7097: --- [~cartershanklin] Thanks for your explanation. I happened to use REGEX columns to shorten several queries. I can do a one-shot change to not use it, since the community have not opposed to this breaking of backward compatibility. Could you update the documentation to reflect the new behavior? The Support for REGEX Column Broken in HIVE 0.13 Key: HIVE-7097 URL: https://issues.apache.org/jira/browse/HIVE-7097 Project: Hive Issue Type: Bug Components: Query Processor Affects Versions: 0.13.0 Reporter: Sun Rui The Support for REGEX Column is OK in HIVE 0.12, but is broken in HIVE 0.13. For example: {code:sql} select `key.*` from src limit 1; {code} will fail in HIVE 0.13 with the following error from SemanticAnalyzer: {noformat} FAILED: SemanticException [Error 10004]: Line 1:7 Invalid table alias or column reference 'key.*': (possible column names are: key, value) {noformat} This issue is related to HIVE-6037. When set hive.support.quoted.identifiers=none, the issue will be gone. I am not sure the configuration was intended to break regex column. But at least the documentation needs to be updated: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Select#LanguageManualSelect-REGEXColumnSpecification I would argue backward compatibility is more important. -- This message was sent by Atlassian JIRA (v6.2#6252)