[jira] [Commented] (SPARK-22954) ANALYZE TABLE fails with NoSuchTableException for temporary tables (but should have reported "not supported on views")
[ https://issues.apache.org/jira/browse/SPARK-22954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16315336#comment-16315336 ]

Suchith J N commented on SPARK-22954:
-------------------------------------

I have opened a pull request.

> ANALYZE TABLE fails with NoSuchTableException for temporary tables (but should have reported "not supported on views")
> -----------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-22954
>                 URL: https://issues.apache.org/jira/browse/SPARK-22954
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.3.0
>         Environment: {code}
> $ ./bin/spark-shell --version
> Welcome to
>       ____              __
>      / __/__  ___ _____/ /__
>     _\ \/ _ \/ _ `/ __/  '_/
>    /___/ .__/\_,_/_/ /_/\_\   version 2.3.0-SNAPSHOT
>       /_/
> Using Scala version 2.11.8, Java HotSpot(TM) 64-Bit Server VM, 1.8.0_152
> Branch master
> Compiled by user jacek on 2018-01-04T05:44:05Z
> Revision 7d045c5f00e2c7c67011830e2169a4e130c3ace8
> {code}
>            Reporter: Jacek Laskowski
>            Priority: Minor
>
> {{ANALYZE TABLE}} fails with {{NoSuchTableException: Table or view 'names' not found in database 'default';}} for temporary tables (views) while the reason is that it can only work with permanent tables (which [it can report|https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeTableCommand.scala#L38] if it had a chance).
> {code}
> scala> names.createOrReplaceTempView("names")
> scala> sql("ANALYZE TABLE names COMPUTE STATISTICS")
> org.apache.spark.sql.catalyst.analysis.NoSuchTableException: Table or view 'names' not found in database 'default';
>   at org.apache.spark.sql.catalyst.catalog.SessionCatalog.requireTableExists(SessionCatalog.scala:181)
>   at org.apache.spark.sql.catalyst.catalog.SessionCatalog.getTableMetadata(SessionCatalog.scala:398)
>   at org.apache.spark.sql.execution.command.AnalyzeTableCommand.run(AnalyzeTableCommand.scala:36)
>   at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult$lzycompute(commands.scala:70)
>   at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult(commands.scala:68)
>   at org.apache.spark.sql.execution.command.ExecutedCommandExec.executeCollect(commands.scala:79)
>   at org.apache.spark.sql.Dataset$$anonfun$6.apply(Dataset.scala:187)
>   at org.apache.spark.sql.Dataset$$anonfun$6.apply(Dataset.scala:187)
>   at org.apache.spark.sql.Dataset$$anonfun$51.apply(Dataset.scala:3244)
>   at org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:77)
>   at org.apache.spark.sql.Dataset.withAction(Dataset.scala:3243)
>   at org.apache.spark.sql.Dataset.<init>(Dataset.scala:187)
>   at org.apache.spark.sql.Dataset$.ofRows(Dataset.scala:72)
>   at org.apache.spark.sql.SparkSession.sql(SparkSession.scala:638)
>   ... 50 elided
> {code}

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22954) ANALYZE TABLE fails with NoSuchTableException for temporary tables (but should have reported "not supported on views")
[ https://issues.apache.org/jira/browse/SPARK-22954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16313956#comment-16313956 ]

Suchith J N commented on SPARK-22954:
-------------------------------------

I found another method in org.apache.spark.sql.catalyst.catalog.SessionCatalog - *getTempViewOrPermanentTableMetadata()*.
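To illustrate why the choice of lookup method matters here, below is a minimal Scala sketch under stated assumptions. `ToyCatalog`, `permanentOnly`, `tempViewOrPermanent`, and `analyze` are hypothetical names made up for this example and are not Spark's API. The model only assumes that one lookup consults permanent tables alone, while the other checks temporary views first, as the name getTempViewOrPermanentTableMetadata() suggests. The message strings are modelled on the ones quoted in this issue.

```scala
// Toy model only: these classes are hypothetical stand-ins, not Spark's SessionCatalog.
object AnalyzeLookupSketch {
  sealed trait Relation
  case object PermanentTable extends Relation
  case object TempView extends Relation

  final class ToyCatalog(permanent: Set[String], tempViews: Set[String]) {
    // Mirrors the failing path: only permanent tables are consulted.
    def permanentOnly(name: String): Either[String, Relation] =
      if (permanent.contains(name)) Right(PermanentTable)
      else Left(s"Table or view '$name' not found in database 'default'")

    // Consults temporary views first, then falls back to permanent tables.
    def tempViewOrPermanent(name: String): Either[String, Relation] =
      if (tempViews.contains(name)) Right(TempView)
      else permanentOnly(name)
  }

  // Hypothetical ANALYZE TABLE front end that can report views explicitly.
  def analyze(catalog: ToyCatalog, name: String): String =
    catalog.tempViewOrPermanent(name) match {
      case Right(TempView)       => "ANALYZE TABLE is not supported on views."
      case Right(PermanentTable) => s"statistics computed for $name"
      case Left(error)           => error
    }

  def main(args: Array[String]): Unit = {
    val catalog = new ToyCatalog(permanent = Set("people"), tempViews = Set("names"))
    println(catalog.permanentOnly("names")) // the misleading "not found" path
    println(analyze(catalog, "names"))      // the clearer "not supported" path
  }
}
```

In this toy, the permanent-only lookup cannot distinguish "does not exist" from "exists as a temporary view", which is the situation the issue describes. Whether the real fix uses this exact method is not confirmed by the thread.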
[jira] [Commented] (SPARK-22954) ANALYZE TABLE fails with NoSuchTableException for temporary tables (but should have reported "not supported on views")
[ https://issues.apache.org/jira/browse/SPARK-22954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16313915#comment-16313915 ]

Suchith J N commented on SPARK-22954:
-------------------------------------

I ran the commands you mentioned. There are actually two catalogs, and they give different answers. Try these out:
{code:java}
scala> import org.apache.spark.sql.catalyst.TableIdentifier

scala> names.sparkSession.catalog.tableExists("names")
res1: Boolean = true

scala> names.sparkSession.sessionState.catalog.tableExists(TableIdentifier("names"))
res2: Boolean = false
{code}
According to the stack trace, Spark looks the table up in sessionState.catalog, but the view is actually stored in sparkSession.catalog.
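One possible reading of the two results above can be sketched in plain Scala. This is an assumption-laden toy, not Spark code: `InternalCatalog` and `UserFacingCatalog` are hypothetical names, and the sketch simply assumes that the user-facing existence check also counts temporary views while the internal one looks at permanent tables only.

```scala
// Toy model only: hypothetical classes, not Spark's Catalog or SessionCatalog.
object TwoCatalogsSketch {
  // Internal view: "exists" means "registered as a permanent table".
  final class InternalCatalog(permanent: Set[String], tempViews: Set[String]) {
    def tableExists(name: String): Boolean = permanent.contains(name)
    def isTemporaryTable(name: String): Boolean = tempViews.contains(name)
  }

  // User-facing view: temporary views count as existing too.
  final class UserFacingCatalog(internal: InternalCatalog) {
    def tableExists(name: String): Boolean =
      internal.isTemporaryTable(name) || internal.tableExists(name)
  }

  def main(args: Array[String]): Unit = {
    val internal = new InternalCatalog(permanent = Set.empty[String], tempViews = Set("names"))
    val userFacing = new UserFacingCatalog(internal)
    println(userFacing.tableExists("names")) // the temporary view is visible here
    println(internal.tableExists("names"))   // but not through the permanent-only check
  }
}
```

Under that assumption the two checks disagree for a temporary view without the view living in a separate store, so a command that relies on the permanent-only check reports "not found".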
[jira] [Issue Comment Deleted] (SPARK-22943) OneHotEncoder supports manual specification of categorySizes
[ https://issues.apache.org/jira/browse/SPARK-22943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suchith J N updated SPARK-22943:
--------------------------------
    Comment: was deleted

(was: I would like to work on this. Could someone tell me if they have already started work on this issue?)

> OneHotEncoder supports manual specification of categorySizes
> -------------------------------------------------------------
>
>                 Key: SPARK-22943
>                 URL: https://issues.apache.org/jira/browse/SPARK-22943
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 2.2.0
>            Reporter: yuhao yang
>            Priority: Minor
>
> OHE should support configurable categorySizes, as n-values in http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html.
> which allows consistent and foreseeable conversion.
[jira] [Commented] (SPARK-22943) OneHotEncoder supports manual specification of categorySizes
[ https://issues.apache.org/jira/browse/SPARK-22943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16313241#comment-16313241 ]

Suchith J N commented on SPARK-22943:
-------------------------------------

I would like to work on this. Could someone tell me if they have already started work on this issue?
[jira] [Commented] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16310068#comment-16310068 ]

Suchith J N commented on SPARK-22553:
-------------------------------------

The change breaks 3 tests:

1. *TableIdentifierParserSuite*: table identifier - strict keywords
{code:java}
org.apache.spark.sql.catalyst.parser.ParseException:
mismatched input 'from' expecting {'SELECT', 'ADD', 'AS', 'ALL', 'DISTINCT', 'WHERE',
{code}
2. *PlanParserSuite*: simple select query
{code:java}
org.apache.spark.sql.catalyst.parser.ParseException:
no viable alternative at input 'select from'(line 1, pos 7)
{code}
3. *DataTypeParserSuite*: parse struct
{code:java}
org.apache.spark.sql.catalyst.parser.ParseException:
mismatched input 'from' expecting {'SELECT', 'ADD', 'AS', 'ALL', 'DISTINCT', 'WHERE',
{code}

> Drop FROM in nonReserved
> ------------------------
>
>                 Key: SPARK-22553
>                 URL: https://issues.apache.org/jira/browse/SPARK-22553
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.2.0
>            Reporter: Takeshi Yamamuro
>            Priority: Trivial
>         Attachments: Removed_FROM_from_nonReserved_list.patch
>
>
> A simple query below throws a misleading error because nonReserved has `SELECT` in SqlBase.g4:
> {code}
> scala> Seq((1, 2)).toDF("a", "b").createTempView("t")
> scala> sql("select a, count(1), from t group by 1").show
> org.apache.spark.sql.AnalysisException: cannot resolve '`a`' given input columns: []; line 1 pos 7;
> 'Aggregate [unresolvedordinal(1)], ['a, count(1) AS count(1)#13L, 'from AS t#11]
> +- OneRowRelation$
>   at org.apache.spark.sql.catalyst.analysis.package$AnalysisErrorAt.failAnalysis(package.scala:42)
>   at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1$$anonfun$apply$2.applyOrElse(CheckAnalysis.scala:88)
>   at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1$$anonfun$apply$2.applyOrElse(CheckAnalysis.scala:85)
>   at org.apache.spark.sql.catalyst.trees.TreeNode$$anonfun$transformUp$1.apply(TreeNode.scala:289)
> {code}
> I know nonReserved currently has `SELECT` for a historical reason (https://github.com/apache/spark/pull/18079#discussion_r118842186). But since IMHO this is a common kind of mistake (this message annoyed me a few days ago in large SQL queries...), it might be worth dropping it from nonReserved.
> FYI: PostgreSQL throws an explicit error in this case:
> {code}
> postgres=# select a, count(1), from test group by b;
> ERROR:  syntax error at or near "from" at character 21
> STATEMENT:  select a, count(1), from test group by b;
> ERROR:  syntax error at or near "from"
> LINE 1: select a, count(1), from test group by b;
> {code}
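The effect of listing a keyword under nonReserved can be sketched with a tiny hand-written parser. This is only an illustration under stated assumptions: `KeywordSketch`, `Item`, `Query`, `tokenize`, and `parse` are hypothetical names, the grammar is a drastically simplified `select item [alias], ... [from table]`, expressions such as `count(1)` are treated as opaque tokens, trailing clauses are ignored, and none of this is how the ANTLR-generated Spark parser is implemented.

```scala
// Toy parser only: a hypothetical sketch, not Spark's SqlBase.g4 grammar.
object KeywordSketch {
  final case class Item(name: String, alias: Option[String])
  final case class Query(items: List[Item], table: Option[String])

  private val keywords = Set("select", "from", "group", "by")

  // Lower-cases the input and splits it on whitespace, keeping commas as tokens.
  def tokenize(sql: String): List[String] =
    sql.toLowerCase.replace(",", " , ").split("\\s+").filter(_.nonEmpty).toList

  def parse(sql: String, nonReserved: Set[String]): Either[String, Query] = {
    // A plain identifier is any token that is neither a comma nor a keyword.
    def isPlain(tok: String): Boolean = tok != "," && !keywords.contains(tok)
    // In the name position, nonReserved keywords are accepted as identifiers too.
    def isName(tok: String): Boolean = isPlain(tok) || nonReserved.contains(tok)

    // Parses `name [alias]` and returns the item plus the remaining tokens.
    def item(toks: List[String]): Either[String, (Item, List[String])] = toks match {
      case name :: alias :: rest if isName(name) && isPlain(alias) =>
        Right((Item(name, Some(alias)), rest))
      case name :: rest if isName(name) => Right((Item(name, None), rest))
      case tok :: _ => Left(s"mismatched input '$tok'")
      case Nil => Left("unexpected end of input")
    }

    // Parses a comma-separated list of items.
    def items(toks: List[String], acc: List[Item]): Either[String, (List[Item], List[String])] =
      item(toks) match {
        case Left(err) => Left(err)
        case Right((it, "," :: rest)) => items(rest, acc :+ it)
        case Right((it, rest)) => Right((acc :+ it, rest))
      }

    tokenize(sql) match {
      case "select" :: rest =>
        items(rest, Nil) match {
          case Left(err) => Left(err)
          case Right((its, "from" :: table :: _)) => Right(Query(its, Some(table)))
          case Right((its, _)) => Right(Query(its, None))
        }
      case _ => Left("expected 'select'")
    }
  }

  def main(args: Array[String]): Unit = {
    val sql = "select a, count(1), from t group by 1"
    println(parse(sql, nonReserved = Set("from")))        // `from` becomes a column aliased as t
    println(parse(sql, nonReserved = Set.empty[String]))  // the stray comma is a syntax error
  }
}
```

With `from` in the nonReserved set, the toy accepts the stray comma and produces a query with an item named `from` aliased as `t` and no table, which resembles the `'from AS t#11` over `OneRowRelation$` in the plan quoted above. With `from` reserved, parsing stops at the offending token, as PostgreSQL does.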
[jira] [Issue Comment Deleted] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suchith J N updated SPARK-22553:
--------------------------------
    Comment: was deleted

(was: Shall I submit a PR?)
[jira] [Commented] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307499#comment-16307499 ]

Suchith J N commented on SPARK-22553:
-------------------------------------

Shall I submit a PR?
[jira] [Comment Edited] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307467#comment-16307467 ]

Suchith J N edited comment on SPARK-22553 at 1/1/18 3:01 PM:
-------------------------------------------------------------

Removed FROM from nonReserved
[https://issues.apache.org/jira/secure/attachment/12904128/Removed_FROM_from_nonReserved_list.patch]

was (Author: suchithjn22):
Removed FROM from nonReserved
[jira] [Issue Comment Deleted] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suchith J N updated SPARK-22553:
--------------------------------
    Comment: was deleted

(was: {code}
--- sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (date 1514565609000)
+++ sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (revision )
@@ -746,7 +746,7 @@
     | NULL | ORDER | OUTER | TABLE | TRUE | WITH | RLIKE
     | AND | CASE | CAST | DISTINCT | DIV | ELSE | END | FUNCTION | INTERVAL | MACRO | OR | STRATIFY | THEN
     | UNBOUNDED | WHEN
-    | DATABASE | SELECT | FROM | WHERE | HAVING | TO | TABLE | WITH | NOT
+    | DATABASE | SELECT | WHERE | HAVING | TO | TABLE | WITH | NOT
     | DIRECTORY | BOTH | LEADING | TRAILING
     ;
{code}
)
[jira] [Updated] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suchith J N updated SPARK-22553:
--------------------------------
    Attachment: Removed_FROM_from_nonReserved_list.patch

Removed FROM from nonReserved
[jira] [Comment Edited] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307466#comment-16307466 ]

Suchith J N edited comment on SPARK-22553 at 1/1/18 2:58 PM:
-------------------------------------------------------------

{code}
--- sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (date 1514565609000)
+++ sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (revision )
@@ -746,7 +746,7 @@
     | NULL | ORDER | OUTER | TABLE | TRUE | WITH | RLIKE
     | AND | CASE | CAST | DISTINCT | DIV | ELSE | END | FUNCTION | INTERVAL | MACRO | OR | STRATIFY | THEN
     | UNBOUNDED | WHEN
-    | DATABASE | SELECT | FROM | WHERE | HAVING | TO | TABLE | WITH | NOT
+    | DATABASE | SELECT | WHERE | HAVING | TO | TABLE | WITH | NOT
     | DIRECTORY | BOTH | LEADING | TRAILING
     ;
{code}

was (Author: suchithjn22):
{code:java}
--- sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (date 1514565609000)
+++ sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (revision )
@@ -746,7 +746,7 @@
     | NULL | ORDER | OUTER | TABLE | TRUE | WITH | RLIKE
     | AND | CASE | CAST | DISTINCT | DIV | ELSE | END | FUNCTION | INTERVAL | MACRO | OR | STRATIFY | THEN
     | UNBOUNDED | WHEN
-    | DATABASE | SELECT | FROM | WHERE | HAVING | TO | TABLE | WITH | NOT
+    | DATABASE | SELECT | WHERE | HAVING | TO | TABLE | WITH | NOT
     | DIRECTORY | BOTH | LEADING | TRAILING
     ;
{code}
[jira] [Issue Comment Deleted] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suchith J N updated SPARK-22553:
    Comment: was deleted

(was: {panel:title=Removed_FROM_from_nonReserved_list.patch}
--- sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (date 1514565609000)
+++ sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (revision )
@@ -746,7 +746,7 @@
 | NULL | ORDER | OUTER | TABLE | TRUE | WITH | RLIKE | AND | CASE | CAST | DISTINCT | DIV | ELSE | END | FUNCTION | INTERVAL | MACRO | OR | STRATIFY | THEN | UNBOUNDED | WHEN
-| DATABASE | SELECT | FROM | WHERE | HAVING | TO | TABLE | WITH | NOT
+| DATABASE | SELECT | WHERE | HAVING | TO | TABLE | WITH | NOT
 | DIRECTORY | BOTH | LEADING | TRAILING ;
{panel}
)

> Drop FROM in nonReserved
>
> Key: SPARK-22553
> URL: https://issues.apache.org/jira/browse/SPARK-22553
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 2.2.0
> Reporter: Takeshi Yamamuro
> Priority: Trivial
>
> The simple query below throws a misleading error because nonReserved has `SELECT` in SqlBase.g4:
> {code}
> scala> Seq((1, 2)).toDF("a", "b").createTempView("t")
> scala> sql("select a, count(1), from t group by 1").show
> org.apache.spark.sql.AnalysisException: cannot resolve '`a`' given input columns: []; line 1 pos 7;
> 'Aggregate [unresolvedordinal(1)], ['a, count(1) AS count(1)#13L, 'from AS t#11]
> +- OneRowRelation$
> at org.apache.spark.sql.catalyst.analysis.package$AnalysisErrorAt.failAnalysis(package.scala:42)
> at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1$$anonfun$apply$2.applyOrElse(CheckAnalysis.scala:88)
> at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1$$anonfun$apply$2.applyOrElse(CheckAnalysis.scala:85)
> at org.apache.spark.sql.catalyst.trees.TreeNode$$anonfun$transformUp$1.apply(TreeNode.scala:289)
> {code}
> I know nonReserved currently has `SELECT` for historical reasons (https://github.com/apache/spark/pull/18079#discussion_r118842186). However, since IMHO this is a common kind of mistake (this message annoyed me a few days ago in large SQL queries...), it might be worth dropping it from nonReserved so that it becomes reserved.
> FYI: PostgreSQL throws an explicit error in this case:
> {code}
> postgres=# select a, count(1), from test group by b;
> ERROR: syntax error at or near "from" at character 21
> STATEMENT: select a, count(1), from test group by b;
> ERROR: syntax error at or near "from"
> LINE 1: select a, count(1), from test group by b;
> {code}
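The misparse in the report (the plan shows `'from AS t#11` over `OneRowRelation$`) can be illustrated with a toy model. The following is a minimal sketch in plain Scala under loud assumptions: it is not Spark's ANTLR grammar, and `NonReservedDemo`, `parse`, and the `reserved` parameter are hypothetical names introduced only for this illustration. It shows how a keyword that may double as an identifier lets a stray comma slip through, whereas a reserved keyword produces a syntax error at the right token.

```scala
// Toy model only; this is NOT Spark's ANTLR grammar.
object NonReservedDemo {
  /** Parses `select item [alias] {, item [alias]} [from table]` from pre-split tokens. */
  def parse(tokens: List[String], reserved: Set[String]): Either[String, String] = {
    // A token can serve as a name unless it is a comma or a reserved keyword.
    def isName(t: String): Boolean = t != "," && !reserved(t.toLowerCase)

    def project(sel: List[String], relation: String): String =
      "Project [" + sel.mkString(", ") + "] over " + relation

    // Returns the select items plus the tokens left over after the select list.
    def items(rest: List[String], acc: List[String]): Either[String, (List[String], List[String])] =
      rest match {
        case name :: tail if isName(name) =>
          // Optional alias: any name that does not start a FROM clause.
          val (item, after) = tail match {
            case alias :: more if isName(alias) && alias.toLowerCase != "from" =>
              (s"$name AS $alias", more)
            case _ => (name, tail)
          }
          after match {
            case "," :: more => items(more, item :: acc)
            case _ => Right(((item :: acc).reverse, after))
          }
        case tok :: _ => Left(s"syntax error at or near '$tok'")
        case Nil => Left("syntax error at end of input")
      }

    tokens match {
      case first :: rest if first.toLowerCase == "select" =>
        items(rest, Nil) match {
          case Left(err) => Left(err)
          // No FROM clause left: the query runs over a single dummy row.
          case Right((sel, Nil)) => Right(project(sel, "OneRowRelation"))
          case Right((sel, kw :: table :: Nil)) if kw.toLowerCase == "from" =>
            Right(project(sel, table))
          case Right((_, tok :: _)) => Left(s"syntax error at or near '$tok'")
        }
      case _ => Left("expected SELECT")
    }
  }

  def main(args: Array[String]): Unit = {
    // Same shape as the reported query: note the stray comma before `from`.
    val query = List("select", "a", ",", "cnt", ",", "from", "t")
    println(parse(query, Set.empty))   // Right(Project [a, cnt, from AS t] over OneRowRelation)
    println(parse(query, Set("from"))) // Left(syntax error at or near 'from')
  }
}
```

With an empty `reserved` set the toy parser accepts `from` as a column name aliased to `t`, mirroring the confusing plan in the report; with `from` reserved it fails at the offending token, much like the PostgreSQL message quoted above.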
[jira] [Commented] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307466#comment-16307466 ]

Suchith J N commented on SPARK-22553:

{code:java}
--- sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (date 1514565609000)
+++ sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (revision )
@@ -746,7 +746,7 @@
 | NULL | ORDER | OUTER | TABLE | TRUE | WITH | RLIKE | AND | CASE | CAST | DISTINCT | DIV | ELSE | END | FUNCTION | INTERVAL | MACRO | OR | STRATIFY | THEN | UNBOUNDED | WHEN
-| DATABASE | SELECT | FROM | WHERE | HAVING | TO | TABLE | WITH | NOT
+| DATABASE | SELECT | WHERE | HAVING | TO | TABLE | WITH | NOT
 | DIRECTORY | BOTH | LEADING | TRAILING ;
{code}
[jira] [Commented] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307464#comment-16307464 ]

Suchith J N commented on SPARK-22553:

{panel:title=Removed_FROM_from_nonReserved_list.patch}
--- sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (date 1514565609000)
+++ sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 (revision )
@@ -746,7 +746,7 @@
 | NULL | ORDER | OUTER | TABLE | TRUE | WITH | RLIKE | AND | CASE | CAST | DISTINCT | DIV | ELSE | END | FUNCTION | INTERVAL | MACRO | OR | STRATIFY | THEN | UNBOUNDED | WHEN
-| DATABASE | SELECT | FROM | WHERE | HAVING | TO | TABLE | WITH | NOT
+| DATABASE | SELECT | WHERE | HAVING | TO | TABLE | WITH | NOT
 | DIRECTORY | BOTH | LEADING | TRAILING ;
{panel}
[jira] [Commented] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307462#comment-16307462 ]

Suchith J N commented on SPARK-22553:

[^file:///home/suchithjn/OpenSource/Removed_FROM_from_nonReserved_list.patch]
[jira] [Issue Comment Deleted] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suchith J N updated SPARK-22553:
    Comment: was deleted

(was: [^file:///home/suchithjn/OpenSource/Removed_FROM_from_nonReserved_list.patch]
)
[jira] [Commented] (SPARK-22553) Drop FROM in nonReserved
[ https://issues.apache.org/jira/browse/SPARK-22553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16307458#comment-16307458 ]

Suchith J N commented on SPARK-22553:

Shall I work on this issue?