[GitHub] spark issue #16677: [SPARK-19355][SQL] Use map output statistics to improve...

2017-02-06 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/16677 @viirya I tested the above-mentioned approach with sample data; it improved performance by almost 3X. Please find the test report: Total No of Executors = 3 Total

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistics to ...

2017-02-08 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r100033968 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistics to ...

2017-02-04 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r99463628 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala --- @@ -216,7 +216,7 @@ class PlannerSuite extends SharedSQLContext

[GitHub] spark pull request #17649: [SPARK-20023][SQL][follow up] Output table commen...

2017-04-16 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r111696709 --- Diff: sql/core/src/test/resources/sql-tests/inputs/describe_tbleproperty_validation.sql --- @@ -0,0 +1,24 @@ +CREATE TABLE table_with_comment

[GitHub] spark pull request #17649: [SPARK-20023][SQL][follow up] Output table commen...

2017-04-16 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r111696418 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala --- @@ -232,7 +232,9 @@ case class AlterTableSetPropertiesCommand

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-04-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @gatorsmile @wzhfy updated the PR based on the review comments, please let me know for any suggestions. --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] spark pull request #17649: [SPARK-20023][SQL][follow up] Output table commen...

2017-04-16 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/17649 [SPARK-20023][SQL][follow up] Output table comment for DESC FORMATTED after adding/modifying table comment using AlterTableSetPropertiesCommand ### What changes were proposed in this pull

[GitHub] spark issue #17649: [SPARK-20023][SQL][follow up] Output table comment for D...

2017-04-16 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 cc @wzhfy

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r112840921 --- Diff: sql/core/src/test/resources/sql-tests/inputs/describe-table-after-alter-table.sql --- @@ -0,0 +1,29 @@ +CREATE TABLE table_with_comment

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r112838259 --- Diff: sql/core/src/test/resources/sql-tests/results/describe-table-after-alter-table.sql.out --- @@ -0,0 +1,162 @@ +-- Automatically generated

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-04-25 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @gatorsmile @wzhfy Updated the PR by removing the 'comment' from table properties.

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-26 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r113427428 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/InMemoryCatalog.scala --- @@ -295,7 +295,9 @@ class InMemoryCatalog

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-26 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r113599057 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/InMemoryCatalog.scala --- @@ -295,7 +295,9 @@ class InMemoryCatalog

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-05-05 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @wzhfy @gatorsmile Fixed all the comments. Thanks for reviewing the changes and for the valuable suggestions.

[GitHub] spark issue #17649: [SPARK-20380][SQL] Unable to set/unset table comment pro...

2017-05-05 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @gatorsmile You are right, the PR should address the issue that is handled in the alter table set/unset properties DDLs. Updated the description and title; let me know if any clarification is needed. Thanks

[GitHub] spark issue #19823: [WIP][SPARK-22601][SQL] Data load is getting displayed s...

2017-11-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #19823: [WIP][SPARK-22601][SQL] Data load is getting displayed s...

2017-11-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 Basically this validation stands good for both cases where scheme can come as null and not null, i will update the logic as Sean told. Thanks

[GitHub] spark issue #19823: [WIP][SPARK-22601][SQL] Data load is getting displayed s...

2017-11-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 Thanks for the comments guys, I am working on it and will update the PR based on the comments.

[GitHub] spark issue #19823: [SPARK-22601][SQL] Data load is getting displayed succes...

2017-11-28 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 @gatorsmile @HyukjinKwon @srowen Please review; I modified the code as per the provided comments. Thanks

[GitHub] spark issue #19823: [SPARK-22601][SQL] Data load is getting displayed succes...

2017-11-26 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 ![loaddataissue_verificationresult](https://user-images.githubusercontent.com/12999161/33243050-73d8ec10-d304-11e7-8247-14eff2059f38.PNG

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-26 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/19823 [SPARK-22601][SQL] Data load is getting displayed successful on providing non existing hdfs file path ## What changes were proposed in this pull request? When user tries to load data

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-28 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/19823#discussion_r153693359 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2392,5 +2392,13 @@ abstract class DDLSuite extends

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-28 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/19823#discussion_r153693386 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2392,5 +2392,13 @@ abstract class DDLSuite extends

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-30 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/19823#discussion_r154187840 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2392,5 +2392,14 @@ abstract class DDLSuite extends

[GitHub] spark issue #19823: [SPARK-22601][SQL] Data load is getting displayed succes...

2017-11-30 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 Thanks all for the review and guidance.

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-08 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy @HyukjinKwon @jiangxb1987 @gatorsmile @vinodkc, Please let me know for any suggestions regarding this PR. I covered all scenarios related to this PR. Working fine

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-08 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Currently all the load command use-cases with wildcard in a load path are blocked for the users

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-12 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 Spark supports wildcard characters in both local and non-local scenarios; the only restriction Spark imposes is on using wildcards in folder names of the local file-system path

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-17 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 val uri = Utils.resolveURI(path): if you don't mention any scheme, this API will return

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-18 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 Even queries like the one below cannot run, because the exists() API used during validation cannot recognize wildcard characters; this is the reason my PR uses the globStatus() API. load data inpath 'hdfs
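
The difference between a literal existence check and glob expansion, as described in the comment above, can be sketched with the JDK's own file APIs. This is only an illustration under stated assumptions: it uses `java.nio.file` on a local temporary directory instead of Hadoop's `FileSystem`, the class and method names (`GlobVsExistsDemo`, `existsLiterally`, `countGlobMatches`, `makeSampleDir`) are hypothetical, and it assumes a file system where `*` is a legal file-name character (for example Linux).

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;

public class GlobVsExistsDemo {
    // Literal existence check: "*" is treated as an ordinary character,
    // so a pattern such as "part-*.csv" is looked up as a file with that exact name.
    static boolean existsLiterally(Path dir, String name) {
        return Files.exists(dir.resolve(name));
    }

    // Glob expansion: count the directory entries whose names match the pattern.
    static int countGlobMatches(Path dir, String pattern) {
        int matches = 0;
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir, pattern)) {
            for (Path ignored : stream) {
                matches++;
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return matches;
    }

    // Hypothetical sample data: a temp directory holding part-0.csv and part-1.csv.
    static Path makeSampleDir() {
        try {
            Path dir = Files.createTempDirectory("load-demo");
            Files.createFile(dir.resolve("part-0.csv"));
            Files.createFile(dir.resolve("part-1.csv"));
            return dir;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path dir = makeSampleDir();
        // The literal check finds no file named "part-*.csv" ...
        System.out.println("exists: " + existsLiterally(dir, "part-*.csv"));
        // ... while the glob matches both sample files.
        System.out.println("glob matches: " + countGlobMatches(dir, "part-*.csv"));
    }
}
```

A validation step built on the literal check would reject the wildcard path even though matching files are present, which appears to be the failure mode the comment describes for the HDFS case.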

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-16 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 Thanks for your feedback. I will test all my scenarios once with your fix, but one more point I need to mention is that my fix also has an impact on the logic of local path

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-08 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 As @kevinyu98 mentioned, the use case below, where '?' is used in the load command, will fail because when we create a Path instance from a URI, the characters following the ? are removed as part
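
The '?' behavior mentioned above follows from generic URI syntax, where '?' starts the query component. A minimal sketch using only `java.net.URI` is shown here; the host name, port, and file name are made up for illustration, the class `UriQuestionMarkDemo` is hypothetical, and the link to how Hadoop's `Path` consumes the URI is an assumption rather than something the thread excerpt confirms.

```java
import java.net.URI;

public class UriQuestionMarkDemo {
    // Path component of the parsed URI; everything from '?' onward is excluded.
    static String pathOf(String raw) {
        return URI.create(raw).getPath();
    }

    // Query component of the parsed URI, i.e. the text after the first '?'.
    static String queryOf(String raw) {
        return URI.create(raw).getQuery();
    }

    public static void main(String[] args) {
        // Hypothetical load path that uses '?' as a single-character wildcard.
        String raw = "hdfs://namenode:8020/data/part-?.csv";
        System.out.println(pathOf(raw));  // prints /data/part-
        System.out.println(queryOf(raw)); // prints .csv
    }
}
```

Any code that keeps only the path component of such a URI loses the wildcard and the rest of the file name, so a path containing '?' has to be handled before it is parsed as a URI.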

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-19 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 My PR now addresses the scenario you mentioned.

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-01 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 This PR has been open for many days and we also need this fix in our own version of the code. Can someone please review it and provide suggestions, if any

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-06-20 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r196670750 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,50 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @HyukjinKwon Thanks for the re-look.

[GitHub] spark pull request #20611: [SPARK-23425][SQL]When wild card is been used in ...

2018-02-14 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/20611 [SPARK-23425][SQL]When wild card is been used in load command system is throwing analysis exception ## What changes were proposed in this pull request? Validation logic has been added

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-14 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Please review and let me know for any suggestions. Thanks

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 retest please

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-18 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 ![image](https://user-images.githubusercontent.com/12999161/36362491-5a8437e4-155b-11e8-80fd-885a1ebf045c.png) seems to be an invalid failure

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-23 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile These seem to be random failures; a different set of test cases fails each time. Please let me know for any suggestions

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-19 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile @jiangxb1987 @wzhfy @HyukjinKwon @vinodkc Please review and let me know for any suggestions. I think all the gaps we discussed as part of the PR have been addressed

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-02 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 No other changes in the load command behavior

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-02 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 I think "support wildcard" is a confusing term :)

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-02 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 I will reiterate again, actually this PR was intended for fixing the issue related to wildcard character issue in the hdfs file system scenarios , with the current solution we are also able

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-08-02 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r207320333 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-31 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r206411960 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-08-01 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r206961528 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-01 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile Yes there is a change in the behavior, As i mentioned above in descriptions now we will be able to support wildcard even in the folder level for local file systems. Previous

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-30 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 OK, I missed it, let me have a look. Thanks

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-30 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r206037303 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-08 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile I added the comment. Thanks

[GitHub] spark pull request #22120: [SPARK-25131]Event logs missing applicationAttemp...

2018-08-16 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/22120#discussion_r210599272 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/scheduler/cluster/YarnClientSchedulerBackend.scala --- @@ -62,6 +62,10 @@ private

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-17 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Working fine with latest code. Thanks !!!

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-17 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Did some testing in my cluster with updated code for verifying the load command with hdfs paths, please find the test results. Local path testing is already covered in my UT

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-07 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Updated the PR by fixing the comment from Sean. Hope I addressed all the issues :)

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-13 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Hi All, can we have a re-look at this PR? Please let me know whether it looks fine. Thanks

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @srowen Make this method private -- can be right? This is more like a Util method where any feature deals with file system can use this method to form a path instance without

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @srowen Got your point, I will update

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-21 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @srowen Fixed the pending comments. Kindly recheck. Thanks

[GitHub] spark pull request #22199: [SPARK-25073][Yarn] AM and Executor Memory valida...

2018-08-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/22199#discussion_r212356336 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -338,13 +338,14 @@ private[spark] class Client

[GitHub] spark pull request #22199: [SPARK-25073][Yarn] AM and Executor Memory valida...

2018-08-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/22199#discussion_r212354862 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -338,13 +338,14 @@ private[spark] class Client

[GitHub] spark pull request #22199: [SPARK-25073][Yarn] AM and Executor Memory valida...

2018-08-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/22199#discussion_r212356651 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -338,13 +338,14 @@ private[spark] class Client

[GitHub] spark pull request #22199: [SPARK-25073][Yarn] AM and Executor Memory valida...

2018-08-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/22199#discussion_r212392740 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -338,13 +338,14 @@ private[spark] class Client

[GitHub] spark pull request #22199: [SPARK-25073][Yarn] AM and Executor Memory valida...

2018-08-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/22199#discussion_r212370528 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -338,13 +338,14 @@ private[spark] class Client

[GitHub] spark pull request #22199: [SPARK-25073][Yarn] AM and Executor Memory valida...

2018-08-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/22199#discussion_r212396099 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -338,13 +338,14 @@ private[spark] class Client

[GitHub] spark pull request #22199: [SPARK-25073][SQL]When wild card is been used in ...

2018-08-23 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/22199 [SPARK-25073][SQL]When wild card is been used in load command system ## What changes were proposed in this pull request? When the yarn.nodemanager.resource.memory-mb

[GitHub] spark issue #21775: [SPARK-24812][SQL] Last Access Time in the table descrip...

2018-07-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/21775 Sure, I will update the PR based on the comments. Thanks for the suggestions.

[GitHub] spark pull request #21775: [SPARK-24812][SQL] Last Access Time in the table ...

2018-07-15 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/21775 [SPARK-24812][SQL] Last Access Time in the table description is not valid ## What changes were proposed in this pull request? Last Access Time is always displayed as a wrong date, Wed Dec 31 15

[GitHub] spark issue #21775: [SPARK-24812][SQL] Last Access Time in the table descrip...

2018-07-24 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/21775 @HyukjinKwon @gatorsmile All issues have been addressed; please let me know how this patch looks. Thanks

[GitHub] spark issue #21775: [SPARK-24812][SQL] Last Access Time in the table descrip...

2018-07-22 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/21775 @HyukjinKwon seems to be a limitation as of now even from hive, better we can follow the hive behavior unless the limitation has been resolved from hive. >> Hive-2526 is th

[GitHub] spark pull request #21775: [SPARK-24812][SQL] Last Access Time in the table ...

2018-07-22 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/21775#discussion_r204287713 --- Diff: docs/sql-programming-guide.md --- @@ -1843,6 +1843,7 @@ working with timestamps in `pandas_udf`s to get the best performance, see

[GitHub] spark pull request #21775: [SPARK-24812][SQL] Last Access Time in the table ...

2018-07-22 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/21775#discussion_r204286227 --- Diff: docs/sql-programming-guide.md --- @@ -1843,6 +1843,7 @@ working with timestamps in `pandas_udf`s to get the best performance, see

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistics to i...

2018-07-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r204361301 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/TakeOrderedAndProjectSuite.scala --- @@ -22,6 +22,7 @@ import scala.util.Random

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistics to i...

2018-07-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r204362254 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/ShuffleExchangeExec.scala --- @@ -231,6 +231,12 @@ object

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-14 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @srowen The only possible objection I can see is that the behavior for paths that contains a ? or * would now change :- There is no behavior change currently, currently user cannot

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-10 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216417629 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-10 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216425911 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216638725 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216638992 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @srowen Thanks for the review. All comments have been addressed from my side. Let me know if any clarification is needed

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255185 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala --- @@ -1912,11 +1912,58 @@ class SQLQuerySuite extends

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255230 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,49 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255324 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,49 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255494 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,49 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202429058 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216685077 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216693375 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216692677 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216694466 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-09-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r216693154 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-03-07 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 As the above changes apply only to HDFS paths, I tested them manually; please find the attached test report. Use case 1: Load data by specifying a wildcard character in the hdfs

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-03-07 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy Hive supports wildcards at both the file and folder level; please find the test report attached below. Use case related to Hive file-level wildcard support ![hive_file

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-03-15 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r174868215 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -385,8 +385,12 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-14 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy @gatorsmile @jiangxb1987 I updated the test results and PR title based on Zhenhua's comment. Please review and let me know if anything in this PR still needs improvement.

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-03-15 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r174791662 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -385,8 +385,12 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-15 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Yeah, that was a bit confusing. I updated the snapshot and verified the data validation and consistency.

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-03-09 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy Sure, will do. Thanks for the suggestions. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-04-10 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy I am working on it. When I ran the tests locally, a few test cases failed, and I am fixing them. I will update once done. Thanks

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-04-14 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r181543985 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -304,45 +304,14 @@ case class LoadDataCommand
