[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-08-02 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r207320333 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-08-01 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile Yes there is a change in the behavior, As i mentioned above in descriptions now we will be able to support wildcard even in the folder level for local file systems. Previous

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-08-01 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r206961528 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-31 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r206411960 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-30 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r206037303 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-30 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 ok, i missed it, let me have a look. Thanks --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark issue #21775: [SPARK-24812][SQL] Last Access Time in the table descrip...

2018-07-24 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/21775 @HyukjinKwon @gatorsmile All issues has been addressed, please let me know how this patch looks like. Thanks

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistics to i...

2018-07-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r204362254 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/ShuffleExchangeExec.scala --- @@ -231,6 +231,12 @@ object

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistics to i...

2018-07-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r204361301 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/TakeOrderedAndProjectSuite.scala --- @@ -22,6 +22,7 @@ import scala.util.Random

[GitHub] spark pull request #21775: [SPARK-24812][SQL] Last Access Time in the table ...

2018-07-22 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/21775#discussion_r204287713 --- Diff: docs/sql-programming-guide.md --- @@ -1843,6 +1843,7 @@ working with timestamps in `pandas_udf`s to get the best performance, see

[GitHub] spark pull request #21775: [SPARK-24812][SQL] Last Access Time in the table ...

2018-07-22 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/21775#discussion_r204286227 --- Diff: docs/sql-programming-guide.md --- @@ -1843,6 +1843,7 @@ working with timestamps in `pandas_udf`s to get the best performance, see

[GitHub] spark issue #21775: [SPARK-24812][SQL] Last Access Time in the table descrip...

2018-07-22 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/21775 @HyukjinKwon seems to be a limitation as of now even from hive, better we can follow the hive behavior unless the limitation has been resolved from hive. >> Hive-2526 is th

[GitHub] spark issue #21775: [SPARK-24812][SQL] Last Access Time in the table descrip...

2018-07-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/21775 sure, i will update the PR based on the comments, Thanks for suggestions.

[GitHub] spark pull request #21775: [SPARK-24812][SQL] Last Access Time in the table ...

2018-07-15 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/21775 [SPARK-24812][SQL] Last Access Time in the table description is not valid ## What changes were proposed in this pull request? Last Access Time will always displayed wrong date Wed Dec 31 15

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-14 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @srowen The only possible objection I can see is that the behavior for paths that contains a ? or * would now change :- There is no behavior change currently, currently user cannot

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202429058 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,44 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @srowen Thanks for the review. all comments has been addressed from my side. let me know for any clarifications

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255494 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,49 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255324 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,49 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255185 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala --- @@ -1912,11 +1912,58 @@ class SQLQuerySuite extends

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-07-13 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r202255230 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,49 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-07-01 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 This PR is open from many days, we need this fix in our version code also, can someone please review and provide me suggestions if any

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @HyukjinKwon Thanks for the re look.

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-06-20 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r196670750 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -303,94 +303,50 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-19 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile @jiangxb1987 @wzhfy @HyukjinKwon @vinodkc please review and let me know for any suggestions. i think all the gaps which we discussed as part of the PR is been addressed

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-19 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 Now my PR addresses the scenario what you mentioned.

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-06-08 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 As @kevinyu98 mentioned below usecase where '?' is been used in the load command will fail as when we create a Path instance with uri , the chars followed by ? will be removed as part

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-18 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 Even below kind of queries cannot run as exist() api while validation cannot identify wild chars, this is the reason in my PR i am using globStatus() API load data inpath 'hdfs

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-17 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 val uri = Utils.resolveURI(path), if you won't mention any scheme, this api will return

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-16 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 Thanks for you feedback, i will test once all my scenario with your fix, but here one more point i need to mention is like my fix has also has impact in the logic of local path

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-12 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @kevinyu98 Spark supports wildcards characters in both local and non local scenarios, the only restriction spark imposes is using wildcards in folder names of the local file-system path

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-08 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Currently all the load command use-cases with wildcard in a load path are blocked for the users

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-05-08 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy @HyukjinKwon @jiangxb1987 @gatorsmile @vinodkc, Please let me know for any suggestions regarding this PR. I covered all scenarios related to this PR. Working fine

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-04-15 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy @HyukjinKwon @jiangxb1987 @gatorsmile @vinodkc All test-cases has been passed, please let me know for any other suggestions. With current solution following improvement has

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-04-14 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r181543985 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -304,45 +304,14 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-04-10 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy i am working on it, when i ran locally few test-cases were failing, correcting the same. once done i will update. Thanks

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-04-03 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r179030611 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -385,7 +385,9 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-04-03 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r179030399 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -385,7 +385,9 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-29 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy Actually its not a behavior change , i think this condition was working in other spark 2.x versions. Some time back a PR is been added for load command hdfs path validation, in this PR

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @jiangxb1987 This usecase has dependency on hdfs file path and i could not find any testcases which validates or executes any scenarios against hdfs file path. can you please suggest me some

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy updated as per comments

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-26 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy @gatorsmile @jiangxb1987 due to some inconsistency in the previous test report due to inconsistent hdfs file paths i had updated it, i added the test results in one more section Test

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-16 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy @gatorsmile @jiangxb1987 updated the code based on comments, also added one more use-case to verify the normal case of data-loading without specifying wildcard in file path. let me

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-03-15 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r174868215 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -385,8 +385,12 @@ case class LoadDataCommand

[GitHub] spark pull request #20611: [SPARK-23425][SQL]Support wildcard in HDFS path f...

2018-03-15 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/20611#discussion_r174791662 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -385,8 +385,12 @@ case class LoadDataCommand

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-15 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Yeah there is a bit confusing, i updated the snapshot, data validation and consistency i verified

[GitHub] spark issue #20611: [SPARK-23425][SQL]Support wildcard in HDFS path for load...

2018-03-14 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy @gatorsmile @jiangxb1987 updated the test results and PR title based on zhenhuas comment, please review and let me know if still any improvement area is present for this particular PR

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-03-09 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy sure will do it. Thanks for suggestions

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-03-07 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @wzhfy Hive supports specifying wildcard in both file and folder level, please find test report attached below Use-case related to Hive File level wild card support ![hive_file

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-03-07 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 As above changes are applicable only for hdfs related paths, i did testing manually, please find the attached test report Usecase 1: Load data by specifying wild card character in the hdfs

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile Is any issue with this PR? can you please re look into this. Thanks

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-26 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @jiangxb1987 Yes, Hive supports such way of data loading, suppose we have many files which starts with same naming conventions, user will prefer to use such kind of queries where he can take

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-26 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile Build is passed, Please review and let me know for any suggestions. Thanks

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-23 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 @gatorsmile Seems to be a random failures, each time random set of test cases are failing. Please let me know for any suggestions

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 retest please

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-18 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 ![image](https://user-images.githubusercontent.com/12999161/36362491-5a8437e4-155b-11e8-80fd-885a1ebf045c.png) seems to be an invalid failure

[GitHub] spark issue #20611: [SPARK-23425][SQL]When wild card is been used in load co...

2018-02-14 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/20611 Please review and let me know for any suggestions. Thanks

[GitHub] spark pull request #20611: [SPARK-23425][SQL]When wild card is been used in ...

2018-02-14 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/20611 [SPARK-23425][SQL]When wild card is been used in load command system is throwing analysis exception ## What changes were proposed in this pull request? A validation logic is been added

[GitHub] spark issue #19823: [SPARK-22601][SQL] Data load is getting displayed succes...

2017-11-30 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 Thanks all for the review and guidance.

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-30 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/19823#discussion_r154187840 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2392,5 +2392,14 @@ abstract class DDLSuite extends

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-28 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/19823#discussion_r153693386 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2392,5 +2392,13 @@ abstract class DDLSuite extends

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-28 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/19823#discussion_r153693359 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2392,5 +2392,13 @@ abstract class DDLSuite extends

[GitHub] spark issue #19823: [SPARK-22601][SQL] Data load is getting displayed succes...

2017-11-28 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 @gatorsmile @HyukjinKwon @srowen Please review as i modified the code as per provided comments. thanks

[GitHub] spark issue #19823: [WIP][SPARK-22601][SQL] Data load is getting displayed s...

2017-11-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 retest this please

[GitHub] spark issue #19823: [WIP][SPARK-22601][SQL] Data load is getting displayed s...

2017-11-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 Basically this validation stands good for both cases where scheme can come as null and not null, i will update the logic as Sean told. Thanks

[GitHub] spark issue #19823: [WIP][SPARK-22601][SQL] Data load is getting displayed s...

2017-11-27 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 Thanks for the comments guys, i am working on it.,will update the PR based on comments.

[GitHub] spark issue #19823: [SPARK-22601][SQL] Data load is getting displayed succes...

2017-11-26 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/19823 ![loaddataissue_verificationresult](https://user-images.githubusercontent.com/12999161/33243050-73d8ec10-d304-11e7-8247-14eff2059f38.PNG

[GitHub] spark pull request #19823: [SPARK-22601][SQL] Data load is getting displayed...

2017-11-26 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/19823 [SPARK-22601][SQL] Data load is getting displayed successful on providing non existing hdfs file path ## What changes were proposed in this pull request? When user tries to load data

[GitHub] spark issue #17649: [SPARK-20380][SQL] Unable to set/unset table comment pro...

2017-05-05 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @gatorsmile you are right, pr should address the issue which is handled in alter table set/unset properties ddls, updated the description and title, let me know for any clarifications. thanks

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-05-05 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @wzhfy @gatorsmile fixed all the comments, thanks for reviewing the changes and providing me valuable sharings and comments. thanks. --- If your project is set up for it, you can reply

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-26 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r113599057 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/InMemoryCatalog.scala --- @@ -295,7 +295,9 @@ class InMemoryCatalog

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-26 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r113427428 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/InMemoryCatalog.scala --- @@ -295,7 +295,9 @@ class InMemoryCatalog

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-04-25 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @gatorsmile @wzhfy updated the PR by removing the 'comment' from table properties .

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r112840921 --- Diff: sql/core/src/test/resources/sql-tests/inputs/describe-table-after-alter-table.sql --- @@ -0,0 +1,29 @@ +CREATE TABLE table_with_comment

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-23 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r112838259 --- Diff: sql/core/src/test/resources/sql-tests/results/describe-table-after-alter-table.sql.out --- @@ -0,0 +1,162 @@ +-- Automatically generated

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-04-20 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 @gatorsmile @wzhfy updated the PR based on the review comments, please let me know for any suggestions.

[GitHub] spark pull request #17649: [SPARK-20023][SQL][follow up] Output table commen...

2017-04-16 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r111696709 --- Diff: sql/core/src/test/resources/sql-tests/inputs/describe_tbleproperty_validation.sql --- @@ -0,0 +1,24 @@ +CREATE TABLE table_with_comment

[GitHub] spark pull request #17649: [SPARK-20023][SQL][follow up] Output table commen...

2017-04-16 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r111696418 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala --- @@ -232,7 +232,9 @@ case class AlterTableSetPropertiesCommand

[GitHub] spark issue #17649: [SPARK-20023][SQL][follow up] Output table comment for D...

2017-04-16 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/17649 cc @wzhfy

[GitHub] spark pull request #17649: [SPARK-20023][SQL][follow up] Output table commen...

2017-04-16 Thread sujith71955
GitHub user sujith71955 opened a pull request: https://github.com/apache/spark/pull/17649 [SPARK-20023][SQL][follow up] Output table comment for DESC FORMATTED after adding/modifying table comment using Alter TableSetPropertiesCommand ### What changes were proposed in this pull

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-08 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r100033968 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode

[GitHub] spark issue #16677: [SPARK-19355][SQL] Use map output statistices to improve...

2017-02-06 Thread sujith71955
Github user sujith71955 commented on the issue: https://github.com/apache/spark/pull/16677 @viirya i tested with the above mentioned approach with sample data, it has improved the performance almost into 3X Please find the test report Total No of Executers = 3 Total

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-04 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r99463628 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala --- @@ -216,7 +216,7 @@ class PlannerSuite extends SharedSQLContext
