[GitHub] spark pull request #19463: Cleanup comment in RDDSuite test

2017-10-10 Thread sohum2002
Github user sohum2002 closed the pull request at: https://github.com/apache/spark/pull/19463 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19463: Cleanup comment in RDDSuite test

2017-10-09 Thread sohum2002
Github user sohum2002 commented on the issue: https://github.com/apache/spark/pull/19463 I just added "Removed one comment from RDDSuite." to the PR description. Will this suffice? --- - To unsubscri

[GitHub] spark pull request #19463: Cleanup comment in RDDSuite test

2017-10-09 Thread sohum2002
GitHub user sohum2002 opened a pull request: https://github.com/apache/spark/pull/19463 Cleanup comment in RDDSuite test ## What changes were proposed in this pull request? There were not changes proposed in this pull request. ## How was this patch tested

[GitHub] spark pull request #19454: [SPARK-22152][SPARK-18855][SQL] Added flatten fun...

2017-10-09 Thread sohum2002
Github user sohum2002 closed the pull request at: https://github.com/apache/spark/pull/19454 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19454: [SPARK-22152][SPARK-18855][SQL] Added flatten functions ...

2017-10-09 Thread sohum2002
Github user sohum2002 commented on the issue: https://github.com/apache/spark/pull/19454 Thank you all for your comments. I hope to improve in my future PRs. Cheers! --- - To unsubscribe, e-mail: reviews-unsubscr

[GitHub] spark issue #19454: [SPARK-22152][SPARK-18855][SQL] Added flatten functions ...

2017-10-09 Thread sohum2002
Github user sohum2002 commented on the issue: https://github.com/apache/spark/pull/19454 @HyukjinKwon - Thank you for your comments and analysis of this PR. I will also try to improve the `flatMap(identity)` as mentioned by @srowen. Also, will add a python implementation

[GitHub] spark issue #19454: [SPARK-22152][SPARK-18855 ][SQL] Added flatten functions...

2017-10-09 Thread sohum2002
Github user sohum2002 commented on the issue: https://github.com/apache/spark/pull/19454 Would appreciate some help in the Python implementation of the `flatten` function as I have never used pyspark. Could someone help me out

[GitHub] spark pull request #19454: Added flatten functions for RDD and Dataset

2017-10-08 Thread sohum2002
GitHub user sohum2002 opened a pull request: https://github.com/apache/spark/pull/19454 Added flatten functions for RDD and Dataset ## What changes were proposed in this pull request? This PR creates a _flatten_ function in two places: RDD and Dataset classes. This PR resolves

[GitHub] spark pull request #19453: Added selectAllColumns function in Dataset class

2017-10-07 Thread sohum2002
Github user sohum2002 closed the pull request at: https://github.com/apache/spark/pull/19453 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19453: Added selectAllColumns function in Dataset class

2017-10-07 Thread sohum2002
Github user sohum2002 commented on the issue: https://github.com/apache/spark/pull/19453 @srowen - This is a good point, let me close this PR. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark pull request #19453: Added selectAllColumns function in Dataset class

2017-10-07 Thread sohum2002
GitHub user sohum2002 opened a pull request: https://github.com/apache/spark/pull/19453 Added selectAllColumns function in Dataset class The proposed two new additional functions is to help select all the columns in a Dataset except for given columns. You can merge this pull

[GitHub] spark pull request #19446: Dataset optimization

2017-10-06 Thread sohum2002
Github user sohum2002 closed the pull request at: https://github.com/apache/spark/pull/19446 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #19446: Dataset optimization

2017-10-06 Thread sohum2002
GitHub user sohum2002 opened a pull request: https://github.com/apache/spark/pull/19446 Dataset optimization The proposed two new additional functions is to help select all the columns in a Dataset except for given columns. You can merge this pull request into a Git repository

[GitHub] spark pull request #19445: Dataset select all columns

2017-10-06 Thread sohum2002
Github user sohum2002 closed the pull request at: https://github.com/apache/spark/pull/19445 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #19445: Dataset select all columns

2017-10-06 Thread sohum2002
GitHub user sohum2002 opened a pull request: https://github.com/apache/spark/pull/19445 Dataset select all columns The proposed two new additional functions is to help select all the columns in a Dataset except for given columns. You can merge this pull request into a Git