[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-07-24 Thread bbossy
Github user bbossy commented on the issue: https://github.com/apache/spark/pull/18269 @HyukjinKwon I'll see that I can address the outstanding review comments in the next day or two. --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-07-23 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/18269 gentle ping @bbossy, I just want to be sure if it is in progress in any way.

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-21 Thread mallman
Github user mallman commented on the issue: https://github.com/apache/spark/pull/18269 Hi @bbossy > Does it match your scenario? It does not match my scenario. I'm reading files from HDFS. In your test, you're reading files from the local filesystem. Can you try a

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-18 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/18269 Let's wait for @mallman's response to make sure this patch does fix the problem.

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-18 Thread bbossy
Github user bbossy commented on the issue: https://github.com/apache/spark/pull/18269 @mallman I'm not sure where this difference in behaviour is coming from. The following test in `FileIndexSuite` passes: ``` test("mallman's scenario") {

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-18 Thread bbossy
Github user bbossy commented on the issue: https://github.com/apache/spark/pull/18269 @cloud-fan Could you take another look, please?

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-16 Thread mallman
Github user mallman commented on the issue: https://github.com/apache/spark/pull/18269 @bbossy I've built and deployed a branch of Spark 2.2 with your patch and compared its behavior to the same branch of Spark 2.2 without your patch. I'm seeing different behavior, but not what I

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-13 Thread bbossy
Github user bbossy commented on the issue: https://github.com/apache/spark/pull/18269 ping @gatorsmile @srowen and possibly @cloud-fan : Would like to hear your thoughts on this.

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-12 Thread bbossy
Github user bbossy commented on the issue: https://github.com/apache/spark/pull/18269 @gatorsmile : I ran a synthetic scenario to show what changes, since deploying this branch would be more involved. I created two very simple relations on a small HDFS cluster (4

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-12 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/18269 Could you please update the PR description by copying the contents from the JIRA? Any performance number you can share?

[GitHub] spark issue #18269: [SPARK-21056][SQL] Use at most one spark job to list fil...

2017-06-11 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18269 Can one of the admins verify this patch?