[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19355 **[Test build #82209 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82209/testReport)** for PR 19355 at commit

[GitHub] spark issue #19327: [WIP] Implement stream-stream outer joins.

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19327 **[Test build #82208 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82208/testReport)** for PR 19327 at commit

[GitHub] spark issue #19327: [WIP] Implement stream-stream outer joins.

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19327 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19327: [WIP] Implement stream-stream outer joins.

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19327 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82208/ Test FAILed. ---

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19355 **[Test build #82209 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82209/testReport)** for PR 19355 at commit

[GitHub] spark pull request #19356: Merge pull request #1 from apache/master

2017-09-26 Thread yaozhang2016
GitHub user yaozhang2016 opened a pull request: https://github.com/apache/spark/pull/19356 Merge pull request #1 from apache/master update from origin ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How

[GitHub] spark issue #19356: Merge pull request #1 from apache/master

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19356 Can one of the admins verify this patch?

[GitHub] spark issue #19349: [SPARK-22125][PYSPARK][SQL] Enable Arrow Stream format f...

2017-09-26 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/19349 Nice job on refactoring `PythonRunner`! I think we should just replace the arrow file format with stream format for pandas udf instead of having a new conf to enable it, as long as all the

[GitHub] spark issue #19353: [SPARK-22103][FOLLOWUP] Rename addExtraCode to addInnerC...

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19353 Merged build finished. Test PASSed.

[GitHub] spark issue #19353: [SPARK-22103][FOLLOWUP] Rename addExtraCode to addInnerC...

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19353 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82204/ Test PASSed. ---

[GitHub] spark issue #19356: Merge pull request #1 from apache/master

2017-09-26 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/19356 Close this @yaozhang2016

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19355 Merged build finished. Test PASSed.

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19355 **[Test build #82205 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82205/testReport)** for PR 19355 at commit

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19355 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82205/ Test PASSed. ---

[GitHub] spark issue #19327: [WIP] Implement stream-stream outer joins.

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19327 **[Test build #82208 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82208/testReport)** for PR 19327 at commit

[GitHub] spark issue #19343: [SPARK-22121][SQL] Correct database location for namenod...

2017-09-26 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19343 Thanks! Maybe we can close it now and revisit it when we have a better way to resolve the file system specific issues? ---

[GitHub] spark pull request #19343: [SPARK-22121][SQL] Correct database location for ...

2017-09-26 Thread squito
Github user squito closed the pull request at: https://github.com/apache/spark/pull/19343 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19343: [SPARK-22121][SQL] Correct database location for namenod...

2017-09-26 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/19343 OK, closing this and the jira

[GitHub] spark issue #19353: [SPARK-22103][FOLLOWUP] Rename addExtraCode to addInnerC...

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19353 **[Test build #82204 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82204/testReport)** for PR 19353 at commit

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19355 Merged build finished. Test FAILed.

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19355 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82209/ Test FAILed. ---

[GitHub] spark pull request #19349: [SPARK-22125][PYSPARK][SQL] Enable Arrow Stream f...

2017-09-26 Thread BryanCutler
Github user BryanCutler commented on a diff in the pull request: https://github.com/apache/spark/pull/19349#discussion_r141142944 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/python/ArrowStreamPythonUDFRunner.scala --- @@ -0,0 +1,197 @@ +/* + *

[GitHub] spark issue #19272: [Spark-21842][Mesos] Support Kerberos ticket renewal and...

2017-09-26 Thread ArtRand
Github user ArtRand commented on the issue: https://github.com/apache/spark/pull/19272 Hey @kalvinnchau, good catch on the first renewal time. I believe I addressed it. Have a look. Thanks again.

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19355 **[Test build #82210 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82210/testReport)** for PR 19355 at commit

[GitHub] spark pull request #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compres...

2017-09-26 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/19218#discussion_r141187555 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/InsertSuite.scala --- @@ -728,4 +728,120 @@ class InsertSuite extends QueryTest with

[GitHub] spark pull request #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compres...

2017-09-26 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/19218#discussion_r141187890 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/InsertSuite.scala --- @@ -728,4 +728,120 @@ class InsertSuite extends QueryTest with

[GitHub] spark issue #18098: [SPARK-16944][Mesos] Improve data locality when launchin...

2017-09-26 Thread gpang
Github user gpang commented on the issue: https://github.com/apache/spark/pull/18098 Hi @srowen @vanzin, would you be able to help me with this PR? Thanks!

[GitHub] spark issue #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compression.co...

2017-09-26 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/19218 Hi, @fjh100456. I left a few comments.

[GitHub] spark issue #19270: [SPARK-21809] : Change Stage Page to use datatables to s...

2017-09-26 Thread pgandhi999
Github user pgandhi999 commented on the issue: https://github.com/apache/spark/pull/19270 Ok, I will look into it. I am currently fixing UI bugs and unit tests, so I will commit those changes first and then look into the above issue. Thank you.

[GitHub] spark issue #19348: [BUILD] Close stale PRs

2017-09-26 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19348 Add: #15009 #18253

[GitHub] spark pull request #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compres...

2017-09-26 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/19218#discussion_r141186378 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/InsertSuite.scala --- @@ -728,4 +728,120 @@ class InsertSuite extends QueryTest with

[GitHub] spark pull request #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compres...

2017-09-26 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/19218#discussion_r141188386 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/InsertSuite.scala --- @@ -728,4 +728,120 @@ class InsertSuite extends QueryTest with

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/19355 Retest this please.

[GitHub] spark issue #18887: [SPARK-20642][core] Store FsHistoryProvider listing data...

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18887 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82206/ Test PASSed. ---

[GitHub] spark issue #18887: [SPARK-20642][core] Store FsHistoryProvider listing data...

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18887 Merged build finished. Test PASSed.

[GitHub] spark issue #19327: [WIP] Implement stream-stream outer joins.

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19327 **[Test build #82211 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82211/testReport)** for PR 19327 at commit

[GitHub] spark issue #19270: [SPARK-21809] : Change Stage Page to use datatables to s...

2017-09-26 Thread ajbozarth
Github user ajbozarth commented on the issue: https://github.com/apache/spark/pull/19270 Ok, so I'm still doing more testing, but I've narrowed down the above problem. The above error occurs when using either local or standalone mode; it doesn't appear when using yarn. I'll continue

[GitHub] spark issue #19020: [SPARK-3181] [ML] Implement huber loss for LinearRegress...

2017-09-26 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/19020 > We have two candidate name: epsilon or m I see; that seems fine then, though I worry that we use "epsilon" in MLlib (tests) for "a very small positive number." Can we document it more

[GitHub] spark issue #18887: [SPARK-20642][core] Store FsHistoryProvider listing data...

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18887 **[Test build #82206 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82206/testReport)** for PR 18887 at commit

[GitHub] spark issue #19272: [Spark-21842][Mesos] Support Kerberos ticket renewal and...

2017-09-26 Thread kalvinnchau
Github user kalvinnchau commented on the issue: https://github.com/apache/spark/pull/19272 @ArtRand thanks! I've been testing a local version of that; I'll pull that change in and test it as well.

[GitHub] spark issue #19294: [SPARK-21549][CORE] Respect OutputFormats with no output...

2017-09-26 Thread szhem
Github user szhem commented on the issue: https://github.com/apache/spark/pull/19294 Hello guys, is there a chance for this patch to be merged to master?

[GitHub] spark issue #19348: [BUILD] Close stale PRs

2017-09-26 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19348 Added.

[GitHub] spark issue #19357: [SPARK-21322][SQL][WIP] support histogram in filter card...

2017-09-26 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19357 Jenkins, add to whitelist

[GitHub] spark issue #19327: [SPARK-22136][SS] Implement stream-stream outer joins.

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19327 **[Test build #82211 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82211/testReport)** for PR 19327 at commit

[GitHub] spark issue #19357: [SPARK-21322][SQL][WIP] support histogram in filter card...

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19357 **[Test build #82213 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82213/testReport)** for PR 19357 at commit

[GitHub] spark issue #19357: [SPARK-21322][SQL][WIP] support histogram in filter card...

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19357 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82213/ Test FAILed. ---

[GitHub] spark issue #19338: [SPARK-22123][CORE] Add latest failure reason for task s...

2017-09-26 Thread caneGuy
Github user caneGuy commented on the issue: https://github.com/apache/spark/pull/19338 Thanks @squito, I have updated the description.

[GitHub] spark pull request #19338: [SPARK-22123][CORE] Add latest failure reason for...

2017-09-26 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/19338#discussion_r141226456 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSetBlacklist.scala --- @@ -61,6 +61,16 @@ private[scheduler] class TaskSetBlacklist(val

[GitHub] spark pull request #19338: [SPARK-22123][CORE] Add latest failure reason for...

2017-09-26 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/19338#discussion_r141234485 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala --- @@ -670,9 +670,12 @@ private[spark] class TaskSetManager(

[GitHub] spark pull request #19338: [SPARK-22123][CORE] Add latest failure reason for...

2017-09-26 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/19338#discussion_r141235801 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala --- @@ -670,9 +670,12 @@ private[spark] class TaskSetManager(

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19355 **[Test build #82210 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82210/testReport)** for PR 19355 at commit

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19355 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82210/ Test PASSed. ---

[GitHub] spark issue #19355: [SPARK-22130][Core] UTF8String.trim() scans " " twice

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19355 Merged build finished. Test PASSed.

[GitHub] spark issue #19348: [BUILD] Close stale PRs

2017-09-26 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19348 Merged to master.

[GitHub] spark issue #19327: [WIP] Implement stream-stream outer joins.

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19327 **[Test build #82212 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82212/testReport)** for PR 19327 at commit

[GitHub] spark issue #19357: support histogram in filter cardinality estimation

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19357 Can one of the admins verify this patch?

[GitHub] spark issue #19357: [SPARK-21322][SQL][WIP] support histogram in filter card...

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19357 **[Test build #82213 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82213/testReport)** for PR 19357 at commit

[GitHub] spark issue #19327: [SPARK-22136][SS] Implement stream-stream outer joins.

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19327 Merged build finished. Test PASSed.

[GitHub] spark issue #19327: [SPARK-22136][SS] Implement stream-stream outer joins.

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19327 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82211/ Test PASSed. ---

[GitHub] spark pull request #19358: [SPARK-22135] metrics in spark-dispatcher not bei...

2017-09-26 Thread pmackles
GitHub user pmackles opened a pull request: https://github.com/apache/spark/pull/19358 [SPARK-22135] metrics in spark-dispatcher not being registered properly ## What changes were proposed in this pull request? Fix a trivial bug with how metrics are registered in the mesos

[GitHub] spark issue #19358: [SPARK-22135] metrics in spark-dispatcher not being regi...

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19358 Can one of the admins verify this patch?

[GitHub] spark pull request #19351: [SPARK-22127][CORE]The Master Register Applicatio...

2017-09-26 Thread guoxiaolongzte
Github user guoxiaolongzte commented on a diff in the pull request: https://github.com/apache/spark/pull/19351#discussion_r141224687 --- Diff: core/src/main/scala/org/apache/spark/deploy/master/Master.scala --- @@ -265,6 +265,9 @@ private[deploy] class Master( val app

[GitHub] spark issue #19339: [SPARK-22112][PYSPARK] Supports RDD of strings as input ...

2017-09-26 Thread goldmedal
Github user goldmedal commented on the issue: https://github.com/apache/spark/pull/19339 @HyukjinKwon I have updated the title. Thanks!

[GitHub] spark pull request #19338: [SPARK-22123][CORE] Add latest failure reason for...

2017-09-26 Thread caneGuy
Github user caneGuy commented on a diff in the pull request: https://github.com/apache/spark/pull/19338#discussion_r141229038 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala --- @@ -671,8 +671,10 @@ private[spark] class TaskSetManager(

[GitHub] spark issue #19338: [SPARK-22123][CORE] Add latest failure reason for task s...

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19338 **[Test build #82215 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82215/testReport)** for PR 19338 at commit

[GitHub] spark pull request #19339: [SPARK-22112][PYSPARK] Supports RDD of strings as...

2017-09-26 Thread goldmedal
Github user goldmedal commented on a diff in the pull request: https://github.com/apache/spark/pull/19339#discussion_r141232322 --- Diff: python/pyspark/sql/readwriter.py --- @@ -420,7 +425,29 @@ def csv(self, path, schema=None, sep=None, encoding=None, quote=None, escape=Non

[GitHub] spark pull request #19359: [SPARK-22129][SPARK-22138] Release script improve...

2017-09-26 Thread holdenk
GitHub user holdenk opened a pull request: https://github.com/apache/spark/pull/19359 [SPARK-22129][SPARK-22138] Release script improvements ## What changes were proposed in this pull request? Use the GPG_KEY param, fix lsof to non-hardcoded path, remove version swap since

[GitHub] spark pull request #19338: [SPARK-22123][CORE] Add latest failure reason for...

2017-09-26 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/19338#discussion_r141235459 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala --- @@ -670,9 +670,12 @@ private[spark] class TaskSetManager(

[GitHub] spark pull request #17383: [SPARK-3165][MLlib] DecisionTree use sparsity in ...

2017-09-26 Thread facaiy
Github user facaiy closed the pull request at: https://github.com/apache/spark/pull/17383

[GitHub] spark issue #19327: [SPARK-22136][SS] Implement stream-stream outer joins.

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19327 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/82212/ Test PASSed. ---

[GitHub] spark issue #17383: [SPARK-3165][MLlib] DecisionTree use sparsity in data

2017-09-26 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17383 Hi, since this work was done a long time ago, I reviewed it myself. After careful review: since SparseVector is in compressed sparse row format, the only benefit of the PR would be

[GitHub] spark issue #19327: [SPARK-22136][SS] Implement stream-stream outer joins.

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19327 Merged build finished. Test PASSed.

[GitHub] spark issue #19327: [SPARK-22136][SS] Implement stream-stream outer joins.

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19327 **[Test build #82212 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82212/testReport)** for PR 19327 at commit

[GitHub] spark issue #19339: [SPARK-22112][PYSPARK] Add an API to create a DataFrame ...

2017-09-26 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19339 @goldmedal, are you online now? How about changing the PR title to something like "Supports RDD of strings as input in spark.read.csv in PySpark"?

[GitHub] spark issue #19357: [SPARK-21322][SQL][WIP] support histogram in filter card...

2017-09-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19357 Merged build finished. Test FAILed.

[GitHub] spark pull request #19338: [SPARK-22123][CORE] Add latest failure reason for...

2017-09-26 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/19338#discussion_r141226978 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSetManager.scala --- @@ -671,8 +671,10 @@ private[spark] class TaskSetManager(

[GitHub] spark issue #19229: [SPARK-22001][ML][SQL] ImputerModel can do withColumn fo...

2017-09-26 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/19229 ping @gatorsmile for the SQL part.

[GitHub] spark pull request #19168: [SPARK-21956][CORE] Fetch up to max bytes when bu...

2017-09-26 Thread caneGuy
Github user caneGuy commented on a diff in the pull request: https://github.com/apache/spark/pull/19168#discussion_r141228856 --- Diff: core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala --- @@ -439,6 +443,8 @@ final class ShuffleBlockFetcherIterator(

[GitHub] spark pull request #19339: [SPARK-22112][PYSPARK] Supports RDD of strings as...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19339

[GitHub] spark issue #19349: [SPARK-22125][PYSPARK][SQL] Enable Arrow Stream format f...

2017-09-26 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19349 **[Test build #82214 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/82214/testReport)** for PR 19349 at commit

[GitHub] spark issue #19339: [SPARK-22112][PYSPARK] Supports RDD of strings as input ...

2017-09-26 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/19339 I've tested a few times locally and can't reproduce the failure.

[GitHub] spark issue #19339: [SPARK-22112][PYSPARK] Supports RDD of strings as input ...

2017-09-26 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19339 Merged to master.

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-09-26 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 Hi, @WeichenXu123. As @srowen said, the benefit of this would be speed at predict time or model storage. Hence I'm not sure whether a benchmark is really needed for the PR.

[GitHub] spark issue #19339: [SPARK-22112][PYSPARK] Add an API to create a DataFrame ...

2017-09-26 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19339 In a quick look, both tests failures: ``` File "/home/jenkins/workspace/SparkPullRequestBuilder/python/pyspark/sql/readwriter.py", line 303, in parquet return

[GitHub] spark pull request #19357: support histogram in filter cardinality estimatio...

2017-09-26 Thread ron8hu
GitHub user ron8hu opened a pull request: https://github.com/apache/spark/pull/19357 support histogram in filter cardinality estimation ## What changes were proposed in this pull request? Histogram is effective in dealing with skewed distribution. After we generate

[GitHub] spark pull request #19335: mapPartitions Api

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19335

[GitHub] spark pull request #19348: [BUILD] Close stale PRs

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19348

[GitHub] spark pull request #19347: Branch 2.2 sparkmlib's output of many algorithms ...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19347

[GitHub] spark pull request #19244: SPARK-22021

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19244

[GitHub] spark pull request #18978: [SPARK-21737][YARN]Create communication channel b...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18978

[GitHub] spark pull request #18474: [SPARK-21235][TESTS] UTest should clear temp resu...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18474

[GitHub] spark pull request #18253: [SPARK-18838][CORE] Introduce multiple queues in ...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18253

[GitHub] spark pull request #19295: [SPARK-22080][SQL] Adds support for allowing user...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19295

[GitHub] spark pull request #19356: Merge pull request #1 from apache/master

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19356

[GitHub] spark pull request #13794: [SPARK-15574][ML][PySpark] Python meta-algorithms...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/13794

[GitHub] spark pull request #19300: [SPARK-22082][SparkR]Spelling mistake: "choosen" ...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19300

[GitHub] spark pull request #18897: [SPARK-21655][YARN] Support Kill CLI for Yarn mod...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18897

[GitHub] spark pull request #19334: Branch 1.6

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19334

[GitHub] spark pull request #15009: [SPARK-17443][SPARK-11035] Stop Spark Application...

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/15009

[GitHub] spark pull request #19315: [MINOR][ML]Updated english.txt word ordering

2017-09-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19315
