[GitHub] spark pull request #22011: [WIP][SPARK-24822][PySpark] Python support for ba...

2018-08-07 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/22011#discussion_r208122669 --- Diff: python/pyspark/rdd.py --- @@ -2429,6 +2441,29 @@ def _wrap_function(sc, func, deserializer, serializer, profiler=None

[GitHub] spark pull request #21977: SPARK-25004: Add spark.executor.pyspark.memory li...

2018-08-07 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21977#discussion_r208120759 --- Diff: core/src/main/scala/org/apache/spark/api/python/PythonRunner.scala --- @@ -60,14 +61,26 @@ private[spark] object PythonEvalType

[GitHub] zeppelin issue #3112: Branch 0.8

2018-08-07 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3112 please close this @GioCorti80 ---

[GitHub] spark issue #21988: [SPARK-25003][PYSPARK][BRANCH-2.2] Use SessionExtensions...

2018-08-07 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21988 we always open against master and backport if agreed upon. this is documented here https://spark.apache.org/contributing.html

[GitHub] zeppelin issue #3107: [ZEPPELIN-3646] Add note for updating user permissions

2018-08-07 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3107 I think the advantage of having a script separate from demo is that it can be ACL'd properly - user won't have access etc. ---

[GitHub] zeppelin issue #3092: [ZEPPELIN-3653] - New Java interpreter

2018-08-07 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3092 so we need to open a JIRA then? ---

[GitHub] zeppelin issue #3054: ZEPPELIN-3569. Improvement of FlinkInterpreter

2018-08-07 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3054 this comment? https://github.com/apache/zeppelin/pull/3054#discussion_r200825969 it's in every file touched ---

zeppelin git commit: [ZEPPELIN-3665] fix notebook name

2018-08-05 Thread felixcheung
Repository: zeppelin Updated Branches: refs/heads/master 2eac6872e -> cbf222848 [ZEPPELIN-3665] fix notebook name ### What is this PR for? Name of notebook incorrectly displayed due to text direction. Bug Fix ### What is the Jira issue?

[GitHub] spark pull request #22001: [SPARK-24819][CORE] Fail fast when no enough slot...

2018-08-05 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/22001#discussion_r207745122 --- Diff: resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosFineGrainedSchedulerBackend.scala --- @@ -453,4 +453,8

[GitHub] spark pull request #22001: [SPARK-24819][CORE] Fail fast when no enough slot...

2018-08-05 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/22001#discussion_r207745108 --- Diff: resource-managers/mesos/src/main/scala/org/apache/spark/scheduler/cluster/mesos/MesosFineGrainedSchedulerBackend.scala --- @@ -453,4 +453,8

[GitHub] spark pull request #22001: [SPARK-24819][CORE] Fail fast when no enough slot...

2018-08-05 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/22001#discussion_r207745157 --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala --- @@ -496,6 +496,12 @@ class

[GitHub] spark pull request #21977: SPARK-25004: Add spark.executor.pyspark.memory li...

2018-08-05 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21977#discussion_r207726897 --- Diff: python/pyspark/worker.py --- @@ -259,6 +260,26 @@ def main(infile, outfile): "PYSPARK_DRIVER_P

[GitHub] spark pull request #21977: SPARK-25004: Add spark.executor.pyspark.memory li...

2018-08-05 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21977#discussion_r207726903 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala --- @@ -133,10 +133,17 @@ private[yarn] class

[GitHub] spark issue #20838: [SPARK-23698] Resolve undefined names in Python 3

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/20838 jenkins, retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e

[GitHub] spark issue #20838: [SPARK-23698] Resolve undefined names in Python 3

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/20838 looks to me everything passes --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e

[GitHub] spark pull request #21996: [SPARK-24888][CORE] spark-submit --master spark:/...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21996#discussion_r207717051 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -98,17 +98,24 @@ private[spark] class SparkSubmit extends Logging

[GitHub] spark issue #21996: [SPARK-24888][CORE] spark-submit --master spark://host:p...

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21996 I think we generally describe the change in PR title. what user see you can put as JIRA title. --- - To unsubscribe, e-mail

[GitHub] spark issue #21977: SPARK-25004: Add spark.executor.pyspark.memory limit.

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21977 build error ``` [error] /home/jenkins/workspace/SparkPullRequestBuilder@2/sql/core/src/main/scala/org/apache/spark/sql/execution/python/ArrowEvalPythonExec.scala:88: method sparkContext

[GitHub] spark pull request #21977: SPARK-25004: Add spark.executor.pyspark.memory li...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21977#discussion_r207716854 --- Diff: python/pyspark/worker.py --- @@ -259,6 +260,26 @@ def main(infile, outfile): "PYSPARK_DRIVER_P

[GitHub] spark pull request #21977: SPARK-25004: Add spark.executor.pyspark.memory li...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21977#discussion_r207716877 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala --- @@ -133,10 +133,17 @@ private[yarn] class

[GitHub] spark pull request #21977: SPARK-25004: Add spark.executor.pyspark.memory li...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21977#discussion_r207716893 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocator.scala --- @@ -133,10 +133,17 @@ private[yarn] class

[GitHub] spark pull request #21977: SPARK-25004: Add spark.executor.pyspark.memory li...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21977#discussion_r207716813 --- Diff: python/pyspark/worker.py --- @@ -259,6 +260,26 @@ def main(infile, outfile): "PYSPARK_DRIVER_P

[GitHub] spark pull request #21997: [SPARK-24987][SS] - Fix Kafka consumer leak when ...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21997#discussion_r207716713 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSourceRDD.scala --- @@ -124,8 +124,6 @@ private[kafka010] class

[GitHub] spark issue #21997: [SPARK-24987][SS] - Fix Kafka consumer leak when no new ...

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21997 ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] spark issue #21997: [SPARK-24987][SS] - Fix Kafka consumer leak when no new ...

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21997 @koeninger back --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews

[GitHub] spark issue #21987: [SPARK-25015][BUILD] Update Hadoop 2.7 to 2.7.7

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21987 one hive suite failed, is that related to this change? HiveExternalCatalogVersionsSuite ``` [info] 2018-08-03 17:14:26.108 - stdout> 17:14:26.107 W

[GitHub] spark pull request #21996: [SPARK-24888][CORE] spark-submit --master spark:/...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21996#discussion_r207702200 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -98,17 +98,24 @@ private[spark] class SparkSubmit extends Logging

[GitHub] spark issue #21988: [SPARK-25003][PYSPARK] Use SessionExtensions in Pyspark

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21988 why do we need multiple PRs? typically we do that only then the change is non-trivial or cannot be backported by cherry-picking

[GitHub] spark issue #21983: [SPARK-24987][SS] - Fix Kafka consumer leak when no new ...

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21983 @YuvalItzchakov you should open the PR against master - it can be picked to release branch (eg. 2.3) when merged

[GitHub] spark pull request #21991: [SPARK-25018] [Infra] Use `Co-authored-by` and `S...

2018-08-04 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21991#discussion_r207701760 --- Diff: dev/merge_spark_pr.py --- @@ -154,20 +154,22 @@ def merge_pr(pr_num, target_ref, title, body, pr_repo_desc): # to people every

[GitHub] spark issue #21991: [SPARK-25018] [Infra] Use `Co-authored-by` and `Signed-o...

2018-08-04 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21991 Jenkins, retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e

[GitHub] spark issue #21953: [SPARK-24992][Core] spark should randomize yarn local di...

2018-08-02 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21953 Jenkins, test this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #21006: [SPARK-22256][MESOS] - Introduce spark.mesos.driver.memo...

2018-08-02 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21006 @pmackles ? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] zeppelin issue #3066: [ZEPPELIN-3618] ZeppelinContext methods z.run and z.ru...

2018-08-02 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3066 looks like everything passed https://travis-ci.org/TinkoffCreditSystems/zeppelin/builds/407590408 ---

[GitHub] zeppelin issue #3092: [ZEPPELIN-3653] - New Java interpreter

2018-08-02 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3092 right. just want to clarify that it is perfectly fine to run Scala, Python etc without Spark. The session context is maintained in the REPL implementation and not in SparkContext

[GitHub] spark issue #21546: [SPARK-23030][SQL][PYTHON] Use Arrow stream format for c...

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21546 so... can we still target this for 2.4? sounds like very good to have and complimentary to hydrogen --- - To unsubscribe

[GitHub] spark issue #21516: [SPARK-24501][MESOS] Add Dispatcher and Driver metrics

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21516 Jenkins, retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e

[GitHub] spark issue #21923: [SPARK-24918][Core] Executor Plugin api

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21923 this https://github.com/squito/spark-memory --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark issue #21909: [SPARK-24959][SQL] Speed up count() for JSON and CSV

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21909 got it --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] zeppelin issue #3099: [ZEPPELIN-3665] fix notebook name

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3099 cool merging if no more comment ---

[GitHub] zeppelin issue #3103: ZEPPELIN-3666. Use zeppelin.interpreter.default to rep...

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3103 I see, would be good to update the title - I think that goes into the actual commit ---

[GitHub] zeppelin issue #3102: [WIP][ZEPPELIN-3671] Add info about running interprete...

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3102 cool! ---

[GitHub] spark issue #21930: [SPARK-14540][Core] Fix remaining major issues for Scala...

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21930 I think that's binary-incompatible breaking API change, right? ex. https://github.com/apache/spark/pull/21930/files#diff-2b8f0f66fe5397b169d0f754e99da8d5R64

[GitHub] spark pull request #21936: [SPARK-24981][Core] ShutdownHook timeout causes j...

2018-08-01 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21936#discussion_r206769869 --- Diff: core/src/main/scala/org/apache/spark/SparkContext.scala --- @@ -571,7 +571,12 @@ class SparkContext(config: SparkConf) extends Logging

[GitHub] spark pull request #21936: [SPARK-24981][Core] ShutdownHook timeout causes j...

2018-08-01 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21936#discussion_r206770131 --- Diff: core/src/main/scala/org/apache/spark/SparkContext.scala --- @@ -571,7 +571,12 @@ class SparkContext(config: SparkConf) extends Logging

[GitHub] zeppelin issue #3092: [ZEPPELIN-3653] - New Java interpreter

2018-08-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3092 no, I mean paragraph 1 ``` %java Map counts = new HashMap<>(); ``` paragraph 2 ``` %java counts.put("key", 1); ``` ---

[GitHub] spark pull request #21928: [SPARK-24976][PYTHON] Allow None for Decimal type...

2018-07-30 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21928#discussion_r206401496 --- Diff: python/pyspark/serializers.py --- @@ -236,6 +237,11 @@ def create_array(s, t): # TODO: need decode before converting to Arrow

[GitHub] spark pull request #21928: [SPARK-24976][PYTHON] Allow None for Decimal type...

2018-07-30 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21928#discussion_r206401644 --- Diff: python/pyspark/serializers.py --- @@ -236,6 +237,11 @@ def create_array(s, t): # TODO: need decode before converting to Arrow

[GitHub] spark issue #21928: [SPARK-24976][PYTHON] Allow None for Decimal type conver...

2018-07-30 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21928 yea, it doesn't seem very useful to ping matei on every single PR ;) --- - To unsubscribe, e-mail: reviews-unsubscr

[GitHub] spark pull request #21909: [SPARK-24959][SQL] Speed up count() for JSON and ...

2018-07-30 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21909#discussion_r206400571 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/csv/CSVBenchmarks.scala --- @@ -119,8 +119,47 @@ object CSVBenchmarks

[GitHub] zeppelin issue #3092: [ZEPPELIN-3653] - New Java interpreter

2018-07-30 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3092 did you add the behavior in doc? ---

[GitHub] spark issue #21928: [SPARK-24976][PYTHON] Allow None for Decimal type conver...

2018-07-30 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21928 I wonder if we could tune the bot suggestions to more recent contributions/contributors? --- - To unsubscribe, e-mail

[GitHub] spark issue #21905: [SPARK-24956][Build][test-maven] Upgrade maven version t...

2018-07-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21905 hmm, interesting suggestions https://github.com/apache/spark/pull/21905#issuecomment-408580441 --- - To unsubscribe, e-mail

[GitHub] spark pull request #21902: [SPARK-24952][SQL] Support LZMA2 compression by A...

2018-07-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21902#discussion_r205933175 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -1449,6 +1451,16 @@ object SQLConf { .intConf

[GitHub] spark issue #20028: [SPARK-19053][ML]Supporting multiple evaluation metrics ...

2018-07-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/20028 this shouldn't say version 2.3.0? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands

[GitHub] spark issue #20272: [SPARK-23078] [CORE] [K8s] allow Spark Thrift Server to ...

2018-07-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/20272 still need to run tests https://github.com/apache/spark/pull/20272#pullrequestreview-108271893 --- - To unsubscribe, e

[GitHub] spark pull request #21748: [SPARK-23146][K8S] Support client mode.

2018-07-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21748#discussion_r205932789 --- Diff: resource-managers/kubernetes/integration-tests/src/test/scala/org/apache/spark/deploy/k8s/integrationtest/ClientModeTestsSuite.scala

[GitHub] zeppelin issue #3099: [ZEPPELIN-3665] fix notebook name

2018-07-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3099 it might help if you include a before / after screen shots? ---

[GitHub] spark issue #21516: [SPARK-24501][MESOS] Add Dispatcher and Driver metrics

2018-07-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21516 Jenkins, retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e

[GitHub] spark issue #21884: k8s: explicitly expose ports on driver container

2018-07-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21884 can you update the format of the title and description as described here "Pull Request" in https://spark.apache.org/contrib

[GitHub] zeppelin issue #3090: [ZEPPELIN-3645] Add LSP Protocol completion support

2018-07-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3090 open ip and port to connect to has become a huge problem recently, so unless LSP has some sort of authentication story, my vote would be "no" even if this is disabled by defaul

[GitHub] spark issue #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR on K8s

2018-07-26 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21584 SPARK-24908 went in, it should unblock R? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #21835: [SPARK-24779]Add sequence / map_concat / map_from_entrie...

2018-07-26 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21835 and my comment https://github.com/apache/spark/pull/21835#discussion_r20541 --- - To unsubscribe, e-mail: reviews

[GitHub] spark pull request #21835: [SPARK-24779]Add sequence / map_concat / map_from...

2018-07-26 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21835#discussion_r205355152 --- Diff: R/pkg/R/functions.R --- @@ -3320,7 +3321,7 @@ setMethod("explode", #' @aliases sequence sequence,Column-method #' @not

[GitHub] spark pull request #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR ...

2018-07-26 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21584#discussion_r205354798 --- Diff: resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/submit/KubernetesDriverBuilder.scala --- @@ -71,8 +75,9

[GitHub] spark pull request #21835: [SPARK-24779]Add sequence / map_concat / map_from...

2018-07-25 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21835#discussion_r20541 --- Diff: R/pkg/tests/fulltests/test_context.R --- @@ -21,10 +21,11 @@ test_that("Check masked functions", { # Check that we are not m

[GitHub] spark issue #21811: [SPARK-24801][CORE] Avoid memory waste by empty byte[] a...

2018-07-24 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21811 SGTM --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark pull request #21748: [SPARK-23146][K8S] Support client mode.

2018-07-24 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21748#discussion_r204635755 --- Diff: docs/running-on-kubernetes.md --- @@ -117,6 +117,45 @@ If the local proxy is running at localhost:8001, `--master k8s://http://127.0.0

[GitHub] spark pull request #21748: [SPARK-23146][K8S] Support client mode.

2018-07-24 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21748#discussion_r204635584 --- Diff: docs/running-on-kubernetes.md --- @@ -117,6 +117,45 @@ If the local proxy is running at localhost:8001, `--master k8s://http://127.0.0

[GitHub] spark pull request #21748: [SPARK-23146][K8S] Support client mode.

2018-07-24 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21748#discussion_r204636048 --- Diff: resource-managers/kubernetes/core/src/main/scala/org/apache/spark/scheduler/cluster/k8s/KubernetesClusterManager.scala --- @@ -35,26 +35,39

[GitHub] spark pull request #21546: [SPARK-23030][SQL][PYTHON] Use Arrow stream forma...

2018-07-24 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21546#discussion_r204635055 --- Diff: python/pyspark/serializers.py --- @@ -184,27 +184,67 @@ def loads(self, obj): raise NotImplementedError -class

[GitHub] spark issue #21788: [SPARK-24609][ML][DOC] PySpark/SparkR doc doesn't explai...

2018-07-24 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21788 what's changed? it was forced pushed so I can't tell what's updated --- - To unsubscribe, e-mail: reviews-unsubscr

[GitHub] spark pull request #21835: [SPARK-24779]Add sequence / map_concat / map_from...

2018-07-24 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21835#discussion_r204633191 --- Diff: R/pkg/R/functions.R --- @@ -1986,15 +1998,20 @@ setMethod("levenshtein", signature(y = "Column"), #' are on t

[GitHub] spark pull request #21835: [SPARK-24779]Add sequence / map_concat / map_from...

2018-07-24 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21835#discussion_r204633464 --- Diff: R/pkg/tests/fulltests/test_context.R --- @@ -21,10 +21,11 @@ test_that("Check masked functions", { # Check that we are not m

[GitHub] spark issue #21118: SPARK-23325: Use InternalRow when reading with DataSourc...

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21118 so where are we on this? looks like we have 2 LGTM? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark issue #21075: [SPARK-23988][MESOS] Improve handling of appResource in ...

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21075 @tnachen @susanxhuynh @mgummelt @skonto --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #21516: [SPARK-24501][MESOS] Add Dispatcher and Driver metrics

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21516 @tnachen @susanxhuynh @mgummelt @skonto --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #21516: [SPARK-24501][MESOS] Add Dispatcher and Driver metrics

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21516 Jenkins, ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] zeppelin issue #3035: [ZEPPELIN-3553] Fix URLs on "Multi-user Support" page

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3035 we don't have a 0.9 branch or plan yet. should we backport to 0.8? are we having another release, like 0.8.1? ---

[GitHub] zeppelin issue #3035: [ZEPPELIN-3553] Fix URLs on "Multi-user Support" page

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3035 I don't this is ported to 0.8 https://github.com/apache/zeppelin/commits/branch-0.8 ---

[GitHub] zeppelin issue #3066: [ZEPPELIN-3618] ZeppelinContext methods z.run and z.ru...

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3066 can you comment on my comment: `this doesn't seem quite right, we are ignoring all IOException from running the paragraph?`? ---

[GitHub] spark issue #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR on K8s

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21584 I see. you are referring to https://github.com/apache/spark/pull/21584#issuecomment-406091994 got

[GitHub] spark issue #21517: Testing k8s change - please ignore (13)

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21517 I think we can close this --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR on K8s

2018-07-22 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21584 looks like test failed ``` - Run SparkR on simple dataframe.R example *** FAILED *** The code passed to eventually never returned normally. Attempted 70 times over

[GitHub] spark pull request #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR ...

2018-07-22 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21584#discussion_r204229514 --- Diff: bin/docker-image-tool.sh --- @@ -79,11 +80,16 @@ function build { docker build $NOCACHEARG "${BINDING_BUILD_ARGS[@]}" \

[GitHub] spark pull request #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR ...

2018-07-22 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21584#discussion_r204229713 --- Diff: resource-managers/kubernetes/docker/src/main/dockerfiles/spark/bindings/R/Dockerfile --- @@ -0,0 +1,29 @@ +# +# Licensed

[GitHub] spark pull request #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR ...

2018-07-22 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21584#discussion_r204229663 --- Diff: resource-managers/kubernetes/core/src/main/scala/org/apache/spark/deploy/k8s/submit/KubernetesDriverBuilder.scala --- @@ -71,8 +75,9

[GitHub] spark pull request #21584: [SPARK-24433][K8S] Initial R Bindings for SparkR ...

2018-07-22 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21584#discussion_r204229518 --- Diff: bin/docker-image-tool.sh --- @@ -79,11 +80,16 @@ function build { docker build $NOCACHEARG "${BINDING_BUILD_ARGS[@]}" \

[GitHub] zeppelin issue #3079: ZEPPELIN-3635. Drop support for spark before 1.6

2018-07-17 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3079 looks like we don't test older spark versions.. ---

[GitHub] zeppelin issue #3068: [ZEPPELIN-3438] Passing Z variables to BigQuery

2018-07-17 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/zeppelin/pull/3068 do we really need this for all interpreters though? for those we don't have active maintainer it might better not to change them ---

[GitHub] spark issue #20146: [SPARK-11215][ML] Add multiple columns support to String...

2018-07-17 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/20146 it's odd appveyer tests are not getting triggered. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark pull request #21710: [SPARK-24207][R]add R API for PrefixSpan

2018-07-17 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21710#discussion_r203245118 --- Diff: R/pkg/R/mllib_fpm.R --- @@ -154,3 +160,74 @@ setMethod("write.ml", signature(object = "FPGrowthModel"

[GitHub] spark pull request #21710: [SPARK-24207][R]add R API for PrefixSpan

2018-07-17 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21710#discussion_r203245010 --- Diff: R/pkg/R/generics.R --- @@ -1415,6 +1415,13 @@ setGeneric("spark.freqItemsets", function(object) { standardGeneric(&q

[GitHub] spark issue #21542: [SPARK-24529][Build][test-maven] Add spotbugs into maven...

2018-07-17 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21542 "permission" stuff might be Java 9 related? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.

spark git commit: [TRIVIAL][ML] GMM unpersist RDD after training

2018-07-15 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master bbc2ffc8a -> bcf7121ed [TRIVIAL][ML] GMM unpersist RDD after training ## What changes were proposed in this pull request? unpersist `instances` after training ## How was this patch tested? existing tests Author: 郑瑞峰 Closes #21562

[GitHub] spark issue #21562: [Trivial][ML] GMM unpersist RDD after training

2018-07-15 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/21562 sure, LGTM. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] spark pull request #21758: [SPARK-24795][CORE] Implement barrier execution m...

2018-07-15 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21758#discussion_r202535313 --- Diff: core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala --- @@ -1386,29 +1418,90 @@ class DAGScheduler

[GitHub] spark pull request #21758: [SPARK-24795][CORE] Implement barrier execution m...

2018-07-15 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21758#discussion_r202533650 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala --- @@ -359,17 +368,49 @@ private[spark] class TaskSchedulerImpl

[GitHub] spark pull request #21758: [SPARK-24795][CORE] Implement barrier execution m...

2018-07-15 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21758#discussion_r202533477 --- Diff: core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala --- @@ -1386,29 +1418,90 @@ class DAGScheduler

[GitHub] spark pull request #21758: [SPARK-24795][CORE] Implement barrier execution m...

2018-07-15 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/21758#discussion_r202533903 --- Diff: core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala --- @@ -359,17 +368,49 @@ private[spark] class TaskSchedulerImpl

<    1   2   3   4   5   6   7   8   9   10   >