[GitHub] spark issue #16441: [SPARK-14975][ML] Fixed GBTClassifier to predict probabi...

2017-07-14 Thread MLnick
Github user MLnick commented on the issue: https://github.com/apache/spark/pull/16441 Should be in 2.2.0 On Sat, 15 Jul 2017 at 07:54, yonglyhoo wrote: > In which release this fix is going to be available? Thanks! > > — > You are

[GitHub] spark issue #16441: [SPARK-14975][ML] Fixed GBTClassifier to predict probabi...

2017-07-14 Thread yonglyhoo
Github user yonglyhoo commented on the issue: https://github.com/apache/spark/pull/16441 In which release this fix is going to be available? Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #18618: [SPARK-20090][PYTHON] Add StructType.fieldNames in PySpa...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18618 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18618: [SPARK-20090][PYTHON] Add StructType.fieldNames in PySpa...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18618 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79631/ Test PASSed. ---

[GitHub] spark issue #18618: [SPARK-20090][PYTHON] Add StructType.fieldNames in PySpa...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18618 **[Test build #79631 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79631/testReport)** for PR 18618 at commit

[GitHub] spark pull request #18618: [SPARK-20090][PYTHON] Add StructType.fieldNames i...

2017-07-14 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/18618#discussion_r127576287 --- Diff: python/pyspark/sql/types.py --- @@ -562,6 +562,16 @@ def jsonValue(self): def fromJson(cls, json): return

[GitHub] spark issue #18618: [SPARK-20090][PYTHON] Add StructType.fieldNames in PySpa...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18618 **[Test build #79631 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79631/testReport)** for PR 18618 at commit

[GitHub] spark issue #18618: [SPARK-20090][PYTHON] Add StructType.fieldNames in PySpa...

2017-07-14 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/18618 Either way is fine to me. Let me update this to return a list. I was just thinking struct/row are a tuple-like and the output for this could be as so. --- If your project is set up for it, you

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-07-14 Thread kiszk
Github user kiszk commented on the issue: https://github.com/apache/spark/pull/17980 If new test cases works for existing orc component, how about updating test cases at first? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark pull request #18571: [SPARK-21344][SQL] BinaryType comparison does sig...

2017-07-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18571 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #18571: [SPARK-21344][SQL] BinaryType comparison does signed byt...

2017-07-14 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/18571 Thanks! Merging to master/2.2/2.1/2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18640: [SPARK-21422][BUILD] Depend on Apache ORC 1.4.0

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18640 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18640: [SPARK-21422][BUILD] Depend on Apache ORC 1.4.0

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18640 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79627/ Test PASSed. ---

[GitHub] spark issue #18640: [SPARK-21422][BUILD] Depend on Apache ORC 1.4.0

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18640 **[Test build #79627 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79627/testReport)** for PR 18640 at commit

[GitHub] spark issue #18633: [SPARK-21411][YARN] Lazily create FS within kerberized U...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18633 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18633: [SPARK-21411][YARN] Lazily create FS within kerberized U...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18633 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79626/ Test PASSed. ---

[GitHub] spark issue #18633: [SPARK-21411][YARN] Lazily create FS within kerberized U...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18633 **[Test build #79626 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79626/testReport)** for PR 18633 at commit

[GitHub] spark issue #18639: [SPARK-21408][core] Better default number of RPC dispatc...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18639 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79624/ Test PASSed. ---

[GitHub] spark issue #18639: [SPARK-21408][core] Better default number of RPC dispatc...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18639 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18639: [SPARK-21408][core] Better default number of RPC dispatc...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18639 **[Test build #79624 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79624/testReport)** for PR 18639 at commit

[GitHub] spark issue #18281: [SPARK-21027][ML][PYTHON] Added tunable parallelism to o...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18281 **[Test build #79630 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79630/testReport)** for PR 18281 at commit

[GitHub] spark issue #18281: [SPARK-21027][ML][PYTHON] Added tunable parallelism to o...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18281 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79630/ Test FAILed. ---

[GitHub] spark issue #18281: [SPARK-21027][ML][PYTHON] Added tunable parallelism to o...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18281 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18281: [SPARK-21027][ML][PYTHON] Added tunable parallelism to o...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18281 **[Test build #79630 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79630/testReport)** for PR 18281 at commit

[GitHub] spark issue #18637: [SPARK-15526][ML][FOLLOWUP][test-maven] Make JPMML provi...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18637 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79621/ Test PASSed. ---

[GitHub] spark issue #18637: [SPARK-15526][ML][FOLLOWUP][test-maven] Make JPMML provi...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18637 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18637: [SPARK-15526][ML][FOLLOWUP][test-maven] Make JPMML provi...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18637 **[Test build #79621 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79621/testReport)** for PR 18637 at commit

[GitHub] spark issue #18487: [SPARK-21243][Core] Limit no. of map outputs in a shuffl...

2017-07-14 Thread jinxing64
Github user jinxing64 commented on the issue: https://github.com/apache/spark/pull/18487 `maxReqsInFlight` and `maxBytesInFlight` is hard to control the # of blocks in a single request. When # of map is very high, this change can alleviate the pressure of shuffle server. @dhruve

[GitHub] spark issue #18571: [SPARK-21344][SQL] BinaryType comparison does signed byt...

2017-07-14 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/18571 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #18633: [SPARK-21411][YARN] Lazily create FS within kerbe...

2017-07-14 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/18633#discussion_r127565024 --- Diff: core/src/main/scala/org/apache/spark/deploy/security/HadoopDelegationTokenManager.scala --- @@ -42,7 +42,7 @@ import

[GitHub] spark issue #16571: [SPARK-19208][ML][WIP] MaxAbsScaler and MinMaxScaler are...

2017-07-14 Thread WeichenXu123
Github user WeichenXu123 commented on the issue: https://github.com/apache/spark/pull/16571 This PR is very similar to my early PR. Is that right? @jkbradley #14950 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark pull request #18630: [SPARK-12559][SPARK SUBMIT] fix --packages for st...

2017-07-14 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/18630#discussion_r127562278 --- Diff: core/src/main/scala/org/apache/spark/deploy/worker/DriverWrapper.scala --- @@ -66,4 +75,50 @@ object DriverWrapper {

[GitHub] spark pull request #18630: [SPARK-12559][SPARK SUBMIT] fix --packages for st...

2017-07-14 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/18630#discussion_r127561962 --- Diff: core/src/main/scala/org/apache/spark/deploy/SparkSubmit.scala --- @@ -473,6 +474,12 @@ object SparkSubmit extends CommandLineUtils {

[GitHub] spark pull request #18630: [SPARK-12559][SPARK SUBMIT] fix --packages for st...

2017-07-14 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/18630#discussion_r127563323 --- Diff: core/src/main/scala/org/apache/spark/deploy/worker/DriverWrapper.scala --- @@ -66,4 +75,50 @@ object DriverWrapper {

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18616 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18616 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79629/ Test PASSed. ---

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18616 **[Test build #79629 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79629/testReport)** for PR 18616 at commit

[GitHub] spark pull request #18633: [SPARK-21411][YARN] Lazily create FS within kerbe...

2017-07-14 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/18633#discussion_r127561224 --- Diff: core/src/main/scala/org/apache/spark/deploy/security/HadoopFSDelegationTokenProvider.scala --- @@ -45,11 +45,11 @@ private[deploy] class

[GitHub] spark pull request #18633: [SPARK-21411][YARN] Lazily create FS within kerbe...

2017-07-14 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/18633#discussion_r127562035 --- Diff: core/src/main/scala/org/apache/spark/deploy/security/HadoopDelegationTokenManager.scala --- @@ -42,7 +42,7 @@ import

[GitHub] spark issue #18628: [SPARK-18061][ThriftServer] Add spnego auth support for ...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18628 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79628/ Test PASSed. ---

[GitHub] spark issue #18630: [SPARK-12559][SPARK SUBMIT] fix --packages for stand-alo...

2017-07-14 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/18630 Are you trying to support `--packages` in standalone cluster? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #18628: [SPARK-18061][ThriftServer] Add spnego auth support for ...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18628 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18628: [SPARK-18061][ThriftServer] Add spnego auth support for ...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18628 **[Test build #79628 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79628/testReport)** for PR 18628 at commit

[GitHub] spark issue #18607: [SPARK-21362][SQL][Adding Apache Drill JDBC Dialect]

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18607 **[Test build #3842 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3842/testReport)** for PR 18607 at commit

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18616 **[Test build #79629 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79629/testReport)** for PR 18616 at commit

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/18616 Thanks @vanzin for your review, I will update it soon. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark issue #18428: [Spark-21221][ML] CrossValidator and TrainValidationSpli...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18428 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79623/ Test PASSed. ---

[GitHub] spark issue #18428: [Spark-21221][ML] CrossValidator and TrainValidationSpli...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18428 **[Test build #79623 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79623/testReport)** for PR 18428 at commit

[GitHub] spark issue #18628: [SPARK-18061][ThriftServer] Add spnego auth support for ...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18628 **[Test build #79628 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79628/testReport)** for PR 18628 at commit

[GitHub] spark issue #18428: [Spark-21221][ML] CrossValidator and TrainValidationSpli...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18428 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #18513: [SPARK-13969][ML] Add FeatureHasher transformer

2017-07-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/18513#discussion_r127558429 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/FeatureHasher.scala --- @@ -0,0 +1,185 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18513: [SPARK-13969][ML] Add FeatureHasher transformer

2017-07-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/18513#discussion_r127554746 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/FeatureHasher.scala --- @@ -0,0 +1,185 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18513: [SPARK-13969][ML] Add FeatureHasher transformer

2017-07-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/18513#discussion_r127555147 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/FeatureHasher.scala --- @@ -0,0 +1,185 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18513: [SPARK-13969][ML] Add FeatureHasher transformer

2017-07-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/18513#discussion_r127498459 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/FeatureHasher.scala --- @@ -0,0 +1,185 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18513: [SPARK-13969][ML] Add FeatureHasher transformer

2017-07-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/18513#discussion_r127557871 --- Diff: mllib/src/test/scala/org/apache/spark/ml/feature/FeatureHasherSuite.scala --- @@ -0,0 +1,193 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18513: [SPARK-13969][ML] Add FeatureHasher transformer

2017-07-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/18513#discussion_r127491688 --- Diff: mllib/src/test/scala/org/apache/spark/ml/feature/FeatureHasherSuite.scala --- @@ -0,0 +1,193 @@ +/* + * Licensed to the Apache Software

[GitHub] spark issue #18428: [Spark-21221][ML] CrossValidator and TrainValidationSpli...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18428 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79622/ Test PASSed. ---

[GitHub] spark issue #18428: [Spark-21221][ML] CrossValidator and TrainValidationSpli...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18428 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18428: [Spark-21221][ML] CrossValidator and TrainValidationSpli...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18428 **[Test build #79622 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79622/testReport)** for PR 18428 at commit

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread ajaysaini725
Github user ajaysaini725 commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127558107 --- Diff: python/pyspark/ml/tests.py --- @@ -1229,11 +1229,30 @@ def test_output_columns(self): (2.0,

[GitHub] spark issue #18640: [SPARK-21422][BUILD] Depend on Apache ORC 1.4.0

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18640 **[Test build #79627 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79627/testReport)** for PR 18640 at commit

[GitHub] spark issue #18640: [SPARK-21422][BUILD] Depend on Apache ORC 1.4.0

2017-07-14 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/18640 This aims to reduce the review scope for #17980 . cc @kiszk . --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread ajaysaini725
Github user ajaysaini725 commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127557890 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/OneVsRestSuite.scala --- @@ -101,6 +101,50 @@ class OneVsRestSuite extends

[GitHub] spark pull request #18640: [SPARK-21422][BUILD] Depend on Apache ORC 1.4.0

2017-07-14 Thread dongjoon-hyun
GitHub user dongjoon-hyun opened a pull request: https://github.com/apache/spark/pull/18640 [SPARK-21422][BUILD] Depend on Apache ORC 1.4.0 ## What changes were proposed in this pull request? Like Parquet, this PR aims to depend on the latest Apache ORC 1.4 for Apache

[GitHub] spark pull request #18616: [SPARK-21377][YARN] Make jars specify with --jars...

2017-07-14 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/18616#discussion_r127556964 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala --- @@ -438,6 +441,24 @@ private[spark] class

[GitHub] spark pull request #18616: [SPARK-21377][YARN] Make jars specify with --jars...

2017-07-14 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/18616#discussion_r127557008 --- Diff: resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala --- @@ -438,6 +441,24 @@ private[spark] class

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-07-14 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/17980 Hi, @kiszk . I will start with `Adding Apache ORC dependency (pom and dependency changes)` in [SPARK-21422](https://issues.apache.org/jira/browse/SPARK-21422) first. --- If your

[GitHub] spark pull request #18444: [SPARK-16542][SQL][PYSPARK] Fix bugs about types ...

2017-07-14 Thread zasdfgbnm
Github user zasdfgbnm commented on a diff in the pull request: https://github.com/apache/spark/pull/18444#discussion_r127555013 --- Diff: core/src/main/scala/org/apache/spark/api/python/SerDeUtil.scala --- @@ -57,11 +57,11 @@ private[spark] object SerDeUtil extends Logging {

[GitHub] spark issue #18444: [SPARK-16542][SQL][PYSPARK] Fix bugs about types that re...

2017-07-14 Thread zasdfgbnm
Github user zasdfgbnm commented on the issue: https://github.com/apache/spark/pull/18444 I updated my code according to @HyukjinKwon's suggestion --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark issue #18633: [SPARK-21411][YARN] Lazily create FS within kerberized U...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18633 **[Test build #79626 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79626/testReport)** for PR 18633 at commit

[GitHub] spark issue #18633: [SPARK-21411][YARN] Lazily create FS within kerberized U...

2017-07-14 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/18633 Jenkins, retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18616 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79625/ Test PASSed. ---

[GitHub] spark issue #18637: [SPARK-15526][ML][FOLLOWUP][test-maven] Make JPMML provi...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18637 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79619/ Test PASSed. ---

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18616 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18637: [SPARK-15526][ML][FOLLOWUP][test-maven] Make JPMML provi...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18637 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18616: [SPARK-21377][YARN] Make jars specify with --jars/--pack...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18616 **[Test build #79625 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79625/testReport)** for PR 18616 at commit

[GitHub] spark pull request #18638: [SPARK-21421][SS]Add the query id as a local prop...

2017-07-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18638 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #18513: [SPARK-13969][ML] Add FeatureHasher transformer

2017-07-14 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/18513 Just to clarify: * If I want to treat a column as categorical that is represented by integers, I'd have to map those integers to strings, right? I believe that's one of your bullets above.

[GitHub] spark issue #18444: [SPARK-16542][SQL][PYSPARK] Fix bugs about types that re...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18444 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79618/ Test PASSed. ---

[GitHub] spark issue #18637: [SPARK-15526][ML][FOLLOWUP][test-maven] Make JPMML provi...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18637 **[Test build #79619 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79619/testReport)** for PR 18637 at commit

[GitHub] spark issue #18444: [SPARK-16542][SQL][PYSPARK] Fix bugs about types that re...

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18444 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18444: [SPARK-16542][SQL][PYSPARK] Fix bugs about types that re...

2017-07-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18444 **[Test build #79618 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/79618/testReport)** for PR 18444 at commit

[GitHub] spark issue #18638: [SPARK-21421][SS]Add the query id as a local property to...

2017-07-14 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/18638 Thanks! Merging to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127552088 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/OneVsRestSuite.scala --- @@ -101,6 +101,50 @@ class OneVsRestSuite extends

[GitHub] spark pull request #18370: [SPARK-9825][yarn] Do not overwrite final Hadoop ...

2017-07-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18370 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127552778 --- Diff: python/pyspark/ml/classification.py --- @@ -1511,27 +1512,47 @@ class OneVsRest(Estimator, OneVsRestParams, MLReadable, MLWritable):

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127552478 --- Diff: python/pyspark/ml/classification.py --- @@ -1511,27 +1512,47 @@ class OneVsRest(Estimator, OneVsRestParams, MLReadable, MLWritable):

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127552072 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/OneVsRestSuite.scala --- @@ -101,6 +101,50 @@ class OneVsRestSuite extends

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127552824 --- Diff: python/pyspark/ml/classification.py --- @@ -1511,27 +1512,47 @@ class OneVsRest(Estimator, OneVsRestParams, MLReadable, MLWritable):

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127550679 --- Diff: mllib/src/main/scala/org/apache/spark/ml/util/HasParallelism.scala --- @@ -0,0 +1,61 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127553451 --- Diff: python/pyspark/ml/tests.py --- @@ -1229,11 +1229,30 @@ def test_output_columns(self): (2.0,

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127551356 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/OneVsRest.scala --- @@ -325,8 +326,11 @@ final class OneVsRest @Since("1.4.0") (

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127551019 --- Diff: mllib/src/main/scala/org/apache/spark/ml/util/HasParallelism.scala --- @@ -0,0 +1,61 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127550735 --- Diff: mllib/src/main/scala/org/apache/spark/ml/util/HasParallelism.scala --- @@ -0,0 +1,61 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #18281: [SPARK-21027][ML][PYTHON] Added tunable paralleli...

2017-07-14 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/18281#discussion_r127553419 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/OneVsRestSuite.scala --- @@ -101,6 +101,50 @@ class OneVsRestSuite extends

[GitHub] spark issue #18547: [SPARK-21321][Spark Core] Spark very verbose on shutdown

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18547 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18555: [SPARK-21353][CORE]add checkValue in spark.internal.conf...

2017-07-14 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/18555 Checking whether the values are set or not is not enough. We need to check whether these parameters are effective or not. That means, we need to check the behaviors of Spark --- If your

[GitHub] spark issue #18547: [SPARK-21321][Spark Core] Spark very verbose on shutdown

2017-07-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18547 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/79617/ Test PASSed. ---

[GitHub] spark issue #18370: [SPARK-9825][yarn] Do not overwrite final Hadoop config ...

2017-07-14 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/18370 > it feels like we shouldn't ship the hadoop conf dir but that might break some people Yeah, we rely on that to ship other configs (like Hive and HBase) with the application without the

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-07-14 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/17980 Thank you for review, @kiszk . - The first one about [adding new ORC source](https://github.com/apache/spark/pull/17924) is more smaller than this . - Also, there is [more smaller

  1   2   3   >