[
https://issues.apache.org/jira/browse/SPARK-28360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936207#comment-16936207
]
holdenk commented on SPARK-28360:
-
Don't we need a service account name to create the ex
[
https://issues.apache.org/jira/browse/SPARK-28362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936206#comment-16936206
]
holdenk edited comment on SPARK-28362 at 9/23/19 9:34 PM:
--
Why
[
https://issues.apache.org/jira/browse/SPARK-28362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936206#comment-16936206
]
holdenk commented on SPARK-28362:
-
Why is your default parallelism configured to `49 * 1
[
https://issues.apache.org/jira/browse/SPARK-28403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-28403:
Shepherd: holdenk
> Executor Allocation Manager can add an extra executor when speculative tasks
> ---
[
https://issues.apache.org/jira/browse/SPARK-28517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936201#comment-16936201
]
holdenk commented on SPARK-28517:
-
cc [~bryanc] / [~ifilonenko]
> pyspark with --conf s
[
https://issues.apache.org/jira/browse/SPARK-28558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936200#comment-16936200
]
holdenk commented on SPARK-28558:
-
What storage system are y'all using [~nladuguie] & [~
[
https://issues.apache.org/jira/browse/SPARK-28592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936199#comment-16936199
]
holdenk commented on SPARK-28592:
-
Should we set this to blocker so we don't forget?
>
[
https://issues.apache.org/jira/browse/SPARK-28653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936198#comment-16936198
]
holdenk commented on SPARK-28653:
-
[~thanida.t] can you confirm if you're still exerpein
[
https://issues.apache.org/jira/browse/SPARK-28727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936196#comment-16936196
]
holdenk commented on SPARK-28727:
-
I don't believe we'll be adding new algorithms to Spa
[
https://issues.apache.org/jira/browse/SPARK-28781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-28781:
Issue Type: Improvement (was: Bug)
> Unneccesary persist in PeriodicCheckpointer.update()
> -
[
https://issues.apache.org/jira/browse/SPARK-28978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-28978:
Target Version/s: 3.0.0
> PySpark: Can't pass more than 256 arguments to a UDF
> -
[
https://issues.apache.org/jira/browse/SPARK-29083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-29083:
---
Assignee: holdenk
> Speed up toLocalIterator with prefetching when enabled
> --
[
https://issues.apache.org/jira/browse/SPARK-29217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936186#comment-16936186
]
holdenk commented on SPARK-29217:
-
Can you clarify what you mean by "Moving some files i
[
https://issues.apache.org/jira/browse/SPARK-29163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16936053#comment-16936053
]
holdenk commented on SPARK-29163:
-
I'm going to try and do some work on this before the
[
https://issues.apache.org/jira/browse/SPARK-27659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-27659.
-
Fix Version/s: 3.0.0
Assignee: holdenk
Resolution: Fixed
> Allow PySpark toLocalIterator
[
https://issues.apache.org/jira/browse/SPARK-28936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-28936.
-
Resolution: Fixed
> Simplify Spark K8s tests by replacing race condition during command execution
>
[
https://issues.apache.org/jira/browse/SPARK-28936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-28936:
Fix Version/s: 3.0.0
> Simplify Spark K8s tests by replacing race condition during command execution
> ---
[
https://issues.apache.org/jira/browse/SPARK-28937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-28937.
-
Fix Version/s: 3.0.0
Resolution: Fixed
> Improve error reporting in Spark Secrets Test Suite
> --
[
https://issues.apache.org/jira/browse/SPARK-29193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16934798#comment-16934798
]
holdenk commented on SPARK-29193:
-
My bad, looks like we fixed this in
SPARK-28921
>
[
https://issues.apache.org/jira/browse/SPARK-29193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-29193.
-
Fix Version/s: 3.0.0
Resolution: Duplicate
> Update fabric8 version to 4.3 continue docker 4 desk
[
https://issues.apache.org/jira/browse/SPARK-29193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-29193:
Description:
The current version of the kubernetes client we are using has some issues with
not setting o
[
https://issues.apache.org/jira/browse/SPARK-29193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16934795#comment-16934795
]
holdenk commented on SPARK-29193:
-
While I've only observed the issue on docker 4 deskto
holdenk created SPARK-29193:
---
Summary: Update fabric8 version to continue docker 4 desktop
support
Key: SPARK-29193
URL: https://issues.apache.org/jira/browse/SPARK-29193
Project: Spark
Issue Type
holdenk created SPARK-29163:
---
Summary: Provide a mixin to simplify HadoopConf access patterns in
DataSource V2
Key: SPARK-29163
URL: https://issues.apache.org/jira/browse/SPARK-29163
Project: Spark
holdenk created SPARK-29158:
---
Summary: Expose SerializableConfiguration for DSv2
Key: SPARK-29158
URL: https://issues.apache.org/jira/browse/SPARK-29158
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-22390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932739#comment-16932739
]
holdenk commented on SPARK-22390:
-
Love to follow where this is going, especially if it
holdenk created SPARK-29083:
---
Summary: Speed up toLocalIterator with prefetching when enabled
Key: SPARK-29083
URL: https://issues.apache.org/jira/browse/SPARK-29083
Project: Spark
Issue Type: Impr
holdenk created SPARK-29076:
---
Summary: Generalize the PVTestSuite to no longer need the minikube
tag
Key: SPARK-29076
URL: https://issues.apache.org/jira/browse/SPARK-29076
Project: Spark
Issue Ty
[
https://issues.apache.org/jira/browse/SPARK-28937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16919991#comment-16919991
]
holdenk commented on SPARK-28937:
-
I'm working on this
> Improve error reporting in Spa
holdenk created SPARK-28937:
---
Summary: Improve error reporting in Spark Secrets Test Suite
Key: SPARK-28937
URL: https://issues.apache.org/jira/browse/SPARK-28937
Project: Spark
Issue Type: Improve
[
https://issues.apache.org/jira/browse/SPARK-28936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16919990#comment-16919990
]
holdenk commented on SPARK-28936:
-
I'm working on this.
> Simplify Spark K8s tests by r
holdenk created SPARK-28936:
---
Summary: Simplify Spark K8s tests by replacing race condition
during command execution
Key: SPARK-28936
URL: https://issues.apache.org/jira/browse/SPARK-28936
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-28904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16918142#comment-16918142
]
holdenk commented on SPARK-28904:
-
Related PV FSGroup https://issues.apache.org/jira/bro
holdenk created SPARK-28905:
---
Summary: PVs mounted into Spark may not be writable by Spark
Key: SPARK-28905
URL: https://issues.apache.org/jira/browse/SPARK-28905
Project: Spark
Issue Type: Improve
holdenk created SPARK-28904:
---
Summary: Spark PV tests don't create required mount
Key: SPARK-28904
URL: https://issues.apache.org/jira/browse/SPARK-28904
Project: Spark
Issue Type: Bug
Co
holdenk created SPARK-28886:
---
Summary: Kubernetes DepsTestsSuite fails on OSX with minikube
1.3.1 due to formatting
Key: SPARK-28886
URL: https://issues.apache.org/jira/browse/SPARK-28886
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-28842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-28842:
Labels: starter (was: )
> Cleanup the formatting/trailing spaces in
> resource-managers/kubernetes/integ
holdenk created SPARK-28842:
---
Summary: Cleanup the formatting/trailing spaces in
resource-managers/kubernetes/integration-tests/README.md
Key: SPARK-28842
URL: https://issues.apache.org/jira/browse/SPARK-28842
[
https://issues.apache.org/jira/browse/SPARK-28784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-28784:
---
Assignee: Shruti Gumma
> StreamExecution and StreamingQueryManager should utilize
> CheckpointFile
[
https://issues.apache.org/jira/browse/SPARK-27659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16909509#comment-16909509
]
holdenk commented on SPARK-27659:
-
I'm working on this.
> Allow PySpark toLocalIterator
[
https://issues.apache.org/jira/browse/SPARK-27683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16908410#comment-16908410
]
holdenk commented on SPARK-27683:
-
Interesting related discussion over in
[https://cont
[
https://issues.apache.org/jira/browse/SPARK-24666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16908267#comment-16908267
]
holdenk commented on SPARK-24666:
-
[~zhongyu09] specific code & data which leads to repro
holdenk created SPARK-28740:
---
Summary: Add support for building with bloop
Key: SPARK-28740
URL: https://issues.apache.org/jira/browse/SPARK-28740
Project: Spark
Issue Type: Improvement
C
[
https://issues.apache.org/jira/browse/SPARK-9792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-9792.
Resolution: Fixed
Fix Version/s: 3.0.0
> PySpark DenseMatrix, SparseMatrix should override __eq__
>
holdenk created SPARK-27095:
---
Summary: We depend on silently accepting failures in
setup-integration-test-env.sh
Key: SPARK-27095
URL: https://issues.apache.org/jira/browse/SPARK-27095
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-21094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-21094:
---
Assignee: Peter Parente
> Allow stdout/stderr pipes in pyspark.java_gateway.launch_gateway
> --
[
https://issues.apache.org/jira/browse/SPARK-21094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-21094.
-
Resolution: Fixed
Fix Version/s: 3.0.0
> Allow stdout/stderr pipes in pyspark.java_gateway.launch
holdenk created SPARK-26898:
---
Summary: Scalastyle should run during k8s integration tests
Key: SPARK-26898
URL: https://issues.apache.org/jira/browse/SPARK-26898
Project: Spark
Issue Type: Improvem
holdenk created SPARK-26882:
---
Summary: lint-scala script does not check all components
Key: SPARK-26882
URL: https://issues.apache.org/jira/browse/SPARK-26882
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-26185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-26185:
---
Assignee: Huaxin Gao
> add weightCol in python MulticlassClassificationEvaluator
>
[
https://issues.apache.org/jira/browse/SPARK-24489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-24489.
-
Resolution: Fixed
Fix Version/s: 3.0.0
Thanks for working on this, I've merged the fix into mast
[
https://issues.apache.org/jira/browse/SPARK-24489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-24489:
---
Assignee: shahid
> No check for invalid input type of weight data in ml.PowerIterationClustering
>
holdenk created SPARK-26497:
---
Summary: Show users where the pre-packaged SparkR and PySpark
Dockerfiles are in the image build script.
Key: SPARK-26497
URL: https://issues.apache.org/jira/browse/SPARK-26497
holdenk created SPARK-26343:
---
Summary: Running the kubernetes
Key: SPARK-26343
URL: https://issues.apache.org/jira/browse/SPARK-26343
Project: Spark
Issue Type: Improvement
Components: K
[
https://issues.apache.org/jira/browse/SPARK-26343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-26343:
Summary: Speed up running the kubernetes integration tests locally (was:
Running the kubernetes )
> Spee
[
https://issues.apache.org/jira/browse/SPARK-25255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-25255.
-
Resolution: Fixed
Thanks for the PR and fixing this issue :)
> Add getActiveSession to SparkSession in
[
https://issues.apache.org/jira/browse/SPARK-25255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-25255:
Fix Version/s: 3.0.0
> Add getActiveSession to SparkSession in PySpark
> -
[
https://issues.apache.org/jira/browse/SPARK-25255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-25255:
---
Assignee: Huaxin Gao
> Add getActiveSession to SparkSession in PySpark
> --
[
https://issues.apache.org/jira/browse/SPARK-20598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621396#comment-16621396
]
holdenk commented on SPARK-20598:
-
Huh, that's interesting. I suspect that could be we're
[
https://issues.apache.org/jira/browse/SPARK-25467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621391#comment-16621391
]
holdenk commented on SPARK-25467:
-
cc [~bryanc]
> Python date/datetime objects in dataf
[
https://issues.apache.org/jira/browse/SPARK-14352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-14352:
---
Assignee: zhengruifeng
> approxQuantile should support multi columns
>
[
https://issues.apache.org/jira/browse/SPARK-17602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621389#comment-16621389
]
holdenk commented on SPARK-17602:
-
Did we end up going anywhere with this?
> PySpark -
[
https://issues.apache.org/jira/browse/SPARK-14352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-14352.
-
Resolution: Fixed
Target Version/s: 2.2.0
> approxQuantile should support multi columns
>
[
https://issues.apache.org/jira/browse/SPARK-25021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-25021:
Fix Version/s: 2.4.0
> Add spark.executor.pyspark.memory support to Kubernetes
> -
holdenk created SPARK-25432:
---
Summary: Consider if using standard getOrCreate from PySpark into
JVM SparkSession would simplify code
Key: SPARK-25432
URL: https://issues.apache.org/jira/browse/SPARK-25432
P
[
https://issues.apache.org/jira/browse/SPARK-25021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-25021.
-
Resolution: Fixed
Fix Version/s: 3.0.0
Merged for 3 - open to the discussion around backporting.
[
https://issues.apache.org/jira/browse/SPARK-25021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-25021:
---
Assignee: Ilan Filonenko
> Add spark.executor.pyspark.memory support to Kubernetes
> --
holdenk created SPARK-25373:
---
Summary: Support mixed language pipelines on Spark on K8s
Key: SPARK-25373
URL: https://issues.apache.org/jira/browse/SPARK-25373
Project: Spark
Issue Type: Improvemen
[
https://issues.apache.org/jira/browse/SPARK-25270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-25270:
---
Assignee: cclauss
> lint-python: Add flake8 to find syntax errors and undefined names
> ---
[
https://issues.apache.org/jira/browse/SPARK-25370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-25370.
-
Resolution: Duplicate
Issue was already fixed later.
> Undefined name _exception_message in java_gatewa
holdenk created SPARK-25370:
---
Summary: Undefined name _exception_message in java_gateway
Key: SPARK-25370
URL: https://issues.apache.org/jira/browse/SPARK-25370
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-25370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-25370:
---
Assignee: holdenk
> Undefined name _exception_message in java_gateway
> ---
holdenk created SPARK-25360:
---
Summary: Parallelized RDDs of Ranges could have known partitioner
Key: SPARK-25360
URL: https://issues.apache.org/jira/browse/SPARK-25360
Project: Spark
Issue Type: Im
[
https://issues.apache.org/jira/browse/SPARK-25255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-25255:
Labels: starter (was: )
> Add getActiveSession to SparkSession in PySpark
> -
holdenk created SPARK-25255:
---
Summary: Add getActiveSession to SparkSession in PySpark
Key: SPARK-25255
URL: https://issues.apache.org/jira/browse/SPARK-25255
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-25236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16593098#comment-16593098
]
holdenk commented on SPARK-25236:
-
Probably. The only thing would be probably wanting to
holdenk created SPARK-25236:
---
Summary: Investigate using a logging library inside of PySpark on
the workers instead of print
Key: SPARK-25236
URL: https://issues.apache.org/jira/browse/SPARK-25236
Project:
[
https://issues.apache.org/jira/browse/SPARK-9636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-9636:
---
Labels: (was: easyfix)
> Treat $SPARK_HOME as write-only
> ---
>
>
[
https://issues.apache.org/jira/browse/SPARK-19094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-19094.
-
Resolution: Won't Fix
No longer as important given other changes.
> Plumb through logging/error message
[
https://issues.apache.org/jira/browse/SPARK-25153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-25153:
Labels: starter (was: )
> Improve error messages for columns with dots/periods
>
holdenk created SPARK-25153:
---
Summary: Improve error messages for columns with dots/periods
Key: SPARK-25153
URL: https://issues.apache.org/jira/browse/SPARK-25153
Project: Spark
Issue Type: Improv
[
https://issues.apache.org/jira/browse/SPARK-24735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16578719#comment-16578719
]
holdenk commented on SPARK-24735:
-
So [~bryanc] what do you think of if we add an Aggregat
[
https://issues.apache.org/jira/browse/SPARK-24735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16578710#comment-16578710
]
holdenk commented on SPARK-24735:
-
I think we could do better than just improving the ex
holdenk created SPARK-25105:
---
Summary: Importing all of pyspark.sql.functions should bring
PandasUDFType in as well
Key: SPARK-25105
URL: https://issues.apache.org/jira/browse/SPARK-25105
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-24735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-24735:
Summary: Improve exception when mixing up pandas_udf types (was: Improve
exception when mixing pandas_udf
[
https://issues.apache.org/jira/browse/SPARK-24736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16578624#comment-16578624
]
holdenk commented on SPARK-24736:
-
cc [~ifilonenko]
> --py-files not functional for non
holdenk created SPARK-25053:
---
Summary: Allow additional port forwarding on Spark on K8S as needed
Key: SPARK-25053
URL: https://issues.apache.org/jira/browse/SPARK-25053
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-21436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16570517#comment-16570517
]
holdenk commented on SPARK-21436:
-
@[~podongfeng]
So distinct triggers a `map` first (
[
https://issues.apache.org/jira/browse/SPARK-21436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16570517#comment-16570517
]
holdenk edited comment on SPARK-21436 at 8/6/18 5:19 PM:
-
@[~pod
[
https://issues.apache.org/jira/browse/SPARK-24579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16562638#comment-16562638
]
holdenk commented on SPARK-24579:
-
[~mengxr] How about you just open comments up in gener
[
https://issues.apache.org/jira/browse/SPARK-23451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-23451.
-
Resolution: Fixed
Assignee: Marco Gaido
Fix Version/s: 2.4.0
> Deprecate KMeans computeC
[
https://issues.apache.org/jira/browse/SPARK-23528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk reassigned SPARK-23528:
---
Assignee: Marco Gaido
> Add numIter to ClusteringSummary
>
>
>
[
https://issues.apache.org/jira/browse/SPARK-23528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk resolved SPARK-23528.
-
Resolution: Fixed
Fix Version/s: 2.4.0
Thanks!
> Add numIter to ClusteringSummary
>
[
https://issues.apache.org/jira/browse/SPARK-23528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-23528:
Description:
Spark ML should expose vital statistics of the GMM model:
* *Number of iterations* (actual,
[
https://issues.apache.org/jira/browse/SPARK-23528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-23528:
Summary: Add numIter to ClusteringSummary (was: Expose vital statistics of
GaussianMixtureModel)
> Add n
[
https://issues.apache.org/jira/browse/SPARK-24780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-24780:
Summary: DataFrame.column_name should resolve to a distinct ref (was:
DataFrame.column_name should take i
[
https://issues.apache.org/jira/browse/SPARK-24780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-24780:
Description:
If we join a dataframe with another dataframe which has the same column name of
the conditio
holdenk created SPARK-24780:
---
Summary: DataFrame.column_name should take into account DataFrame
alias for future joins
Key: SPARK-24780
URL: https://issues.apache.org/jira/browse/SPARK-24780
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-24668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16530284#comment-16530284
]
holdenk commented on SPARK-24668:
-
So there is also the case where Spark is run without
[
https://issues.apache.org/jira/browse/SPARK-24668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
holdenk updated SPARK-24668:
Shepherd: holdenk
Affects Version/s: 2.4.0
> PySpark crashes when getting the webui url if th