[jira] [Comment Edited] (SPARK-38648) SPIP: Simplified API for DL Inferencing

2022-08-19 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582071#comment-17582071 ] Xiangrui Meng edited comment on SPARK-38648 at 8/19/22 10:55 PM: - I had

[jira] [Commented] (SPARK-38648) SPIP: Simplified API for DL Inferencing

2022-08-19 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582071#comment-17582071 ] Xiangrui Meng commented on SPARK-38648: --- I had an offline discussion with [~leewyang]. Summary:

[jira] [Commented] (SPARK-38648) SPIP: Simplified API for DL Inferencing

2022-04-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-38648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17528862#comment-17528862 ] Xiangrui Meng commented on SPARK-38648: --- I think it is beneficial to both Spark and DL frameworks

[jira] [Updated] (SPARK-37004) Job cancellation causes py4j errors on Jupyter due to pinned thread mode

2021-10-13 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-37004: -- Description: Spark 3.2.0 turned on py4j pinned thread mode by default (SPARK-35303).

[jira] [Updated] (SPARK-37004) Job cancellation causes py4j errors on Jupyter due to pinned thread mode

2021-10-13 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-37004: -- Issue Type: Bug (was: Improvement) > Job cancellation causes py4j errors on Jupyter due to

[jira] [Updated] (SPARK-37004) Job cancellation causes py4j errors on Jupyter due to pinned thread mode

2021-10-13 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-37004: -- Attachment: pinned.ipynb > Job cancellation causes py4j errors on Jupyter due to pinned

[jira] [Created] (SPARK-37004) Job cancellation causes py4j errors on Jupyter due to pinned thread mode

2021-10-13 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-37004: - Summary: Job cancellation causes py4j errors on Jupyter due to pinned thread mode Key: SPARK-37004 URL: https://issues.apache.org/jira/browse/SPARK-37004 Project:

[jira] [Assigned] (SPARK-36578) Minor UnivariateFeatureSelector API doc improvement

2021-08-24 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-36578: - Assignee: Huaxin Gao > Minor UnivariateFeatureSelector API doc improvement >

[jira] [Commented] (SPARK-36578) Minor UnivariateFeatureSelector API doc improvement

2021-08-24 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17404095#comment-17404095 ] Xiangrui Meng commented on SPARK-36578: --- [~huaxingao] I saw some possible improvement to the API

[jira] [Created] (SPARK-36578) Minor UnivariateFeatureSelector API doc improvement

2021-08-24 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-36578: - Summary: Minor UnivariateFeatureSelector API doc improvement Key: SPARK-36578 URL: https://issues.apache.org/jira/browse/SPARK-36578 Project: Spark Issue

[jira] [Updated] (SPARK-36578) Minor UnivariateFeatureSelector API doc improvement

2021-08-24 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-36578: -- Affects Version/s: 3.1.1 > Minor UnivariateFeatureSelector API doc improvement >

[jira] [Updated] (SPARK-36578) Minor UnivariateFeatureSelector API doc improvement

2021-08-24 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-36578: -- Target Version/s: (was: 3.2.0) > Minor UnivariateFeatureSelector API doc improvement >

[jira] [Commented] (SPARK-34415) Use randomization as a possibly better technique than grid search in optimizing hyperparameters

2021-08-17 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400644#comment-17400644 ] Xiangrui Meng commented on SPARK-34415: --- [~phenry] [~srowen] The implementation doesn't do uniform

[jira] [Resolved] (SPARK-24374) SPIP: Support Barrier Execution Mode in Apache Spark

2021-03-19 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-24374. --- Fix Version/s: 2.4.0 Target Version/s: 2.4.0 Resolution: Fixed I'm marking

[jira] [Commented] (SPARK-34080) Add UnivariateFeatureSelector to deprecate existing selectors

2021-01-11 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17263006#comment-17263006 ] Xiangrui Meng commented on SPARK-34080: --- Not sure if we have time for 3.1.1 release. But if there

[jira] [Updated] (SPARK-34080) Add UnivariateFeatureSelector to deprecate existing selectors

2021-01-11 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-34080: -- Affects Version/s: 3.1.1 > Add UnivariateFeatureSelector to deprecate existing selectors >

[jira] [Updated] (SPARK-34080) Add UnivariateFeatureSelector to deprecate existing selectors

2021-01-11 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-34080: -- Description: In SPARK-26111, we introduced a few univariate feature selectors, which share a

[jira] [Updated] (SPARK-34080) Add UnivariateFeatureSelector to deprecate existing selectors

2021-01-11 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-34080: -- Description: In SPARK-26111, we introduced a few univariate feature selectors, which share a

[jira] [Updated] (SPARK-34080) Add UnivariateFeatureSelector to deprecate existing selectors

2021-01-11 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-34080: -- Description: In SPARK-26111, we introduced a few univariate feature selectors, which share a

[jira] [Created] (SPARK-34080) Add UnivariateFeatureSelector to deprecate existing selectors

2021-01-11 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-34080: - Summary: Add UnivariateFeatureSelector to deprecate existing selectors Key: SPARK-34080 URL: https://issues.apache.org/jira/browse/SPARK-34080 Project: Spark

[jira] [Commented] (SPARK-32933) Use keyword-only syntax for keyword_only methods

2020-09-23 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17200580#comment-17200580 ] Xiangrui Meng commented on SPARK-32933: --- Why do we keep using the @keyword_only annotation after

[jira] [Comment Edited] (SPARK-32429) Standalone Mode allow setting CUDA_VISIBLE_DEVICES on executor launch

2020-07-28 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166532#comment-17166532 ] Xiangrui Meng edited comment on SPARK-32429 at 7/28/20, 4:30 PM: -

[jira] [Commented] (SPARK-32429) Standalone Mode allow setting CUDA_VISIBLE_DEVICES on executor launch

2020-07-28 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17166532#comment-17166532 ] Xiangrui Meng commented on SPARK-32429: --- [~tgraves] Thanks for the clarification! It makes sense

[jira] [Commented] (SPARK-32429) Standalone Mode allow setting CUDA_VISIBLE_DEVICES on executor launch

2020-07-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17165903#comment-17165903 ] Xiangrui Meng commented on SPARK-32429: --- Couple questions: 1. Which GPU resource name do we use?

[jira] [Created] (SPARK-31777) CrossValidator supports user-supplied folds

2020-05-20 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-31777: - Summary: CrossValidator supports user-supplied folds Key: SPARK-31777 URL: https://issues.apache.org/jira/browse/SPARK-31777 Project: Spark Issue Type:

[jira] [Updated] (SPARK-31776) Literal lit() supports lists and numpy arrays

2020-05-20 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31776: -- Issue Type: Improvement (was: New Feature) > Literal lit() supports lists and numpy arrays >

[jira] [Created] (SPARK-31776) Literal lit() supports lists and numpy arrays

2020-05-20 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-31776: - Summary: Literal lit() supports lists and numpy arrays Key: SPARK-31776 URL: https://issues.apache.org/jira/browse/SPARK-31776 Project: Spark Issue Type:

[jira] [Created] (SPARK-31775) Support tensor type (TensorType) in Spark SQL/DataFrame

2020-05-20 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-31775: - Summary: Support tensor type (TensorType) in Spark SQL/DataFrame Key: SPARK-31775 URL: https://issues.apache.org/jira/browse/SPARK-31775 Project: Spark

[jira] [Updated] (SPARK-31610) Expose hashFuncVersion property in HashingTF

2020-05-12 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31610: -- Issue Type: Improvement (was: Bug) > Expose hashFuncVersion property in HashingTF >

[jira] [Updated] (SPARK-31610) Expose hashFuncVersion property in HashingTF

2020-05-12 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31610: -- Priority: Major (was: Critical) > Expose hashFuncVersion property in HashingTF >

[jira] [Updated] (SPARK-31610) Expose hashFuncVersion property in HashingTF

2020-05-12 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31610: -- Description: Expose hashFuncVersion property in HashingTF Some third-party library such as

[jira] [Assigned] (SPARK-31610) Expose hashFunc property in HashingTF

2020-05-12 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-31610: - Assignee: Weichen Xu > Expose hashFunc property in HashingTF >

[jira] [Resolved] (SPARK-31668) Saving and loading HashingTF leads to hash function changed

2020-05-12 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-31668. --- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 28413

[jira] [Updated] (SPARK-31610) Expose hashFuncVersion property in HashingTF

2020-05-12 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31610: -- Summary: Expose hashFuncVersion property in HashingTF (was: Expose hashFunc property in

[jira] [Resolved] (SPARK-31610) Expose hashFunc property in HashingTF

2020-05-12 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-31610. --- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 28413

[jira] [Updated] (SPARK-31610) Expose hashFunc property in HashingTF

2020-05-08 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31610: -- Issue Type: Bug (was: Improvement) > Expose hashFunc property in HashingTF >

[jira] [Updated] (SPARK-31610) Expose hashFunc property in HashingTF

2020-05-08 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31610: -- Priority: Critical (was: Major) > Expose hashFunc property in HashingTF >

[jira] [Updated] (SPARK-31549) Pyspark SparkContext.cancelJobGroup do not work correctly

2020-04-28 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31549: -- Target Version/s: 3.0.0 > Pyspark SparkContext.cancelJobGroup do not work correctly >

[jira] [Resolved] (SPARK-31497) Pyspark CrossValidator/TrainValidationSplit with pipeline estimator cannot save and load model

2020-04-26 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-31497. --- Resolution: Fixed Issue resolved by pull request 28279

[jira] [Updated] (SPARK-31497) Pyspark CrossValidator/TrainValidationSplit with pipeline estimator cannot save and load model

2020-04-26 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31497: -- Fix Version/s: 3.0.0 > Pyspark CrossValidator/TrainValidationSplit with pipeline estimator

[jira] [Updated] (SPARK-31497) Pyspark CrossValidator/TrainValidationSplit with pipeline estimator cannot save and load model

2020-04-26 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-31497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-31497: -- Target Version/s: 3.0.0 > Pyspark CrossValidator/TrainValidationSplit with pipeline estimator

[jira] [Commented] (SPARK-30969) Remove resource coordination support from Standalone

2020-02-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17046795#comment-17046795 ] Xiangrui Meng commented on SPARK-30969: --- [~Ngone51] [~jiangxb1987] Is there a JIRA to deprecate

[jira] [Updated] (SPARK-30969) Remove resource coordination support from Standalone

2020-02-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30969: -- Environment: (was: Resource coordination is used for the case where multiple workers

[jira] [Updated] (SPARK-30969) Remove resource coordination support from Standalone

2020-02-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30969: -- Priority: Critical (was: Major) > Remove resource coordination support from Standalone >

[jira] [Assigned] (SPARK-30969) Remove resource coordination support from Standalone

2020-02-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-30969: - Assignee: wuyi > Remove resource coordination support from Standalone >

[jira] [Updated] (SPARK-30969) Remove resource coordination support from Standalone

2020-02-27 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30969: -- Description: Resource coordination is used for the case where multiple workers running on the

[jira] [Updated] (SPARK-30667) Support simple all gather in barrier task context

2020-02-20 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30667: -- Fix Version/s: (was: 3.0.0) > Support simple all gather in barrier task context >

[jira] [Reopened] (SPARK-30667) Support simple all gather in barrier task context

2020-02-20 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-30667: --- > Support simple all gather in barrier task context >

[jira] [Resolved] (SPARK-30667) Support simple all gather in barrier task context

2020-02-13 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-30667. --- Resolution: Fixed > Support simple all gather in barrier task context >

[jira] [Assigned] (SPARK-30667) Support simple all gather in barrier task context

2020-02-13 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-30667: - Assignee: Sarth Frey > Support simple all gather in barrier task context >

[jira] [Updated] (SPARK-30667) Support simple all gather in barrier task context

2020-02-13 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30667: -- Target Version/s: 3.0.0 (was: 3.1.0) > Support simple all gather in barrier task context >

[jira] [Updated] (SPARK-30667) Support simple all gather in barrier task context

2020-02-13 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30667: -- Fix Version/s: 3.0.0 > Support simple all gather in barrier task context >

[jira] [Assigned] (SPARK-30762) Add dtype="float32" support to vector_to_array UDF

2020-02-09 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-30762: - Assignee: Liang Zhang > Add dtype="float32" support to vector_to_array UDF >

[jira] [Updated] (SPARK-30762) Add dtype="float32" support to vector_to_array UDF

2020-02-09 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30762: -- Component/s: PySpark > Add dtype="float32" support to vector_to_array UDF >

[jira] [Updated] (SPARK-30667) Support simple all gather in barrier task context

2020-01-28 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30667: -- Description: Currently we offer task.barrier() to coordinate tasks in barrier mode. Tasks

[jira] [Created] (SPARK-30667) Support simple all gather in barrier task context

2020-01-28 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-30667: - Summary: Support simple all gather in barrier task context Key: SPARK-30667 URL: https://issues.apache.org/jira/browse/SPARK-30667 Project: Spark Issue

[jira] [Resolved] (SPARK-30154) PySpark UDF to convert MLlib vectors to dense arrays

2020-01-06 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-30154. --- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26910

[jira] [Updated] (SPARK-30154) PySpark UDF to convert MLlib vectors to dense arrays

2019-12-06 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30154: -- Description: If a PySpark user wants to convert MLlib sparse/dense vectors in a DataFrame

[jira] [Updated] (SPARK-30154) PySpark UDF to convert MLlib vectors to dense arrays

2019-12-06 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-30154: -- Summary: PySpark UDF to convert MLlib vectors to dense arrays (was: Allow PySpark code

[jira] [Created] (SPARK-30154) Allow PySpark code efficiently convert MLlib vectors to dense arrays

2019-12-06 Thread Xiangrui Meng (Jira)
Xiangrui Meng created SPARK-30154: - Summary: Allow PySpark code efficiently convert MLlib vectors to dense arrays Key: SPARK-30154 URL: https://issues.apache.org/jira/browse/SPARK-30154 Project:

[jira] [Resolved] (SPARK-28978) PySpark: Can't pass more than 256 arguments to a UDF

2019-11-08 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-28978. --- Fix Version/s: 3.0.0 Assignee: Bago Amirbekian Resolution: Fixed > PySpark:

[jira] [Resolved] (SPARK-29417) Resource Scheduling - add TaskContext.resource java api

2019-10-14 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-29417. --- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26083

[jira] [Assigned] (SPARK-28206) "@pandas_udf" in doctest is rendered as ":pandas_udf" in html API doc

2019-07-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-28206: - Assignee: Hyukjin Kwon > "@pandas_udf" in doctest is rendered as ":pandas_udf" in html

[jira] [Resolved] (SPARK-28206) "@pandas_udf" in doctest is rendered as ":pandas_udf" in html API doc

2019-07-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-28206. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 25060

[jira] [Updated] (SPARK-28206) "@pandas_udf" in doctest is rendered as ":pandas_udf" in html API doc

2019-06-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-28206: -- Attachment: Screen Shot 2019-06-28 at 9.55.13 AM.png > "@pandas_udf" in doctest is rendered

[jira] [Updated] (SPARK-28206) "@pandas_udf" in doctest is rendered as ":pandas_udf" in html API doc

2019-06-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-28206: -- Issue Type: Bug (was: Documentation) > "@pandas_udf" in doctest is rendered as ":pandas_udf"

[jira] [Updated] (SPARK-28206) "@pandas_udf" in doctest is rendered as ":pandas_udf" in html API doc

2019-06-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-28206: -- Summary: "@pandas_udf" in doctest is rendered as ":pandas_udf" in html API doc (was:

[jira] [Updated] (SPARK-28206) "@pandas_udf" in doctest is rendered as ":pandas_udf" in html

2019-06-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-28206: -- Summary: "@pandas_udf" in doctest is rendered as ":pandas_udf" in html (was: "@" is rendered

[jira] [Created] (SPARK-28206) "@" is rendered as ":" in doctest

2019-06-28 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-28206: - Summary: "@" is rendered as ":" in doctest Key: SPARK-28206 URL: https://issues.apache.org/jira/browse/SPARK-28206 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-28115) Fix flaky test: SparkContextSuite.test resource scheduling under local-cluster mode

2019-06-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-28115. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24917

[jira] [Resolved] (SPARK-28056) Document SCALAR_ITER Pandas UDF

2019-06-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-28056. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24897

[jira] [Assigned] (SPARK-28056) Document SCALAR_ITER Pandas UDF

2019-06-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-28056: - Assignee: Xiangrui Meng > Document SCALAR_ITER Pandas UDF >

[jira] [Resolved] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-06-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-26412. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24643

[jira] [Created] (SPARK-28056) Document SCALAR_ITER Pandas UDF

2019-06-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-28056: - Summary: Document SCALAR_ITER Pandas UDF Key: SPARK-28056 URL: https://issues.apache.org/jira/browse/SPARK-28056 Project: Spark Issue Type: Documentation

[jira] [Resolved] (SPARK-28030) Binary file data source doesn't support space in file names

2019-06-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-28030. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24855

[jira] [Assigned] (SPARK-27823) Add an abstraction layer for accelerator resource handling to avoid manipulating raw confs

2019-06-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27823: - Assignee: Xiangrui Meng (was: Thomas Graves) > Add an abstraction layer for

[jira] [Updated] (SPARK-28030) Binary file data source doesn't support space in file names

2019-06-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-28030: -- Description: {code} echo 123 > "/tmp/test space.txt"

[jira] [Created] (SPARK-28030) Binary file data source doesn't support space in file names

2019-06-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-28030: - Summary: Binary file data source doesn't support space in file names Key: SPARK-28030 URL: https://issues.apache.org/jira/browse/SPARK-28030 Project: Spark

[jira] [Commented] (SPARK-27360) Standalone cluster mode support for GPU-aware scheduling

2019-06-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16861353#comment-16861353 ] Xiangrui Meng commented on SPARK-27360: --- Done. Thanks for taking the task! > Standalone cluster

[jira] [Deleted] (SPARK-27999) setup resources when Standalone Worker starts up

2019-06-11 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng deleted SPARK-27999: -- > setup resources when Standalone Worker starts up >

[jira] [Updated] (SPARK-27372) Standalone executor process-level isolation to support GPU scheduling

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27372: -- Issue Type: Story (was: Sub-task) Parent: (was: SPARK-27360) > Standalone

[jira] [Updated] (SPARK-27371) Standalone master receives resource info from worker and allocate driver/executor properly

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27371: -- Summary: Standalone master receives resource info from worker and allocate driver/executor

[jira] [Updated] (SPARK-27371) Master receives resource info from worker and allocate driver/executor properly

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27371: -- Summary: Master receives resource info from worker and allocate driver/executor properly

[jira] [Deleted] (SPARK-27370) spark-submit requests GPUs in standalone mode

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng deleted SPARK-27370: -- > spark-submit requests GPUs in standalone mode > -

[jira] [Updated] (SPARK-27369) Standalone worker can load resource conf and discover resources

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27369: -- Summary: Standalone worker can load resource conf and discover resources (was: Standalone

[jira] [Updated] (SPARK-27368) Design: Standalone supports GPU scheduling

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27368: -- Description: Design draft: Scenarios: * client-mode, worker might create one or more

[jira] [Updated] (SPARK-27368) Design: Standalone supports GPU scheduling

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27368: -- Description: Design draft: Scenarios: * client-mode, worker might create one or more

[jira] [Updated] (SPARK-27368) Design: Standalone supports GPU scheduling

2019-06-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27368: -- Description: Design draft: Scenarios: * client-mode, worker might create one or more

[jira] [Resolved] (SPARK-27968) ArrowEvalPythonExec.evaluate shouldn't eagerly read the first row

2019-06-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27968. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24816

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-06-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Description: Pandas UDF is the ideal connection between PySpark and DL model inference

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-06-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Description: Pandas UDF is the ideal connection between PySpark and DL model inference

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-06-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Description: Pandas UDF is the ideal connection between PySpark and DL model inference

[jira] [Created] (SPARK-27968) ArrowEvalPythonExec.evaluate shouldn't eagerly read the first row

2019-06-06 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27968: - Summary: ArrowEvalPythonExec.evaluate shouldn't eagerly read the first row Key: SPARK-27968 URL: https://issues.apache.org/jira/browse/SPARK-27968 Project: Spark

[jira] [Updated] (SPARK-27368) Design: Standalone supports GPU scheduling

2019-06-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27368: -- Description: Design draft: Scenarios: * client-mode, worker might create one or more

[jira] [Updated] (SPARK-27368) Design: Standalone supports GPU scheduling

2019-06-05 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27368: -- Description: Design draft: Scenarios: * client-mode, worker might create one or more

[jira] [Resolved] (SPARK-27366) Spark scheduler internal changes to support GPU scheduling

2019-06-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27366. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24374

[jira] [Commented] (SPARK-27888) Python 2->3 migration guide for PySpark users

2019-06-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16855838#comment-16855838 ] Xiangrui Meng commented on SPARK-27888: --- It would be nice if we can find some PySpark users who

[jira] [Assigned] (SPARK-27884) Deprecate Python 2 support in Spark 3.0

2019-06-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27884: - Assignee: Xiangrui Meng > Deprecate Python 2 support in Spark 3.0 >

[jira] [Commented] (SPARK-27888) Python 2->3 migration guide for PySpark users

2019-06-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16855832#comment-16855832 ] Xiangrui Meng commented on SPARK-27888: --- This JIRA is not to inform users that Python 2 is

[jira] [Resolved] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-06-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27886. --- Resolution: Done > Add Apache Spark project to https://python3statement.org/ >

  1   2   3   4   5   6   7   8   9   10   >