[jira] [Commented] (SPARK-27889) Make development scripts under dev/ support Python 3

2019-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16855104#comment-16855104 ] Xiangrui Meng commented on SPARK-27889: --- [~smilegator] I assigned this ticket to you for auditing

[jira] [Assigned] (SPARK-27887) Check python version and print deprecation warning if version < 3

2019-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27887: - Assignee: Xiangrui Meng > Check python version and print deprecation warning if

[jira] [Assigned] (SPARK-27889) Make development scripts under dev/ support Python 3

2019-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27889: - Assignee: Xiao Li > Make development scripts under dev/ support Python 3 >

[jira] [Commented] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16854833#comment-16854833 ] Xiangrui Meng commented on SPARK-27886: --- Created a PR here:

[jira] [Resolved] (SPARK-27885) Announce deprecation of Python 2 support

2019-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27885. --- Resolution: Done > Announce deprecation of Python 2 support >

[jira] [Assigned] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27886: - Assignee: Xiangrui Meng > Add Apache Spark project to https://python3statement.org/ >

[jira] [Commented] (SPARK-27885) Announce deprecation of Python 2 support

2019-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16854737#comment-16854737 ] Xiangrui Meng commented on SPARK-27885: --- Announced at: *

[jira] [Assigned] (SPARK-27885) Announce deprecation of Python 2 support

2019-05-31 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27885: - Assignee: Xiangrui Meng > Announce deprecation of Python 2 support >

[jira] [Commented] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852276#comment-16852276 ] Xiangrui Meng commented on SPARK-27886: --- By "at the end of year", you mean year 2020, right? The

[jira] [Created] (SPARK-27889) Make development scripts under dev/ support Python 3

2019-05-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27889: - Summary: Make development scripts under dev/ support Python 3 Key: SPARK-27889 URL: https://issues.apache.org/jira/browse/SPARK-27889 Project: Spark Issue

[jira] [Updated] (SPARK-27885) Announce deprecation of Python 2 support

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27885: -- Description: * Draft the message. * Update Spark website and announce deprecation of Python 2

[jira] [Commented] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852088#comment-16852088 ] Xiangrui Meng commented on SPARK-27886: --- cc: [~srowen] [~smilegator] > Add Apache Spark project

[jira] [Updated] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27886: -- Description: Add Spark to https://python3statement.org/ and indicate our timeline. I

[jira] [Updated] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27886: -- Description: Add Spark to https://python3statement.org/ and indicate our timeline. I

[jira] [Commented] (SPARK-27884) Deprecate Python 2 support in Spark 3.0

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852077#comment-16852077 ] Xiangrui Meng commented on SPARK-27884: --- [~srowen] Could you help review the draft? Feel free to

[jira] [Commented] (SPARK-27884) Deprecate Python 2 support in Spark 3.0

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16852071#comment-16852071 ] Xiangrui Meng commented on SPARK-27884: --- Draft message: *Apache Spark's plan for dropping Python

[jira] [Updated] (SPARK-27884) Deprecate Python 2 support in Spark 3.0

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27884: -- Description: Officially deprecate Python 2 support in Spark 3.0. (was: Officially deprecate

[jira] [Updated] (SPARK-27885) Announce deprecation of Python 2 support

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27885: -- Description: * Draft the message. * Update Spark website and announce deprecation of Python 2

[jira] [Created] (SPARK-27888) Python 2->3 migration guide for PySpark users

2019-05-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27888: - Summary: Python 2->3 migration guide for PySpark users Key: SPARK-27888 URL: https://issues.apache.org/jira/browse/SPARK-27888 Project: Spark Issue Type:

[jira] [Updated] (SPARK-27885) Announce deprecation of Python 2 support

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27885: -- Summary: Announce deprecation of Python 2 support (was: Update Spark website and put

[jira] [Updated] (SPARK-27887) Check python version and print deprecation warning if version < 3

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27887: -- Description: In Spark 3.0, users should see a deprecation warning if they use PySpark with

[jira] [Created] (SPARK-27887) Check python version and print deprecation warning if version < 3

2019-05-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27887: - Summary: Check python version and print deprecation warning if version < 3 Key: SPARK-27887 URL: https://issues.apache.org/jira/browse/SPARK-27887 Project: Spark

[jira] [Updated] (SPARK-27885) Update Spark website and put deprecation warning

2019-05-30 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27885: -- Description: Update Spark website and announce deprecation of Python 2 support in the next

[jira] [Created] (SPARK-27886) Add Apache Spark project to https://python3statement.org/

2019-05-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27886: - Summary: Add Apache Spark project to https://python3statement.org/ Key: SPARK-27886 URL: https://issues.apache.org/jira/browse/SPARK-27886 Project: Spark

[jira] [Created] (SPARK-27885) Update Spark website and put deprecation warning

2019-05-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27885: - Summary: Update Spark website and put deprecation warning Key: SPARK-27885 URL: https://issues.apache.org/jira/browse/SPARK-27885 Project: Spark Issue

[jira] [Created] (SPARK-27884) Deprecate Python 2 support in Spark 3.0

2019-05-30 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27884: - Summary: Deprecate Python 2 support in Spark 3.0 Key: SPARK-27884 URL: https://issues.apache.org/jira/browse/SPARK-27884 Project: Spark Issue Type: Story

[jira] [Created] (SPARK-27823) Add an abstraction layer for accelerator resource handling to avoid manipulating raw confs

2019-05-23 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27823: - Summary: Add an abstraction layer for accelerator resource handling to avoid manipulating raw confs Key: SPARK-27823 URL: https://issues.apache.org/jira/browse/SPARK-27823

[jira] [Resolved] (SPARK-27488) Driver interface to support GPU resources

2019-05-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27488. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24615

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-05-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Description: Pandas UDF is the ideal connection between PySpark and DL model inference

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-05-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Description: Pandas UDF is the ideal connection between PySpark and DL model inference

[jira] [Commented] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-05-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16837353#comment-16837353 ] Xiangrui Meng commented on SPARK-26412: --- [~WeichenXu123] I updated the description. > Allow

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames

2019-05-10 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Summary: Allow Pandas UDF to take an iterator of pd.DataFrames (was: Allow Pandas UDF to

[jira] [Assigned] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches

2019-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-26412: - Assignee: Weichen Xu > Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow

[jira] [Commented] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message

2019-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16835780#comment-16835780 ] Xiangrui Meng commented on SPARK-27657: --- This is how JDK format the error string:

[jira] [Comment Edited] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message

2019-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16835780#comment-16835780 ] Xiangrui Meng edited comment on SPARK-27657 at 5/8/19 5:42 PM: --- This is

[jira] [Commented] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message

2019-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16835779#comment-16835779 ] Xiangrui Meng commented on SPARK-27657: --- [~mrbago] Can you send a PR to fix it? >

[jira] [Created] (SPARK-27657) ml.util.Instrumentation.logFailure doesn't log error message

2019-05-08 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27657: - Summary: ml.util.Instrumentation.logFailure doesn't log error message Key: SPARK-27657 URL: https://issues.apache.org/jira/browse/SPARK-27657 Project: Spark

[jira] [Resolved] (SPARK-27588) Fail fast if binary file data source will load a file that is bigger than 2GB

2019-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27588. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24483

[jira] [Resolved] (SPARK-27472) Docuement binary file data source in Spark user guide

2019-04-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27472. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24484

[jira] [Assigned] (SPARK-27472) Docuement binary file data source in Spark user guide

2019-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27472: - Assignee: Xiangrui Meng > Docuement binary file data source in Spark user guide >

[jira] [Assigned] (SPARK-27588) Fail fast if binary file data source will load a file that is bigger than 2GB

2019-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27588: - Assignee: Xiangrui Meng > Fail fast if binary file data source will load a file that

[jira] [Created] (SPARK-27588) Fail fast if binary file data source will load a file that is bigger than 2GB

2019-04-28 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27588: - Summary: Fail fast if binary file data source will load a file that is bigger than 2GB Key: SPARK-27588 URL: https://issues.apache.org/jira/browse/SPARK-27588

[jira] [Resolved] (SPARK-27534) Do not load `content` column in binary data source if it is not selected

2019-04-28 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27534. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24473

[jira] [Created] (SPARK-27569) Pandas UDF prefetches Arrow batches in the queue while executing the current batch

2019-04-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27569: - Summary: Pandas UDF prefetches Arrow batches in the queue while executing the current batch Key: SPARK-27569 URL: https://issues.apache.org/jira/browse/SPARK-27569

[jira] [Assigned] (SPARK-27312) PropertyGraph <-> GraphX conversions

2019-04-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27312: - Assignee: Weichen Xu > PropertyGraph <-> GraphX conversions >

[jira] [Assigned] (SPARK-27300) Create the new graph projects in Spark and set up build/test

2019-04-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27300: - Assignee: Martin Junghanns (was: Xiangrui Meng) > Create the new graph projects in

[jira] [Assigned] (SPARK-27300) Create the new graph projects in Spark and set up build/test

2019-04-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27300: - Assignee: Xiangrui Meng > Create the new graph projects in Spark and set up build/test

[jira] [Assigned] (SPARK-27534) Do not load `content` column in binary data source if it is not selected

2019-04-22 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27534: - Assignee: Weichen Xu > Do not load `content` column in binary data source if it is not

[jira] [Created] (SPARK-27534) Do not load `content` column in binary data source if it is not selected

2019-04-21 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27534: - Summary: Do not load `content` column in binary data source if it is not selected Key: SPARK-27534 URL: https://issues.apache.org/jira/browse/SPARK-27534 Project:

[jira] [Comment Edited] (SPARK-25348) Data source for binary files

2019-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16818640#comment-16818640 ] Xiangrui Meng edited comment on SPARK-25348 at 4/21/19 7:49 PM: I

[jira] [Resolved] (SPARK-27473) Support filter push down for status fields in binary file data source

2019-04-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27473. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24387

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Summary: Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches for the

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Summary: Allow Pandas UDF to take an iterator of pd.DataFrames or Arrow batches (was: Allow

[jira] [Comment Edited] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822521#comment-16822521 ] Xiangrui Meng edited comment on SPARK-26412 at 4/20/19 5:13 PM:

[jira] [Comment Edited] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822521#comment-16822521 ] Xiangrui Meng edited comment on SPARK-26412 at 4/20/19 5:13 PM:

[jira] [Updated] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-26412: -- Description: Pandas UDF is the ideal connection between PySpark and DL model inference

[jira] [Commented] (SPARK-26412) Allow Pandas UDF to take an iterator of pd.DataFrames for the entire partition

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822521#comment-16822521 ] Xiangrui Meng commented on SPARK-26412: --- [~bryanc] It handles the data exchange for DL model

[jira] [Comment Edited] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng edited comment on SPARK-27396 at 4/20/19 5:03 PM:

[jira] [Comment Edited] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng edited comment on SPARK-27396 at 4/20/19 5:02 PM:

[jira] [Comment Edited] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng edited comment on SPARK-27396 at 4/20/19 5:01 PM:

[jira] [Commented] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822517#comment-16822517 ] Xiangrui Meng commented on SPARK-27396: --- [~revans2] Thanks for clarifying the proposal! If your

[jira] [Commented] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2019-04-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822367#comment-16822367 ] Xiangrui Meng commented on SPARK-27396: --- [~revans2] What would end users do with public APIs for

[jira] [Commented] (SPARK-27473) Support filter push down for status fields in binary file data source

2019-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16820500#comment-16820500 ] Xiangrui Meng commented on SPARK-27473: --- Given SPARK-25558 is WIP, we might want to flatten the

[jira] [Assigned] (SPARK-27473) Support filter push down for status fields in binary file data source

2019-04-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27473: - Assignee: Weichen Xu > Support filter push down for status fields in binary file data

[jira] [Resolved] (SPARK-25348) Data source for binary files

2019-04-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-25348. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24354

[jira] [Commented] (SPARK-25348) Data source for binary files

2019-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16818640#comment-16818640 ] Xiangrui Meng commented on SPARK-25348: --- I created two follow-up tasks: * DocumentationL

[jira] [Comment Edited] (SPARK-25348) Data source for binary files

2019-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16818640#comment-16818640 ] Xiangrui Meng edited comment on SPARK-25348 at 4/16/19 5:19 AM: I

[jira] [Created] (SPARK-27473) Support filter push down for status fields in binary file data source

2019-04-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27473: - Summary: Support filter push down for status fields in binary file data source Key: SPARK-27473 URL: https://issues.apache.org/jira/browse/SPARK-27473 Project:

[jira] [Updated] (SPARK-25348) Data source for binary files

2019-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-25348: -- Component/s: (was: ML) > Data source for binary files > > >

[jira] [Created] (SPARK-27472) Docuement binary file data source in Spark user guide

2019-04-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27472: - Summary: Docuement binary file data source in Spark user guide Key: SPARK-27472 URL: https://issues.apache.org/jira/browse/SPARK-27472 Project: Spark

[jira] [Updated] (SPARK-25348) Data source for binary files

2019-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-25348: -- Description: It would be useful to have a data source implementation for binary files, which

[jira] [Assigned] (SPARK-27454) Spark image datasource fail when encounter some illegal images

2019-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-27454: - Assignee: Weichen Xu > Spark image datasource fail when encounter some illegal images

[jira] [Resolved] (SPARK-27454) Spark image datasource fail when encounter some illegal images

2019-04-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-27454. --- Resolution: Fixed Fix Version/s: 3.0.0 > Spark image datasource fail when encounter

[jira] [Commented] (SPARK-25348) Data source for binary files

2019-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16813834#comment-16813834 ] Xiangrui Meng commented on SPARK-25348: --- Sampling could be supported later. > Data source for

[jira] [Updated] (SPARK-25348) Data source for binary files

2019-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-25348: -- Description: It would be useful to have a data source implementation for binary files, which

[jira] [Commented] (SPARK-25348) Data source for binary files

2019-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16813832#comment-16813832 ] Xiangrui Meng commented on SPARK-25348: --- Updated the description and proposed APIs. > Data source

[jira] [Updated] (SPARK-25348) Data source for binary files

2019-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-25348: -- Description: It would be useful to have a data source implementation for binary files, which

[jira] [Assigned] (SPARK-25348) Data source for binary files

2019-04-09 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-25348: - Assignee: Weichen Xu > Data source for binary files > > >

[jira] [Commented] (SPARK-27363) Mesos support for GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16809511#comment-16809511 ] Xiangrui Meng commented on SPARK-27363: --- [~felixcheung] [~srowen] Anyone you recommend to lead

[jira] [Updated] (SPARK-27363) Mesos support for GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27363: -- Description: Design and implement Mesos support for GPU-aware scheduling. > Mesos support for

[jira] [Updated] (SPARK-27362) Kubernetes support for GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27362: -- Shepherd: Yinan Li > Kubernetes support for GPU-aware scheduling >

[jira] [Updated] (SPARK-27365) Spark Jenkins supports testing GPU-aware scheduling features

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27365: -- Description: Upgrade Spark Jenkins to install GPU cards and run GPU integration tests

[jira] [Updated] (SPARK-27361) YARN support for GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27361: -- Description: Design and implement YARN support for GPU-aware scheduling: * User can request

[jira] [Updated] (SPARK-27361) YARN support for GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27361: -- Description: Design and implement YARN support for GPU-aware scheduling: * User can request

[jira] [Updated] (SPARK-27377) Upgrade YARN to 3.1.2+ to support GPU

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27377: -- Description: This task should be covered by SPARK-23710. Just a placeholder here. > Upgrade

[jira] [Updated] (SPARK-27024) Executor interface for cluster managers to support GPU resources

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27024: -- Issue Type: Story (was: Task) > Executor interface for cluster managers to support GPU

[jira] [Updated] (SPARK-24615) SPIP: Accelerator-aware task scheduling for Spark

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-24615: -- Epic Name: GPU-aware Scheduling (was: Support GPU Scheduling) > SPIP: Accelerator-aware task

[jira] [Created] (SPARK-27380) Get and install GPU cards to Jenkins machines

2019-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27380: - Summary: Get and install GPU cards to Jenkins machines Key: SPARK-27380 URL: https://issues.apache.org/jira/browse/SPARK-27380 Project: Spark Issue Type:

[jira] [Created] (SPARK-27381) Design: Spark Jenkins supports GPU integration tests

2019-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27381: - Summary: Design: Spark Jenkins supports GPU integration tests Key: SPARK-27381 URL: https://issues.apache.org/jira/browse/SPARK-27381 Project: Spark Issue

[jira] [Updated] (SPARK-27365) Spark Jenkins supports testing GPU-aware scheduling features

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27365: -- Description: Upgrade Spark Jenkins to install GPU cards and run GPU integration tests

[jira] [Created] (SPARK-27379) YARN passes GPU info to Spark executor

2019-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27379: - Summary: YARN passes GPU info to Spark executor Key: SPARK-27379 URL: https://issues.apache.org/jira/browse/SPARK-27379 Project: Spark Issue Type:

[jira] [Created] (SPARK-27378) spark-submit requests GPUs in YARN mode

2019-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27378: - Summary: spark-submit requests GPUs in YARN mode Key: SPARK-27378 URL: https://issues.apache.org/jira/browse/SPARK-27378 Project: Spark Issue Type:

[jira] [Created] (SPARK-27377) Upgrade YARN to 3.1.2+ to support GPU

2019-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27377: - Summary: Upgrade YARN to 3.1.2+ to support GPU Key: SPARK-27377 URL: https://issues.apache.org/jira/browse/SPARK-27377 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-27361) YARN support for GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27361: -- Description: Design and implement YARN support for GPU-aware scheduling: * User can request

[jira] [Created] (SPARK-27376) Design: YARN supports Spark GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27376: - Summary: Design: YARN supports Spark GPU-aware scheduling Key: SPARK-27376 URL: https://issues.apache.org/jira/browse/SPARK-27376 Project: Spark Issue

[jira] [Updated] (SPARK-27361) YARN support for GPU-aware scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27361: -- Description: Design and implement YARN support for GPU-aware scheduling: * User can request

[jira] [Updated] (SPARK-27366) Spark scheduler internal changes to support GPU scheduling

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27366: -- Description: Update Spark job scheduler to support accelerator resource requests submitted at

[jira] [Created] (SPARK-27374) Fetch assigned resources from TaskContext

2019-04-03 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-27374: - Summary: Fetch assigned resources from TaskContext Key: SPARK-27374 URL: https://issues.apache.org/jira/browse/SPARK-27374 Project: Spark Issue Type:

[jira] [Updated] (SPARK-27365) Spark Jenkins supports testing GPU-aware scheduling features

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27365: -- Summary: Spark Jenkins supports testing GPU-aware scheduling features (was: Spark Jenkins to

[jira] [Updated] (SPARK-27365) Spark Jenkins to support testing GPU-aware scheduling features

2019-04-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-27365: -- Summary: Spark Jenkins to support testing GPU-aware scheduling features (was: Spark Jenkins

<    1   2   3   4   5   6   7   8   9   10   >