[jira] [Updated] (SPARK-42374) User-facing documentation
[ https://issues.apache.org/jira/browse/SPARK-42374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42374: -- Summary: User-facing documentation (was: User-facing documentaiton) > User-facing documentation > - > > Key: SPARK-42374 > URL: https://issues.apache.org/jira/browse/SPARK-42374 > Project: Spark > Issue Type: Documentation > Components: Connect >Affects Versions: 3.4.0 >Reporter: Hyukjin Kwon >Assignee: Haejoon Lee >Priority: Major > > Should provide the user-facing documentation so end users how to use Spark > Connect. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-42797) Spark Connect - Grammatical improvements to Spark Overview and Spark Connect Overview doc pages
Allan Folting created SPARK-42797: - Summary: Spark Connect - Grammatical improvements to Spark Overview and Spark Connect Overview doc pages Key: SPARK-42797 URL: https://issues.apache.org/jira/browse/SPARK-42797 Project: Spark Issue Type: Documentation Components: Spark Core Affects Versions: 3.4.0 Reporter: Allan Folting Grammatical improvements, this is a follow-up to this ticket: Introducing Spark Connect on the main page and adding Spark Connect Overview page https://issues.apache.org/jira/browse/SPARK-42496 -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-42496) Introducing Spark Connect on the main page and adding Spark Connect Overview page
[ https://issues.apache.org/jira/browse/SPARK-42496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42496: -- Summary: Introducing Spark Connect on the main page and adding Spark Connect Overview page (was: Introducting Spark Connect at main page) > Introducing Spark Connect on the main page and adding Spark Connect Overview > page > - > > Key: SPARK-42496 > URL: https://issues.apache.org/jira/browse/SPARK-42496 > Project: Spark > Issue Type: Sub-task > Components: Connect, Documentation >Affects Versions: 3.4.0 >Reporter: Haejoon Lee >Assignee: Haejoon Lee >Priority: Major > Fix For: 3.4.1 > > > We should document the introduction of Spark Connect at PySpark main > documentation page to give a summary to users. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-42773) Minor grammatical change to "Supports Spark Connect" message
Allan Folting created SPARK-42773: - Summary: Minor grammatical change to "Supports Spark Connect" message Key: SPARK-42773 URL: https://issues.apache.org/jira/browse/SPARK-42773 Project: Spark Issue Type: Documentation Components: PySpark Affects Versions: 3.4.0 Reporter: Allan Folting Changing "Support Spark Connect" to "Supports Spark Connect" in the 3.4.0 version change message which is also used in the documentation: .. versionchanged:: 3.4.0 Supports Spark Connect. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-42642) Make Python the first code example tab in the Spark documentation
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Summary: Make Python the first code example tab in the Spark documentation (was: Make Python the first code example tab) > Make Python the first code example tab in the Spark documentation > - > > Key: SPARK-42642 > URL: https://issues.apache.org/jira/browse/SPARK-42642 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > Attachments: Screenshot 2023-03-01 at 8.10.08 PM.png, Screenshot > 2023-03-01 at 8.10.22 PM.png > > > Python is the most approachable and most popular language so it should be the > default language in code examples so this makes Python the first code example > tab consistently across the documentation, where applicable. > This is continuing the work started with: > https://issues.apache.org/jira/browse/SPARK-42493 > where these two pages were updated: > [https://spark.apache.org/docs/latest/sql-getting-started.html] > [https://spark.apache.org/docs/latest/sql-data-sources-load-save-functions.html] > > Pages being updated now: > [https://spark.apache.org/docs/latest/ml-classification-regression.html] > [https://spark.apache.org/docs/latest/ml-clustering.html] > [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/ml-datasource.html] > [https://spark.apache.org/docs/latest/ml-features.html] > [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/ml-migration-guide.html] > [https://spark.apache.org/docs/latest/ml-pipeline.html] > [https://spark.apache.org/docs/latest/ml-statistics.html] > [https://spark.apache.org/docs/latest/ml-tuning.html] > > [https://spark.apache.org/docs/latest/mllib-clustering.html] > [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/mllib-data-types.html] > [https://spark.apache.org/docs/latest/mllib-decision-tree.html] > [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] > [https://spark.apache.org/docs/latest/mllib-ensembles.html] > [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] > [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] > [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] > [https://spark.apache.org/docs/latest/mllib-linear-methods.html] > [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] > [https://spark.apache.org/docs/latest/mllib-statistics.html] > > [https://spark.apache.org/docs/latest/quick-start.html] > > [https://spark.apache.org/docs/latest/rdd-programming-guide.html] > > [https://spark.apache.org/docs/latest/sql-data-sources-avro.html] > [https://spark.apache.org/docs/latest/sql-data-sources-binaryFile.html] > [https://spark.apache.org/docs/latest/sql-data-sources-csv.html] > [https://spark.apache.org/docs/latest/sql-data-sources-generic-options.html] > [https://spark.apache.org/docs/latest/sql-data-sources-hive-tables.html] > [https://spark.apache.org/docs/latest/sql-data-sources-jdbc.html] > [https://spark.apache.org/docs/latest/sql-data-sources-json.html] > [https://spark.apache.org/docs/latest/sql-data-sources-parquet.html] > sql-data-sources-protobuf.html > [https://spark.apache.org/docs/latest/sql-data-sources-text.html] > [https://spark.apache.org/docs/latest/sql-migration-guide.html] > [https://spark.apache.org/docs/latest/sql-performance-tuning.html] > [https://spark.apache.org/docs/latest/sql-ref-datatypes.html] > > [https://spark.apache.org/docs/latest/streaming-kinesis-integration.html] > [https://spark.apache.org/docs/latest/streaming-programming-guide.html] > > [https://spark.apache.org/docs/latest/structured-streaming-kafka-integration.html] > [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] > > > > > > > > > > > > > > > -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Description: Python is the most approachable and most popular language so it should be the default language in code examples so this makes Python the first code example tab consistently across the documentation, where applicable. This is continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 where these two pages were updated: [https://spark.apache.org/docs/latest/sql-getting-started.html] [https://spark.apache.org/docs/latest/sql-data-sources-load-save-functions.html] Pages being updated now: [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-migration-guide.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-decision-tree.html] [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] [https://spark.apache.org/docs/latest/mllib-ensembles.html] [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html] [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/quick-start.html] [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/sql-data-sources-avro.html] [https://spark.apache.org/docs/latest/sql-data-sources-binaryFile.html] [https://spark.apache.org/docs/latest/sql-data-sources-csv.html] [https://spark.apache.org/docs/latest/sql-data-sources-generic-options.html] [https://spark.apache.org/docs/latest/sql-data-sources-hive-tables.html] [https://spark.apache.org/docs/latest/sql-data-sources-jdbc.html] [https://spark.apache.org/docs/latest/sql-data-sources-json.html] [https://spark.apache.org/docs/latest/sql-data-sources-parquet.html] sql-data-sources-protobuf.html [https://spark.apache.org/docs/latest/sql-data-sources-text.html] [https://spark.apache.org/docs/latest/sql-migration-guide.html] [https://spark.apache.org/docs/latest/sql-performance-tuning.html] [https://spark.apache.org/docs/latest/sql-ref-datatypes.html] [https://spark.apache.org/docs/latest/streaming-kinesis-integration.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-kafka-integration.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] was: Python is the most approachable and most popular language so it should be the default language in code examples so this makes Python the first code example tab consistently across the documentation, where applicable. This is continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 where these two pages were updated: [https://spark.apache.org/docs/latest/sql-getting-started.html] [https://spark.apache.org/docs/latest/sql-data-sources-load-save-functions.html] Pages being updated now: [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-migration-guide.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-data-types.html]
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Description: Python is the most approachable and most popular language so it should be the default language in code examples so this makes Python the first code example tab consistently across the documentation, where applicable. This is continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 where these two pages were updated: [https://spark.apache.org/docs/latest/sql-getting-started.html] [https://spark.apache.org/docs/latest/sql-data-sources-load-save-functions.html] Pages being updated now: [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-migration-guide.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-decision-tree.html] [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] [https://spark.apache.org/docs/latest/mllib-ensembles.html] [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html] [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/quick-start.html] [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/sql-data-sources-avro.html] [https://spark.apache.org/docs/latest/sql-data-sources-binaryFile.html] [https://spark.apache.org/docs/latest/sql-data-sources-csv.html] [https://spark.apache.org/docs/latest/sql-data-sources-generic-options.html] [https://spark.apache.org/docs/latest/sql-data-sources-hive-tables.html] [https://spark.apache.org/docs/latest/sql-data-sources-jdbc.html] [https://spark.apache.org/docs/latest/sql-data-sources-json.html] [https://spark.apache.org/docs/latest/sql-data-sources-parquet.html] sql-data-sources-protobuf.md [https://spark.apache.org/docs/latest/sql-data-sources-text.html] [https://spark.apache.org/docs/latest/sql-migration-guide.html] [https://spark.apache.org/docs/latest/sql-performance-tuning.html] [https://spark.apache.org/docs/latest/sql-ref-datatypes.html] [https://spark.apache.org/docs/latest/streaming-kinesis-integration.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-kafka-integration.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] was: Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. Pages being updated: [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/ml-migration-guide.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html]
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Description: Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. Pages being updated: [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/ml-migration-guide.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html] [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] [https://spark.apache.org/docs/latest/mllib-decision-tree.html] [https://spark.apache.org/docs/latest/mllib-ensembles.html] [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] [https://spark.apache.org/docs/latest/quick-start.html] was: Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. Pages being updated: [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html] [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] [https://spark.apache.org/docs/latest/mllib-decision-tree.html] [https://spark.apache.org/docs/latest/mllib-ensembles.html] [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] > Make Python the first code example tab > -- > > Key: SPARK-42642 > URL: https://issues.apache.org/jira/browse/SPARK-42642 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > Attachments: Screenshot 2023-03-01 at 8.10.08 PM.png, Screenshot > 2023-03-01 at 8.10.22 PM.png > > > Python is the most approachable and most popular language so it should be the > default language in code examples. > Continuing the work started with: >
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Attachment: Screenshot 2023-03-01 at 8.10.22 PM.png > Make Python the first code example tab > -- > > Key: SPARK-42642 > URL: https://issues.apache.org/jira/browse/SPARK-42642 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > Attachments: Screenshot 2023-03-01 at 8.10.08 PM.png, Screenshot > 2023-03-01 at 8.10.22 PM.png > > > Python is the most approachable and most popular language so it should be the > default language in code examples. > Continuing the work started with: > https://issues.apache.org/jira/browse/SPARK-42493 > Making Python the first code example tab consistently across the > documentation, where applicable. > Pages being updated: > [https://spark.apache.org/docs/latest/rdd-programming-guide.html] > [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] > [https://spark.apache.org/docs/latest/streaming-programming-guide.html] > [https://spark.apache.org/docs/latest/ml-statistics.html] > [https://spark.apache.org/docs/latest/ml-datasource.html] > [https://spark.apache.org/docs/latest/ml-pipeline.html] > [https://spark.apache.org/docs/latest/ml-features.html] > [https://spark.apache.org/docs/latest/ml-classification-regression.html] > [https://spark.apache.org/docs/latest/ml-clustering.html] > [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/ml-tuning.html] > [https://spark.apache.org/docs/latest/mllib-data-types.html] > [https://spark.apache.org/docs/latest/mllib-statistics.html] > [https://spark.apache.org/docs/latest/mllib-linear-methods.html] > [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] > [https://spark.apache.org/docs/latest/mllib-decision-tree.html] > [https://spark.apache.org/docs/latest/mllib-ensembles.html] > [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] > [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/mllib-clustering.html] > [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] > [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] > [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] > > > > -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Attachment: Screenshot 2023-03-01 at 8.10.08 PM.png > Make Python the first code example tab > -- > > Key: SPARK-42642 > URL: https://issues.apache.org/jira/browse/SPARK-42642 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > Attachments: Screenshot 2023-03-01 at 8.10.08 PM.png, Screenshot > 2023-03-01 at 8.10.22 PM.png > > > Python is the most approachable and most popular language so it should be the > default language in code examples. > Continuing the work started with: > https://issues.apache.org/jira/browse/SPARK-42493 > Making Python the first code example tab consistently across the > documentation, where applicable. > Pages being updated: > [https://spark.apache.org/docs/latest/rdd-programming-guide.html] > [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] > [https://spark.apache.org/docs/latest/streaming-programming-guide.html] > [https://spark.apache.org/docs/latest/ml-statistics.html] > [https://spark.apache.org/docs/latest/ml-datasource.html] > [https://spark.apache.org/docs/latest/ml-pipeline.html] > [https://spark.apache.org/docs/latest/ml-features.html] > [https://spark.apache.org/docs/latest/ml-classification-regression.html] > [https://spark.apache.org/docs/latest/ml-clustering.html] > [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/ml-tuning.html] > [https://spark.apache.org/docs/latest/mllib-data-types.html] > [https://spark.apache.org/docs/latest/mllib-statistics.html] > [https://spark.apache.org/docs/latest/mllib-linear-methods.html] > [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] > [https://spark.apache.org/docs/latest/mllib-decision-tree.html] > [https://spark.apache.org/docs/latest/mllib-ensembles.html] > [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] > [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/mllib-clustering.html] > [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] > [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] > [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] > > > > -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Description: Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. Pages being updated: [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html] [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] [https://spark.apache.org/docs/latest/mllib-decision-tree.html] [https://spark.apache.org/docs/latest/mllib-ensembles.html] [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] was: Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. Pages being updated: [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html] [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] [https://spark.apache.org/docs/latest/mllib-decision-tree.html] [https://spark.apache.org/docs/latest/mllib-ensembles.html] [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] > Make Python the first code example tab > -- > > Key: SPARK-42642 > URL: https://issues.apache.org/jira/browse/SPARK-42642 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > > Python is the most approachable and most popular language so it should be the > default language in code examples. > Continuing the work started with: > https://issues.apache.org/jira/browse/SPARK-42493 > Making Python the first code example tab consistently across the > documentation, where applicable. > Pages being updated: >
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Description: Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. Pages being updated: [https://spark.apache.org/docs/latest/rdd-programming-guide.html] [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] [https://spark.apache.org/docs/latest/streaming-programming-guide.html] [https://spark.apache.org/docs/latest/ml-statistics.html] [https://spark.apache.org/docs/latest/ml-datasource.html] [https://spark.apache.org/docs/latest/ml-pipeline.html] [https://spark.apache.org/docs/latest/ml-features.html] [https://spark.apache.org/docs/latest/ml-classification-regression.html] [https://spark.apache.org/docs/latest/ml-clustering.html] [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/ml-tuning.html] [https://spark.apache.org/docs/latest/mllib-data-types.html] [https://spark.apache.org/docs/latest/mllib-statistics.html] [https://spark.apache.org/docs/latest/mllib-linear-methods.html] [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] [https://spark.apache.org/docs/latest/mllib-decision-tree.html] [https://spark.apache.org/docs/latest/mllib-ensembles.html] [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] [https://spark.apache.org/docs/latest/mllib-clustering.html] [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] was: Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. > Make Python the first code example tab > -- > > Key: SPARK-42642 > URL: https://issues.apache.org/jira/browse/SPARK-42642 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > > Python is the most approachable and most popular language so it should be the > default language in code examples. > Continuing the work started with: > https://issues.apache.org/jira/browse/SPARK-42493 > Making Python the first code example tab consistently across the > documentation, where applicable. > Pages being updated: > [https://spark.apache.org/docs/latest/rdd-programming-guide.html] > [https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html] > [https://spark.apache.org/docs/latest/streaming-programming-guide.html] > [https://spark.apache.org/docs/latest/ml-statistics.html] > [https://spark.apache.org/docs/latest/ml-datasource.html] > [https://spark.apache.org/docs/latest/ml-pipeline.html] > [https://spark.apache.org/docs/latest/ml-features.html] > [https://spark.apache.org/docs/latest/ml-classification-regression.html] > [https://spark.apache.org/docs/latest/ml-clustering.html] > [https://spark.apache.org/docs/latest/ml-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/ml-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/ml-tuning.html] > [https://spark.apache.org/docs/latest/mllib-data-types.html] > [https://spark.apache.org/docs/latest/mllib-statistics.html] > [https://spark.apache.org/docs/latest/mllib-linear-methods.html] > [https://spark.apache.org/docs/latest/mllib-naive-bayes.html] > [https://spark.apache.org/docs/latest/mllib-decision-tree.html] > [https://spark.apache.org/docs/latest/mllib-ensembles.html] > [https://spark.apache.org/docs/latest/mllib-isotonic-regression.html] > [https://spark.apache.org/docs/latest/mllib-collaborative-filtering.html] > [https://spark.apache.org/docs/latest/mllib-clustering.html] > [https://spark.apache.org/docs/latest/mllib-dimensionality-reduction.html] > [https://spark.apache.org/docs/latest/mllib-feature-extraction.html] > [https://spark.apache.org/docs/latest/mllib-frequent-pattern-mining.html] > [https://spark.apache.org/docs/latest/mllib-evaluation-metrics.html] > > -- This message was sent by Atlassian Jira
[jira] [Updated] (SPARK-42642) Make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42642: -- Summary: Make Python the first code example tab (was: Make Python the first code example tab - ) > Make Python the first code example tab > -- > > Key: SPARK-42642 > URL: https://issues.apache.org/jira/browse/SPARK-42642 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > > Python is the most approachable and most popular language so it should be the > default language in code examples. > Continuing the work started with: > https://issues.apache.org/jira/browse/SPARK-42493 > Making Python the first code example tab consistently across the > documentation, where applicable. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-42642) Make Python the first code example tab -
Allan Folting created SPARK-42642: - Summary: Make Python the first code example tab - Key: SPARK-42642 URL: https://issues.apache.org/jira/browse/SPARK-42642 Project: Spark Issue Type: Documentation Components: Spark Core Affects Versions: 3.5.0 Reporter: Allan Folting Python is the most approachable and most popular language so it should be the default language in code examples. Continuing the work started with: https://issues.apache.org/jira/browse/SPARK-42493 Making Python the first code example tab consistently across the documentation, where applicable. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-42493) Spark SQL, DataFrames and Datasets Guide - make Python the first code example tab
[ https://issues.apache.org/jira/browse/SPARK-42493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42493: -- Summary: Spark SQL, DataFrames and Datasets Guide - make Python the first code example tab (was: Spark SQL, DataFrames and Datasets Guide - make Python the first example tab) > Spark SQL, DataFrames and Datasets Guide - make Python the first code example > tab > - > > Key: SPARK-42493 > URL: https://issues.apache.org/jira/browse/SPARK-42493 > Project: Spark > Issue Type: Documentation > Components: Spark Core >Affects Versions: 3.5.0 >Reporter: Allan Folting >Priority: Major > > Python is the easiest approachable and most popular language so it should be > the primary language in examples etc. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-42493) Spark SQL, DataFrames and Datasets Guide - make Python the first example tab
Allan Folting created SPARK-42493: - Summary: Spark SQL, DataFrames and Datasets Guide - make Python the first example tab Key: SPARK-42493 URL: https://issues.apache.org/jira/browse/SPARK-42493 Project: Spark Issue Type: Documentation Components: Spark Core Affects Versions: 3.4.0 Reporter: Allan Folting Python is the easiest approachable and most popular language so it should be the primary language in examples etc. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-42456) Consolidating the PySpark version upgrade note pages into a single page to make it easier to read
[ https://issues.apache.org/jira/browse/SPARK-42456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allan Folting updated SPARK-42456: -- Description: Creating a new PySpark migration guide sub page and consolidating the existing 9 separate pages into this one new page. This makes it easier to take a look across multiple version upgrades by simply scrolling on the page. Also, this is similar to the Spark Core Migration Guide page here: [https://spark.apache.org/docs/latest/core-migration-guide.html] Updating the existing main Migration Guide page to point to this new sub page and also making some minor language updates to help readers. was: Creating a new PySpark migration guide and consolidating the existing 9 separate pages into this one new page. This makes it easier to take a look across multiple version upgrades by simply scrolling on the page. Also, this is similar to the Spark Core Migration Guide page here: [https://spark.apache.org/docs/latest/core-migration-guide.html] Updating the existing main Migration Guide page to point to this new sub page and also making some minor language updates to help readers. > Consolidating the PySpark version upgrade note pages into a single page to > make it easier to read > - > > Key: SPARK-42456 > URL: https://issues.apache.org/jira/browse/SPARK-42456 > Project: Spark > Issue Type: Documentation > Components: PySpark >Affects Versions: 3.4.0 >Reporter: Allan Folting >Priority: Major > > Creating a new PySpark migration guide sub page and consolidating the > existing 9 separate pages into this one new page. This makes it easier to > take a look across multiple version upgrades by simply scrolling on the page. > Also, this is similar to the Spark Core Migration Guide page here: > [https://spark.apache.org/docs/latest/core-migration-guide.html] > > Updating the existing main Migration Guide page to point to this new sub page > and also making some minor language updates to help readers. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-42456) Consolidating the PySpark version upgrade note pages into a single page to make it easier to read
Allan Folting created SPARK-42456: - Summary: Consolidating the PySpark version upgrade note pages into a single page to make it easier to read Key: SPARK-42456 URL: https://issues.apache.org/jira/browse/SPARK-42456 Project: Spark Issue Type: Documentation Components: PySpark Affects Versions: 3.4.0 Reporter: Allan Folting Creating a new PySpark migration guide and consolidating the existing 9 separate pages into this one new page. This makes it easier to take a look across multiple version upgrades by simply scrolling on the page. Also, this is similar to the Spark Core Migration Guide page here: [https://spark.apache.org/docs/latest/core-migration-guide.html] Updating the existing main Migration Guide page to point to this new sub page and also making some minor language updates to help readers. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-42446) Updating PySpark documentation to enhance usability
Allan Folting created SPARK-42446: - Summary: Updating PySpark documentation to enhance usability Key: SPARK-42446 URL: https://issues.apache.org/jira/browse/SPARK-42446 Project: Spark Issue Type: Documentation Components: PySpark Affects Versions: 3.4.0 Reporter: Allan Folting Updates to the PySpark documentation web site: * Fixing typo on the Getting Started page (Version => Versions) * Capitalizing "In/Out" in the DataFrame Quick Start notebook * Adding "(Legacy)" to the Spark Streaming heading on the Spark Streaming page * Reorganizing the User Guide page to list PySpark guides first + minor language updates -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Created] (SPARK-42418) Updating PySpark documentation to support new users better
Allan Folting created SPARK-42418: - Summary: Updating PySpark documentation to support new users better Key: SPARK-42418 URL: https://issues.apache.org/jira/browse/SPARK-42418 Project: Spark Issue Type: Documentation Components: PySpark Affects Versions: 3.4.0 Reporter: Allan Folting This is the first of a series of updates to the PySpark documentation site to better guide new users on what to use and when as well as help improve discoverability of related pages/resources. * Add "Overview" to the top navigation bar to make it easy to get back to the main page (clicking the logo is not super discoverable) * Break architecture image into separate, clickable parts for easy navigation to information for each part * Added links to related topics under each area description * Added date and version to the page -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org