user

Messages by Thread

Pyspark DataFrame.drop wrong type hints Oliver Beagley
- Pyspark DataFrame.drop wrong type hints Oliver Beagley
Help in understanding Exchange in Spark UI Dhruv Singla
- Re: Help in understanding Exchange in Spark UI Mich Talebzadeh
Spark Decommission Rajesh Mahindra
- Re: Spark Decommission Khaldi, Ahmed
- Re: Spark Decommission Rajesh Mahindra
[K8S] Divergense in dockerfiles between official repositories. Andrei L
Update mode in spark structured streaming Om Prakash
- Re: Update mode in spark structured streaming Mich Talebzadeh
Unable to load MongoDB atlas data via PySpark because of BsonString error Perez
- Re: Unable to load MongoDB atlas data via PySpark because of BsonString error Perez
OOM issue in Spark Driver Karthick Nk
- Re: OOM issue in Spark Driver Andrzej Zera
- Re: Re: OOM issue in Spark Driver Mich Talebzadeh
7368396 - Apache Spark 3.5.1 (Support) SANTOS SOUZA, ALEX
- Re: 7368396 - Apache Spark 3.5.1 (Support) Sadha Chilukoori
Kubernetes cluster: change log4j configuration using uploaded `--files` Jennifer Wirth
- Re: Kubernetes cluster: change log4j configuration using uploaded `--files` Mich Talebzadeh
[SPARK-48423] Unable to save ML Pipeline to azure blob storage Chhavi Bansal
- Re: [SPARK-48423] Unable to save ML Pipeline to azure blob storage Chhavi Bansal
[SPARK-48463] Mllib Feature transformer failing with nested dataset (Dot notation) Chhavi Bansal
- Re: [SPARK-48463] Mllib Feature transformer failing with nested dataset (Dot notation) Someshwar Kale
- Re: [SPARK-48463] Mllib Feature transformer failing with nested dataset (Dot notation) Chhavi Bansal
- Re: [SPARK-48463] Mllib Feature transformer failing with nested dataset (Dot notation) Someshwar Kale
Do we need partitioning while loading data from JDBC sources? Perez
- Re: Do we need partitioning while loading data from JDBC sources? Mich Talebzadeh
- Re: Do we need partitioning while loading data from JDBC sources? Perez
- Re: Do we need partitioning while loading data from JDBC sources? Mich Talebzadeh
- Re: Do we need partitioning while loading data from JDBC sources? Perez
- Re: Do we need partitioning while loading data from JDBC sources? Perez
- Re: Do we need partitioning while loading data from JDBC sources? Gourav Sengupta
Inquiry Regarding Security Compliance of Apache Spark Docker Image Tonmoy Sagar
Classification request VARGA, Sara
- Re: Classification request Artemis User
- Re: Classification request Dirk-Willem van Gulik
[ANNOUNCE] Announcing Apache Spark 4.0.0-preview1 Wenchen Fan
[ANNOUNCE] Apache Kyuubi released 1.9.1 Cheng Pan
Terabytes data processing via Glue Perez
- Re: Terabytes data processing via Glue Perez
- Re: Terabytes data processing via Glue Russell Jurney
- Re: Terabytes data processing via Glue Perez
[apache-spark][spark-dataframe] DataFrameWriter.partitionBy does not guarantee previous sort result leeyc0
[Spark on k8s] A issue of k8s resource creation order Tao Yang
Tox and Pyspark Perez
Spark Protobuf Deserialization Satyam Raj
- Re: Spark Protobuf Deserialization Sandish Kumar HN
[Spark SQL]: Does Spark support processing records with timestamp NULL in stateful streaming? Juan Casse
- Re: [Spark SQL]: Does Spark support processing records with timestamp NULL in stateful streaming? Mich Talebzadeh
OOM concern Perez
- Re: OOM concern Meena Rajani
- Re: OOM concern Russell Jurney
- Re: OOM concern Perez
- Re: OOM concern Mich Talebzadeh
- Re: OOM concern Perez
- Re: OOM concern Mich Talebzadeh
- Re: OOM concern Russell Jurney
- Re: OOM concern Perez
Subject: [Spark SQL] [Debug] Spark Memory Issue with DataFrame Processing Gaurav Madan
- Re: Subject: [Spark SQL] [Debug] Spark Memory Issue with DataFrame Processing Mich Talebzadeh
- Re: Subject: [Spark SQL] [Debug] Spark Memory Issue with DataFrame Processing Shay Elbaz
Can Spark Catalog Perform Multimodal Database Query Analysis ????
- Re: Can Spark Catalog Perform Multimodal Database Query Analysis Mich Talebzadeh
BUG :: UI Spark Prem Sahoo
- Re: BUG :: UI Spark Prem Sahoo
- Re: BUG :: UI Spark Prem Sahoo
- Re: BUG :: UI Spark Sathi Chowdhury
- Re: BUG :: UI Spark Mich Talebzadeh
- Re: BUG :: UI Spark Mich Talebzadeh
- Re: BUG :: UI Spark Mich Talebzadeh
[s3a] Spark is not reading s3 object content Amin Mosayyebzadeh
- Re: [s3a] Spark is not reading s3 object content Mich Talebzadeh
- Re: [s3a] Spark is not reading s3 object content Amin Mosayyebzadeh
- Re: [s3a] Spark is not reading s3 object content Mich Talebzadeh
- Re: [s3a] Spark is not reading s3 object content Amin Mosayyebzadeh
- Re: [s3a] Spark is not reading s3 object content Mich Talebzadeh
- Re: [s3a] Spark is not reading s3 object content Amin Mosayyebzadeh
- Re: [s3a] Spark is not reading s3 object content Mich Talebzadeh
- Re: [s3a] Spark is not reading s3 object content Amin Mosayyebzadeh
Remote File change detection in S3 when spark queries are running and parquet files in S3 changes Raghvendra Yadav
[ANNOUNCE] Apache Celeborn 0.4.1 available Nicholas Jiang
Dstream HasOffsetRanges equivalent in Structured streaming Anil Dasari
- Re: Dstream HasOffsetRanges equivalent in Structured streaming [email protected]
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Mich Talebzadeh
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Tathagata Das
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Anil Dasari
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Tathagata Das
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Anil Dasari
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Anil Dasari
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Anil Dasari
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Mich Talebzadeh
- Re: Dstream HasOffsetRanges equivalent in Structured streaming Mich Talebzadeh
Re: EXT: Dual Write to HDFS and MinIO in faster way Prem Sahoo
- Re: Re: EXT: Dual Write to HDFS and MinIO in faster way [email protected]
- Re: Re: EXT: Dual Write to HDFS and MinIO in faster way Prem Sahoo
- Re: Re: EXT: Dual Write to HDFS and MinIO in faster way Gera Shegalov
- Re: Re: EXT: Dual Write to HDFS and MinIO in faster way Subhasis Mukherjee
A handy tool called spark-column-analyser Mich Talebzadeh
- Re: A handy tool called spark-column-analyser Mich Talebzadeh
- Re: A handy tool called spark-column-analyser [email protected]
Request for Assistance: Adding User Authentication to Apache Spark Application NIKHIL RAJ SHRIVASTAVA
How to provide a Zstd "training mode" dictionary object Saha, Daniel
Query Regarding UDF Support in Spark Connect with Kubernetes as Cluster Manager Nagatomi Yasukazu
Display a warning in EMR welcome screen Abhishek Basu
Spark 3.5.x on Java 21? Stephen Coy
Spark not creating staging dir for insertInto partitioned table Sanskar Modi
[Spark Streaming]: Save the records that are dropped by watermarking in spark structured streaming Nandha Kumar
- Re: [Spark Streaming]: Save the records that are dropped by watermarking in spark structured streaming Mich Talebzadeh
Spark Materialized Views: Improve Query Performance and Data Management Mich Talebzadeh
Help needed optimize spark history server performance Vikas Tharyani
Issue with Materialized Views in Spark SQL Mich Talebzadeh
- Re: Issue with Materialized Views in Spark SQL Walaa Eldin Moustafa
- Re: Issue with Materialized Views in Spark SQL Jungtaek Lim
- Re: Issue with Materialized Views in Spark SQL Mich Talebzadeh
- Re: Issue with Materialized Views in Spark SQL Mich Talebzadeh
********Spark streaming issue to Elastic data********** Karthick Nk
- Re: ********Spark streaming issue to Elastic data********** Mich Talebzadeh
- Re: ********Spark streaming issue to Elastic data********** Karthick Nk
- Re: ********Spark streaming issue to Elastic data********** Mich Talebzadeh
Traceback is missing content in pyspark when invoked with UDF Indivar Mishra
spark.sql.shuffle.partitions=auto [email protected]
- Re: spark.sql.shuffle.partitions=auto Mich Talebzadeh
Re: Python for the kids and now PySpark Farshid Ashouri
- Re: Python for the kids and now PySpark Meena Rajani
[Release Question]: Estimate on 3.5.2 release? Paul Gerver
[SparkListener] Accessing classes loaded via the '--packages' option Damien Hawes
DataFrameReader: timestampFormat default value keen
[spark-graphframes]: Generating incorrect edges Nijland, J.G.W. (Jelle, Student M-CS)
- Re: [spark-graphframes]: Generating incorrect edges Mich Talebzadeh
- Re: [spark-graphframes]: Generating incorrect edges Nijland, J.G.W. (Jelle, Student M-CS)
- Re: [spark-graphframes]: Generating incorrect edges Mich Talebzadeh
- Re: [spark-graphframes]: Generating incorrect edges Nijland, J.G.W. (Jelle, Student M-CS)
- Re: [spark-graphframes]: Generating incorrect edges Stephen Coy
- Re: [spark-graphframes]: Generating incorrect edges Mich Talebzadeh
- Re: [spark-graphframes]: Generating incorrect edges Nijland, J.G.W. (Jelle, Student M-CS)
How to add MaxDOP option in spark mssql JDBC Elite
- RE: How to add MaxDOP option in spark mssql JDBC Appel, Kevin
- Re:RE: How to add MaxDOP option in spark mssql JDBC Elite
How to use Structured Streaming in Spark SQL ????
How to access the internal hidden columns of table by spark jdbc casel.chen
Accounting the impact of failures in spark jobs Faiz Halde
StreamingQueryListener integration with Spark native metric sink (JmxSink) Mason Chen
[ANNOUNCE] Apache Spark 3.4.3 released Dongjoon Hyun
[Spark SQL][How-To] Remove builtin function support from Spark Matthew McMillian
- [Spark SQL][How-To] Remove builtin function support from Spark Matthew McMillian
should OutputCommitCoordinator fail stages for authorized committer failures when using s3a optimized committers? Dylan McClelland
[Spark SQL] xxhash64 default seed of 42 confusion Igor Calabria
auto create event log directory if not exist [email protected]
Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Mich Talebzadeh
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Mich Talebzadeh
- Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly. Kidong Lee
Spark column headings, camelCase or snake case? Mich Talebzadeh
[Spark SQL]: Source code for PartitionedFile Ashley McManamon
- Re: [Spark SQL]: Source code for PartitionedFile Mich Talebzadeh
- Re: [Spark SQL]: Source code for PartitionedFile Ashley McManamon
How to get db related metrics when use spark jdbc to read db table? casel.chen
- Re: How to get db related metrics when use spark jdbc to read db table? Mich Talebzadeh
- Re: How to get db related metrics when use spark jdbc to read db table? Femi Anthony
Spark UDAF in examples fail with not serializable error Owen Bell
Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError? Baran, Mert
- Re: Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError? Mich Talebzadeh
Example UDAF fails with "not serializable" exception Owen Bell
External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Bjørn Jørgensen
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Vakaris Baškirov
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s roryqi
- Re: External Spark shuffle service for k8s Vakaris Baškirov
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Arun Ravi
- Re: External Spark shuffle service for k8s Bjørn Jørgensen
- Re: External Spark shuffle service for k8s Bjørn Jørgensen
- Re: External Spark shuffle service for k8s Cheng Pan
- Re: External Spark shuffle service for k8s Mich Talebzadeh
- Re: External Spark shuffle service for k8s Enrico Minack
Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning Tahj Anderson
- Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning Tahj Anderson
Participate in the ASF 25th Anniversary Campaign Brian Proffitt
[Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
- Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Aaron Grubb
- Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
- Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
- Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix Oxlade, Dan
[Spark SQL] How can I use .sql() in conjunction with watermarks? Chloe He
- Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Mich Talebzadeh
- RE: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Chloe He
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Mich Talebzadeh
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? 刘唯
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? 刘唯
- Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks? Mich Talebzadeh
Apache Spark integration with Spring Boot 3.0.0+ Szymon Kasperkiewicz
Community Over Code NA 2024 Travel Assistance Applications now open! Gavin McDonald
[DISCUSS] MySQL version support policy Cheng Pan
- Re: [DISCUSS] MySQL version support policy Dongjoon Hyun
Is one Spark partition mapped to one and only Spark Task ? Sreyan Chakravarty
Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering Mich Talebzadeh