Messages by Date
-
2024/03/05
Re: It seems --py-files only takes the first two arguments. Can someone please confirm?
Mich Talebzadeh
-
2024/03/05
It seems --py-files only takes the first two arguments. Can someone please confirm?
Pedro, Chuck
-
2024/03/05
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
2024/03/05
答复: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
-
2024/03/05
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
2024/03/05
答复: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
-
2024/03/05
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
2024/03/05
答复: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
-
2024/03/04
Re: [ANNOUNCE] Apache Spark 3.5.1 released
yangjie01
-
2024/03/04
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
2024/03/04
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Hyukjin Kwon
-
2024/03/04
Working with a text file that is both compressed by bz2 followed by zip in PySpark
Mich Talebzadeh
-
2024/03/03
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
2024/02/29
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Peter Toth
-
2024/02/29
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
2024/02/29
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Dongjoon Hyun
-
2024/02/29
Re: [ANNOUNCE] Apache Spark 3.5.1 released
John Zhuge
-
2024/02/29
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Prem Sahoo
-
2024/02/29
Re: pyspark dataframe join with two different data type
Mich Talebzadeh
-
2024/02/29
pyspark dataframe join with two different data type
Karthick Nk
-
2024/02/29
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Xinrong Meng
-
2024/02/28
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Dongjoon Hyun
-
2024/02/28
Re: [External] Re: Issue of spark with antlr version
Chawla, Parul
-
2024/02/28
Re: [External] Re: Issue of spark with antlr version
Bjørn Jørgensen
-
2024/02/28
Re:[ANNOUNCE] Apache Spark 3.5.1 released
beliefer
-
2024/02/28
[ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
-
2024/02/27
Re: [Spark Core] Potential bug in JavaRDD#countByValue
Mich Talebzadeh
-
2024/02/27
[Spark Core] Potential bug in JavaRDD#countByValue
Stuart Fehr
-
2024/02/27
Re: Issue of spark with antlr version
Bjørn Jørgensen
-
2024/02/27
Re: Issue of spark with antlr version
Mich Talebzadeh
-
2024/02/27
RE: Issue of spark with antlr version
Sahni, Ashima
-
2024/02/27
Unsubscribe
benson fang
-
2024/02/27
Re: Bugs with joins and SQL in Structured Streaming
Andrzej Zera
-
2024/02/26
Re: Bugs with joins and SQL in Structured Streaming
Mich Talebzadeh
-
2024/02/26
Bugs with joins and SQL in Structured Streaming
Andrzej Zera
-
2024/02/25
Re: Bintray replacement for spark-packages.org
Richard Eggert
-
2024/02/25
Issue of spark with antlr version
Chawla, Parul
-
2024/02/24
unsubscribe
Ameet Kini
-
2024/02/24
Re: job uuid not unique
Xin Zhang
-
2024/02/24
Re: AQE coalesce 60G shuffle data into a single partition
Enrico Minack
-
2024/02/23
Re: [Beginner Debug]: Executor OutOfMemoryError
Mich Talebzadeh
-
2024/02/22
[Beginner Debug]: Executor OutOfMemoryError
Shawn Ligocki
-
2024/02/21
Re: unsubscribe
Xin Zhang
-
2024/02/21
Re: Spark 4.0 Query Analyzer Bug Report
Mich Talebzadeh
-
2024/02/21
Kafka-based Spark Streaming and Vertex AI for Sentiment Analysis
Mich Talebzadeh
-
2024/02/20
[ANNOUNCE] Apache Kyuubi 1.8.1 is available
Cheng Pan
-
2024/02/20
Re: Spark 3.3 Query Analyzer Bug Report
Sharma, Anup
-
2024/02/20
Re: Spark 4.0 Query Analyzer Bug Report
Holden Karau
-
2024/02/20
Spark 4.0 Query Analyzer Bug Report
Sharma, Anup
-
2024/02/20
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Manoj Kumar
-
2024/02/20
Re: unsubscribe
kritika jain
-
2024/02/20
unsubscribe
Крюков Виталий Семенович
-
2024/02/20
Community Over Code Asia 2024 Travel Assistance Applications now open!
Gavin McDonald
-
2024/02/19
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Mich Talebzadeh
-
2024/02/19
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Cheng Pan
-
2024/02/19
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Sri Potluri
-
2024/02/19
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Mich Talebzadeh
-
2024/02/19
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
2024/02/19
Re: [Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Mich Talebzadeh
-
2024/02/19
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
2024/02/19
[Spark on Kubernetes]: Seeking Guidance on Handling Persistent Executor Failures
Sri Potluri
-
2024/02/19
Re: Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
-
2024/02/19
Re: Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
-
2024/02/19
Re: Regarding Spark on Kubernetes(EKS)
Mich Talebzadeh
-
2024/02/19
Re: Regarding Spark on Kubernetes(EKS)
Mich Talebzadeh
-
2024/02/19
Re: Regarding Spark on Kubernetes(EKS)
Mich Talebzadeh
-
2024/02/19
Re: Regarding Spark on Kubernetes(EKS)
Richard Smith
-
2024/02/19
Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
-
2024/02/19
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Mich Talebzadeh
-
2024/02/19
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Saha, Daniel
-
2024/02/18
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Mich Talebzadeh
-
2024/02/17
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Jörn Franke
-
2024/02/17
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Adam Binford
-
2024/02/16
Re: job uuid not unique
Mich Talebzadeh
-
2024/02/16
Effectively append the dataset to avro directory
Rushikesh Kavar
-
2024/02/16
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
2024/02/15
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
2024/02/14
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
2024/02/14
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
praveen sinha
-
2024/02/14
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
2024/02/13
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
John Zhuge
-
2024/02/13
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Yufei Gu
-
2024/02/13
Re: Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001-
Mich Talebzadeh
-
2024/02/13
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Holden Karau
-
2024/02/13
Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
2024/02/13
Re: Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001-
Bjørn Jørgensen
-
2024/02/13
Re: Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001-
Abhishek Singla
-
2024/02/12
Re: Null pointer exception while replying WAL
Mich Talebzadeh
-
2024/02/12
Re: Null pointer exception while replying WAL
nayan sharma
-
2024/02/11
Re: Null pointer exception while replying WAL
Mich Talebzadeh
-
2024/02/09
Null pointer exception while replying WAL
nayan sharma
-
2024/02/09
Re: Building an Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration
Mich Talebzadeh
-
2024/02/09
Building an Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration
Mich Talebzadeh
-
2024/02/08
performance of union vs insert into
Manish Mehra
-
2024/02/06
[ANNOUNCE] Apache Celeborn(incubating) 0.4.0 available
Fu Chen
-
2024/02/03
Community over Code EU 2024 Travel Assistance Applications now open!
Gavin McDonald
-
2024/02/03
[no subject]
Gavin McDonald
-
2024/01/31
Re: Issue in Creating Temp_view in databricks and using spark.sql().
Mich Talebzadeh
-
2024/01/31
Re: Issue in Creating Temp_view in databricks and using spark.sql().
Mich Talebzadeh
-
2024/01/31
deploy spark as cluster
ali sharifi
-
2024/01/31
Create Custom Logs
PRASHANT L
-
2024/01/31
Re: Issue in Creating Temp_view in databricks and using spark.sql().
Jungtaek Lim
-
2024/01/31
randomsplit has issue?
second_co...@yahoo.com.INVALID
-
2024/01/30
Issue in Creating Temp_view in databricks and using spark.sql().
Karthick Nk
-
2024/01/29
[Spark SQL]: Crash when attempting to select PostgreSQL bpchar without length specifier in Spark 3.5.0
Lily Hahn
-
2024/01/29
Re: startTimestamp doesn't work when using rate-micro-batch format
Mich Talebzadeh
-
2024/01/29
Re: startTimestamp doesn't work when using rate-micro-batch format
Perfect Stranger
-
2024/01/28
Re: startTimestamp doesn't work when using rate-micro-batch format
Mich Talebzadeh
-
2024/01/28
startTimestamp doesn't work when using rate-micro-batch format
Perfect Stranger
-
2024/01/26
subscribe
Sahib Aulakh
-
2024/01/26
subscribe
Sahib Aulakh
-
2024/01/24
Re: [Structured Streaming] Avoid one microbatch delay with multiple stateful operations
Andrzej Zera
-
2024/01/23
Some optimization questions about our beloved engine Spark
Aissam Chia
-
2024/01/17
Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001-
Abhishek Singla
-
2024/01/17
Re: unsubscribe
Крюков Виталий Семенович
-
2024/01/16
unsubscribe
Leandro Martelli
-
2024/01/13
Unsubscribe
Andrew Redd
-
2024/01/12
Re: [spark.local.dir] comma separated list does not work
Andrew Petersen
-
2024/01/12
Re: [spark.local.dir] comma separated list does not work
Andrew Petersen
-
2024/01/12
Re: [spark.local.dir] comma separated list does not work
Koert Kuipers
-
2024/01/12
[spark.local.dir] comma separated list does not work
Andrew Petersen
-
2024/01/12
[GraphFrames Spark Package]: Why is there not a distribution for Spark 3.3?
Boileau, Brad
-
2024/01/11
Re: [Structured Streaming] Keeping checkpointing cost under control
Mich Talebzadeh
-
2024/01/11
Re: Structured Streaming Process Each Records Individually
Mich Talebzadeh
-
2024/01/11
Best option to process single kafka stream in parallel: PySpark Vs Dask
lab22
-
2024/01/11
Re: [Structured Streaming] Keeping checkpointing cost under control
Jungtaek Lim
-
2024/01/11
Re: [Structured Streaming] Avoid one microbatch delay with multiple stateful operations
Jungtaek Lim
-
2024/01/11
Re: Okio Vulnerability in Spark 3.4.1
Bjørn Jørgensen
-
2024/01/10
Re: [Structured Streaming] Avoid one microbatch delay with multiple stateful operations
Ant Kutschera
-
2024/01/10
Re: Structured Streaming Process Each Records Individually
Ant Kutschera
-
2024/01/10
Re: Structured Streaming Process Each Records Individually
Mich Talebzadeh
-
2024/01/10
Re: Structured Streaming Process Each Records Individually
Khalid Mammadov
-
2024/01/10
Structured Streaming Process Each Records Individually
PRASHANT L
-
2024/01/10
Re: [Structured Streaming] Keeping checkpointing cost under control
Andrzej Zera
-
2024/01/10
Re: [Structured Streaming] Keeping checkpointing cost under control
Mich Talebzadeh
-
2024/01/10
[Structured Streaming] Avoid one microbatch delay with multiple stateful operations
Andrzej Zera
-
2024/01/10
Re: [Structured Streaming] Keeping checkpointing cost under control
Andrzej Zera
-
2024/01/10
Re: [Structured Streaming] Keeping checkpointing cost under control
Mich Talebzadeh
-
2024/01/10
Re: [Structured Streaming] Keeping checkpointing cost under control
Andrzej Zera
-
2024/01/10
unsubscribe
Daniel Maangi
-
2024/01/10
[apache-spark] documentation on File Metadata _metadata struct
Jason Horner
-
2024/01/09
Unsubscribe
qi bryce
-
2024/01/09
Re: Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics.
Mich Talebzadeh
-
2024/01/09
Re: Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics.
ashok34...@yahoo.com.INVALID
-
2024/01/09
Unsubscribe
mahzad kalantari
-
2024/01/09
Unsubscribe
Kalhara Gurugamage
-
2024/01/08
Re: Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics.
Mich Talebzadeh
-
2024/01/08
Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics.
Mich Talebzadeh
-
2024/01/08
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
-
2024/01/07
[ANNOUNCE] Apache Celeborn(incubating) 0.3.2 available
Nicholas Jiang
-
2024/01/07
Re: [Structured Streaming] Keeping checkpointing cost under control
Mich Talebzadeh
-
2024/01/07
Re: [Structured Streaming] Keeping checkpointing cost under control
Andrzej Zera
-
2024/01/06
Re: [Structured Streaming] Keeping checkpointing cost under control
Mich Talebzadeh
-
2024/01/05
[Structured Streaming] Keeping checkpointing cost under control
Andrzej Zera
-
2024/01/05
Re: Issue with Spark Session Initialization in Kubernetes Deployment
Mich Talebzadeh
-
2024/01/04
Issue with Spark Session Initialization in Kubernetes Deployment
Atul Patil
-
2024/01/02
Unsubscribe
Atlas - Samir Souidi
-
2023/12/30
Re: Select Columns from Dataframe in Java
Grisha Weintraub
-
2023/12/30
Re: Select Columns from Dataframe in Java
PRASHANT L
-
2023/12/30
Re: Select Columns from Dataframe in Java
Grisha Weintraub
-
2023/12/29
Unsubscribe
Vinti Maheshwari
-
2023/12/29
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
-
2023/12/29
Re: the life cycle shuffle Dependency
murat migdisoglu
-
2023/12/29
Select Columns from Dataframe in Java
PRASHANT L
-
2023/12/28
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
-
2023/12/28
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
-
2023/12/28
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
-
2023/12/28
Re: Pyspark UDF as a data source for streaming
Hyukjin Kwon
-
2023/12/27
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
-
2023/12/27
Fwd: the life cycle shuffle Dependency
yang chen
-
2023/12/27
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
-
2023/12/27
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
-
2023/12/27
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
-
2023/12/27
Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
-
2023/12/26
Re: Validate spark sql
Gourav Sengupta
-
2023/12/26
Re: Validate spark sql
Mich Talebzadeh
-
2023/12/25
Re: Validate spark sql
Bjørn Jørgensen
-
2023/12/25
Re: Validate spark sql
Bjørn Jørgensen
-
2023/12/25
回复:Validate spark sql
tianlangstudio
-
2023/12/24
Re: Validate spark sql
ram manickam
-
2023/12/24
Re: Validate spark sql
Mich Talebzadeh
-
2023/12/24
Re: Validate spark sql
Nicholas Chammas
-
2023/12/21
Unsubscribe
yxj1141
-
2023/12/21
India Scala & Big Data Job Referral
sri hari kali charan Tummala
-
2023/12/20
About shuffle partition size
Nebi Aydin
-
2023/12/16
[ANNOUNCE] Apache Spark 3.3.4 released
Dongjoon Hyun
-
2023/12/16
Unsubscribe
Andrew Milkowski
-
2023/12/15
Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Mich Talebzadeh
-
2023/12/15
Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Mich Talebzadeh
-
2023/12/14
Re: Architecture of Spark Connect
Hyukjin Kwon
-
2023/12/14
Re: Architecture of Spark Connect
Kezhi Xiong
-
2023/12/14
Re: Architecture of Spark Connect
Nikhil Goyal
-
2023/12/14
Architecture of Spark Connect
Nikhil Goyal
-
2023/12/13
Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Koert Kuipers
-
2023/12/13
Unsubscribe
kritika jain
-
2023/12/13
Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Atul Patil
-
2023/12/13
Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Patil, Atul
-
2023/12/12
Unsubscribe
Daniel Maangi
-
2023/12/12
Unsubscribe
Klaus Schaefers
-
2023/12/12
Unsubscribe
Sergey Boytsov