user
Messages by Thread
Issue with Spark Session Initialization in Kubernetes Deployment
Atul Patil
Re: Issue with Spark Session Initialization in Kubernetes Deployment
Mich Talebzadeh
Select Columns from Dataframe in Java
PRASHANT L
Re: Select Columns from Dataframe in Java
Grisha Weintraub
Re: Select Columns from Dataframe in Java
PRASHANT L
Re: Select Columns from Dataframe in Java
Grisha Weintraub
Fwd: the life cycle shuffle Dependency
yang chen
Re: the life cycle shuffle Dependency
murat migdisoglu
Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
Re: Pyspark UDF as a data source for streaming
Hyukjin Kwon
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
RE: Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
Re: Pyspark UDF as a data source for streaming
Mich Talebzadeh
Re: Validate spark sql
Nicholas Chammas
Re: Validate spark sql
Mich Talebzadeh
Re: Validate spark sql
ram manickam
Re: Validate spark sql
tianlangstudio
Re: Validate spark sql
Mich Talebzadeh
Re: Validate spark sql
Bjørn Jørgensen
Re: Validate spark sql
Gourav Sengupta
Re: Validate spark sql
Bjørn Jørgensen
India Scala & Big Data Job Referral
sri hari kali charan Tummala
About shuffle partition size
Nebi Aydin
[ANNOUNCE] Apache Spark 3.3.4 released
Dongjoon Hyun
Architecture of Spark Connect
Nikhil Goyal
Re: Architecture of Spark Connect
Nikhil Goyal
Re: Architecture of Spark Connect
Kezhi Xiong
Re: Architecture of Spark Connect
Hyukjin Kwon
Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Patil, Atul
Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Atul Patil
Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Koert Kuipers
Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Mich Talebzadeh
Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Mich Talebzadeh
Cluster-mode job compute-time/cost metrics
Jack Wells
Re: Cluster-mode job compute-time/cost metrics
Jörn Franke
Re: Cluster-mode job compute-time/cost metrics
murat migdisoglu
Spark 3.1.3 with Hive dynamic partitions fails while driver moves the staged files
Shay Elbaz
Spark on Java 17
Faiz Halde
RE: Spark on Java 17
Luca Canali
Re: Spark on Java 17
Faiz Halde
Re: Spark on Java 17
Jörn Franke
Re: Spark on Java 17
Jörn Franke
SSH Tunneling issue with Apache Spark
Venkatesan Muniappan
Re: SSH Tunneling issue with Apache Spark
Venkatesan Muniappan
Re: SSH Tunneling issue with Apache Spark
Nicholas Chammas
Re: SSH Tunneling issue with Apache Spark
Venkatesan Muniappan
ordering of rows in dataframe
Som Lima
Re: ordering of rows in dataframe
Enrico Minack
ML advice
Zahid Rahman
Do we have any mechanism to control requests per second for a Kafka connect sink?
Yeikel Santana
Re: Do we have any mechanism to control requests per second for a Kafka connect sink?
Yeikel Santana
Spark-Connect: Param `--packages` does not take effect for executors.
Xiaolong Wang
Re: Spark-Connect: Param `--packages` does not take effect for executors.
Aironman DirtDiver
Re: Spark-Connect: Param `--packages` does not take effect for executors.
Holden Karau
[PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Михаил Кулаков
Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Enrico Minack
Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Enrico Minack
Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Михаил Кулаков
ML using Spark Connect
Faiz Halde
[FYI] SPARK-45981: Improve Python language test coverage
Dongjoon Hyun
Re: [FYI] SPARK-45981: Improve Python language test coverage
Hyukjin Kwon
[Streaming (DStream) ] : Does Spark Streaming supports pause/resume consumption of message from Kafka?
Saurabh Agrawal (180813)
Re: [Streaming (DStream) ] : Does Spark Streaming supports pause/resume consumption of message from Kafka?
Mich Talebzadeh
[ANNOUNCE] Apache Spark 3.4.2 released
Dongjoon Hyun
Re:[ANNOUNCE] Apache Spark 3.4.2 released
beliefer
[sql] how to connect query stage to Spark job/stages?
Chenghao Lyu
Tuning Best Practices
Bryant Wright
Re: Tuning Best Practices
Jack Goodson
Re: Tuning Best Practices
Bryant Wright
Classpath isolation per SparkSession without Spark Connect
Faiz Halde
Re: Classpath isolation per SparkSession without Spark Connect
Holden Karau
Re: Classpath isolation per SparkSession without Spark Connect
Faiz Halde
Re: Classpath isolation per SparkSession without Spark Connect
Pasha Finkelshtein
Re: Classpath isolation per SparkSession without Spark Connect
Faiz Halde
Re: Classpath isolation per SparkSession without Spark Connect
Pasha Finkelshtein
Re: Spark structured streaming tab is missing from spark web UI
Jungtaek Lim
[Spark-sql 3.2.4] Wrong Statistic INFO From 'ANALYZE TABLE' Command
Nick Luo
Query fails on CASE statement depending on order of summed columns
Evgenii Ignatev
How exactly does dropDuplicatesWithinWatermark work?
Perfect Stranger
Re: How exactly does dropDuplicatesWithinWatermark work?
Jungtaek Lim
Setting fs.s3a.aws.credentials.provider through a connect server.
Leandro Martelli
Spark-submit without access to HDFS
Eugene Miretsky
Re: Spark-submit without access to HDFS
eab...@163.com
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
Re: Re: [EXTERNAL] Re: Spark-submit without access to HDFS
eab...@163.com
Re: Spark-submit without access to HDFS
Jörn Franke
Re: Spark-submit without access to HDFS
Mich Talebzadeh
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Mich Talebzadeh
Re: [EXTERNAL] Re: [EXTERNAL] Re: Spark-submit without access to HDFS
Eugene Miretsky
[Spark Structured Streaming] Two sink from Single stream
Subash Prabanantham
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
RE: The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Stevens, Clay
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
Why create/drop/alter/rename partition does not post listener event in ExternalCatalogWithListener?
李响
Pass xmx values to SparkLauncher launched Java process
Deepthi Sathia Raj
How grouping rows without shuffle
Yoel Benharrous
help needed with SPARK-45598 and SPARK-45769
Maksym M
Storage Partition Joins only works for buckets?
Arwin Tio
org.apache.ranger.authorization.hive.authorizer.RangerHiveAuthorizerFactory ClassNotFoundException
Yi Zheng
[ANNOUNCE] Apache Kyuubi released 1.8.0
Cheng Pan
Spark master shuts down when one of zookeeper dies
Kaustubh Ghode
Re: Spark master shuts down when one of zookeeper dies
Mich Talebzadeh
How to configure authentication from a pySpark client to a Spark Connect server ?
Xiaolong Wang
[Spark SQL] [Bug] Adding `checkpoint()` causes "column [...] cannot be resolved" error
Robin Zimmerman
Parser error when running PySpark on Windows connecting to GCS
Richard Smith
Re: Parser error when running PySpark on Windows connecting to GCS
Mich Talebzadeh
Data analysis issues
Jauru Lin
Re: Data analysis issues
Mich Talebzadeh
Spark / Scala conflict
Harry Jamison
Re: Spark / Scala conflict
Aironman DirtDiver
Re: Spark / Scala conflict
Harry Jamison
Fixed byte array issue
KhajaAsmath Mohammed
jackson-databind version mismatch
moshik.vitas
Re: jackson-databind version mismatch
eab...@163.com
Re: jackson-databind version mismatch
Bjørn Jørgensen
Re: jackson-databind version mismatch
Bjørn Jørgensen
Re: Re: jackson-databind version mismatch
eab...@163.com
RE: jackson-databind version mismatch
moshik.vitas
Elasticity and scalability for Spark in Kubernetes
Mich Talebzadeh
[Structured Streaming] Joins after aggregation don't work in streaming
Andrzej Zera
Re: [Structured Streaming] Joins after aggregation don't work in streaming
Jungtaek Lim
Re: [Structured Streaming] Joins after aggregation don't work in streaming
Andrzej Zera
spark schema conflict behavior records being silently dropped
Carlos Aguni
submitting tasks failed in Spark standalone mode due to missing failureaccess jar file
eab...@163.com
Contribution Recommendations
Phil Dakin
Maximum executors in EC2 Machine
KhajaAsmath Mohammed
Re: Maximum executors in EC2 Machine
Riccardo Ferrari
automatically/dynamically renew aws temporary token
Carlos Aguni
Re: automatically/dynamically renew aws temporary token
Jörn Franke
Re: automatically/dynamically renew aws temporary token
Pol Santamaria
Re: automatically/dynamically renew aws temporary token
Carlos Aguni
Spark join produce duplicate rows in resultset
Meena Rajani
Re: Spark join produce duplicate rows in resultset
Patrick Tucci
Re: Spark join produce duplicate rows in resultset
Sadha Chilukoori
Re: Spark join produce duplicate rows in resultset
Bjørn Jørgensen
Re: Spark join produce duplicate rows in resultset
Meena Rajani
Error when trying to get the data from Hive Materialized View
Siva Sankar Reddy
spark.stop() cannot stop spark connect session
eab...@163.com
[Resolved] Re: spark.stop() cannot stop spark connect session
eab...@163.com
"Premature end of Content-Length" Error
Sandhya Bala
hive: spark as execution engine. class not found problem
Amirhossein Kabiri
Re: hive: spark as execution engine. class not found problem
Vijay Shankar
[ANNOUNCE] Apache Celeborn(incubating) 0.3.1 available
Cheng Pan
[ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column
Suyash Ajmera
Can not complete the read csv task
Kelum Perera
Fw: Can not complete the read csv task
Kelum Perera
Fwd: Fw: Can not complete the read csv task
KP Youtuber
Re: Can not complete the read csv task
Khalid Mammadov
Autoscaling in Spark
Kiran Biswal
Re: Autoscaling in Spark
Mich Talebzadeh
Log file location in Spark on K8s
Agrawal, Sanket
Re: Log file location in Spark on K8s
Prashant Sharma
Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
ashok34...@yahoo.com.INVALID
Re: Clarification with Spark Structured Streaming
Mich Talebzadeh
Re: Clarification with Spark Structured Streaming
Danilo Sousa
Spark Compatibility with Spring Boot 3.x
Ahmed Albalawi
Re: Spark Compatibility with Spring Boot 3.x
Sean Owen
Re: Spark Compatibility with Spring Boot 3.x
Angshuman Bhattacharya
RE: Re: Spark Compatibility with Spring Boot 3.x
Guru Panda
Connection pool shut down in Spark Iceberg Streaming Connector
Agrawal, Sanket
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Prashant Sharma
Re: Connection pool shut down in Spark Iceberg Streaming Connector
Igor Calabria
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Raghavendra Ganesh
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Perez
[PySpark Structured Streaming] How to tune .repartition(N) ?
Shao Yang Hong
Re: [PySpark Structured Streaming] How to tune .repartition(N) ?
Mich Talebzadeh
[Spark Core]: Recomputation cost of a job due to executor failures
Faiz Halde
Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Karthick Nk
Re: Updating delta file column data
Mich Talebzadeh
Re: Updating delta file column data
Mich Talebzadeh
using facebook Prophet + pyspark for forecasting - Dataframe has less than 2 non-NaN rows
karan alang
Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jayabindu Singh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Mich Talebzadeh
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jon Rodríguez Aranguren
Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration
Jörn Franke
Thread dump only shows 10 shuffle clients
Nebi Aydin
Files io threads vs shuffle io threads
Nebi Aydin
Inquiry about Processing Speed
Haseeb Khalid
Re: Inquiry about Processing Speed
Deepak Goel
Re: Inquiry about Processing Speed
Jack Goodson
Reading Glue Catalog Views through Spark.
Agrawal, Sanket