Messages by Thread
-
Regarding Spark on Kubernetes(EKS)
Jagannath Majhi
-
Re: Re-create SparkContext of SparkSession inside long-lived Spark app
Adam Binford
-
Re: job uuid not unique
Mich Talebzadeh
-
Effectively append the dataset to avro directory
Rushikesh Kavar
-
Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Holden Karau
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Yufei Gu
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
John Zhuge
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
praveen sinha
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Chao Sun
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Mich Talebzadeh
-
Re: Introducing Comet, a plugin to accelerate Spark execution via DataFusion and Arrow
Manoj Kumar
-
Null pointer exception while replying WAL
nayan sharma
-
Building an Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration
Mich Talebzadeh
-
performance of union vs insert into
Manish Mehra
-
[ANNOUNCE] Apache Celeborn(incubating) 0.4.0 available
Fu Chen
-
Community over Code EU 2024 Travel Assistance Applications now open!
Gavin McDonald
-
[no subject]
Gavin McDonald
-
deploy spark as cluster
ali sharifi
-
Create Custom Logs
PRASHANT L
-
randomsplit has issue?
second_co...@yahoo.com.INVALID
-
Issue in Creating Temp_view in databricks and using spark.sql().
Karthick Nk
-
[Spark SQL]: Crash when attempting to select PostgreSQL bpchar without length specifier in Spark 3.5.0
Lily Hahn
-
startTimestamp doesn't work when using rate-micro-batch format
Perfect Stranger
-
Some optimization questions about our beloved engine Spark
Aissam Chia
-
Facing Error org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for s3ablock-0001-
Abhishek Singla
-
[spark.local.dir] comma separated list does not work
Andrew Petersen
-
[GraphFrames Spark Package]: Why is there not a distribution for Spark 3.3?
Boileau, Brad
-
Best option to process single kafka stream in parallel: PySpark Vs Dask
lab22
-
Structured Streaming Process Each Records Individually
PRASHANT L
-
[Structured Streaming] Avoid one microbatch delay with multiple stateful operations
Andrzej Zera
-
[apache-spark] documentation on File Metadata _metadata struct
Jason Horner
-
Spark Structured Streaming and Flask REST API for Real-Time Data Ingestion and Analytics.
Mich Talebzadeh
-
[ANNOUNCE] Apache Celeborn(incubating) 0.3.2 available
Nicholas Jiang
-
[Structured Streaming] Keeping checkpointing cost under control
Andrzej Zera
-
Issue with Spark Session Initialization in Kubernetes Deployment
Atul Patil
-
Select Columns from Dataframe in Java
PRASHANT L
-
Fwd: the life cycle shuffle Dependency
yang chen
-
Pyspark UDF as a data source for streaming
Поротиков Станислав Вячеславович
-
Re: Validate spark sql
Nicholas Chammas
-
India Scala & Big Data Job Referral
sri hari kali charan Tummala
-
About shuffle partition size
Nebi Aydin
-
[ANNOUNCE] Apache Spark 3.3.4 released
Dongjoon Hyun
-
Architecture of Spark Connect
Nikhil Goyal
-
Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment)
Patil, Atul
-
Cluster-mode job compute-time/cost metrics
Jack Wells
-
Spark 3.1.3 with Hive dynamic partitions fails while driver moves the staged files
Shay Elbaz
-
Spark on Java 17
Faiz Halde
-
SSH Tunneling issue with Apache Spark
Venkatesan Muniappan
-
ordering of rows in dataframe
Som Lima
-
ML advice
Zahid Rahman
-
Do we have any mechanism to control requests per second for a Kafka connect sink?
Yeikel Santana
-
Spark-Connect: Param `--packages` does not take effect for executors.
Xiaolong Wang
-
[PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation?
Михаил Кулаков
-
ML using Spark Connect
Faiz Halde
-
[FYI] SPARK-45981: Improve Python language test coverage
Dongjoon Hyun
-
[Streaming (DStream) ] : Does Spark Streaming supports pause/resume consumption of message from Kafka?
Saurabh Agrawal (180813)
-
[ANNOUNCE] Apache Spark 3.4.2 released
Dongjoon Hyun
-
[sql] how to connect query stage to Spark job/stages?
Chenghao Lyu
-
Tuning Best Practices
Bryant Wright
-
Classpath isolation per SparkSession without Spark Connect
Faiz Halde
-
Re: Spark structured streaming tab is missing from spark web UI
Jungtaek Lim
-
[Spark-sql 3.2.4] Wrong Statistic INFO From 'ANALYZE TABLE' Command
Nick Luo
-
Query fails on CASE statement depending on order of summed columns
Evgenii Ignatev
-
How exactly does dropDuplicatesWithinWatermark work?
Perfect Stranger
-
Setting fs.s3a.aws.credentials.provider through a connect server.
Leandro Martelli
-
Spark-submit without access to HDFS
Eugene Miretsky
-
[Spark Structured Streaming] Two sink from Single stream
Subash Prabanantham
-
The job failed when we upgraded from spark 3.3.1 to spark3.4.1
Hanyu Huang
-
Why create/drop/alter/rename partition does not post listener event in ExternalCatalogWithListener?
李响
-
Pass xmx values to SparkLauncher launched Java process
Deepthi Sathia Raj
-
How grouping rows without shuffle
Yoel Benharrous
-
help needed with SPARK-45598 and SPARK-45769
Maksym M
-
Storage Partition Joins only works for buckets?
Arwin Tio