Messages by Thread
-
-
[SPARK SQL] Spark Thrift server, It is not releasing memory.
Ramakrishna Chilaka
-
GCP Dataproc - adding multiple packages(kafka, mongodb) while submitting spark jobs not working
karan alang
-
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ranadip Chatterjee
-
Spark Push-Based Shuffle causing multiple stage failures
Han Altae-Tran
-
how to add a column for percent
wilson
-
Problem with implementing the Datasource V2 API for Salesforce
Rohit Pant
-
Final reminder: ApacheCon North America call for presentations closing soon
Rich Bowen
-
[SQL] Why does a small two-source JDBC query take ~150-200ms with all optimizations (AQE, CBO, pushdown, Kryo, unsafe) enabled? (v3.4.0-SNAPSHOT)
Gavin Ray
-
Spark 3 migration question
Jason Xu
-
What does Apache Spark do?
Turritopsis Dohrnii Teo En Ming
-
Stopping streaming after the write commit and before the read commit?
kineret M
-
A scene with unstable Spark performance
Bowen Song
-
Reverse proxy for Spark UI on Kubernetes
bo yang
-
[Spark SQL]: Configuring/Using Spark + Catalyst optimally for read-heavy transactional workloads in JDBC sources?
Gavin Ray
-
[Spark SQL]: Does Spark SQL support WAITFOR?
K. N. Ramachandran
-
Structured streaming help on releasing memory
Xavi Gervilla
-
Spark on K8s - repeating annoying exception
Shay Elbaz
-
How do I read parquet with python object
ben
-
Need help on migrating Spark on Hortonworks to Kubernetes Cluster
Chetan Khatri
-
Count() action leading to errors | Pyspark
Sid
-
groupby question
Irene Markelic
-
Something about Spark which has bothered me for a very long time, which I've never understood
Denarian Kislata
-
Kafka Spark Structure Streaming Error
nayan sharma
-
Disable/Remove datasources in Spark
Aditya
-
trouble using spark in kubernetes
Andreas Klos
-
Re: Spark error with jupyter
Bjørn Jørgensen
-
REMINDER - Travel Assistance available for ApacheCon NA New Orleans 2022
Gavin McDonald
-
Parse Execution Plan from PySpark
Pablo Alcain
-
Idea for improving performance when reading from hive-like partition folders and specifying a filter [Spark 3.2]
Martin
-
how spark handle the abnormal values
wilson
-
spark null values calculation
wilson
-
structured streaming- checkpoint metadata growing indefinetely
Wojciech Indyk
-
Reg: CVE-2020-9480
Sundar Sabapathi Meenakshi
-
[window aggregate][debug] Rows not dropping with watermark and window
Xavier Gervilla
-
Dealing with large number of small files
Sid
-
Vulnerabilities in htrace-core4-4.1.0-incubating.jar jar used in spark.
HARSH TAKKAR
-
Log4j vulnerability fix | CVE-2021-44228
Shankar, Prakash
-
Spark job failing and not giving error to do diagnosis
rajat kumar
-
Streaming write to orc problem
hsy...@gmail.com
-
Spark3.2 on K8s with proxy-user
Pralabh Kumar
-
[ANNOUNCE] Apache Kyuubi (Incubating) released 1.5.1-incubating
Fu Chen
-
Why is spark running multiple stages with the same code line?
Joe
-
[Spark Core]: Unexpectedly exiting executor while gracefully decommissioning
Yeachan Park
-
When should we cache / persist ? After or Before Actions?
Sid
-
Re: RDD memory use question
Sean Owen
-
Grouping and counting occurences of specific column rows
marc nicole
-
How is union() implemented? Need to implement column bind
Andrew Davidson
-
[Spark Streaming] [Debug] Memory error when using NER model in Python
Xavier Gervilla
-
[Spark Web UI] Integrating Keycloak SSO
Solomon, Brad
-
Please Review My Code
marc nicole
-
Custom metrics in py-spark 3
Harut Martirosyan
-
Monitoring with elastic search in spark job
Xinyu Luan
-
Spark sql slowness in Spark 3.0.1
Anil Dasari
-
Problems with DataFrameReader in Structured Streaming
Artemis User
-
[Spark Streaming]: Why planInputPartitions is called multiple times for each micro-batch in Spark 3?
Hussain, Saghir
-
Streaming partition-by data locality for state lookupon executor
Sandip Khanzode
-
How to overwrite PySpark DataFrame schema without data scan?
Rafał Wojdyła
-
cannot access class sun.nio.ch.DirectBuffer
Arunachalam Sibisakkaravarthi
-
Question about bucketing and custom partitioners
David Diebold
-
Re: A simple comparison for three SQL engines
Wes Peng
-
binaryFile write
Philipp Kraus
-
Grabbing the current MemoryManager in a plugin
Andrew Melo
-
Spark Write BinaryType Column as continues file to S3
Philipp Kraus