Messages by Thread
-
-
Spark Group How to Ask
Zehra Günindi
-
DatasourceV2 with Custom JDBC Source
Arsh Bhardwaj
-
Sources/V2 DatasourceV2 in Spark 3.*
Bigg Ben
-
Understanding about joins in spark
Sid
-
[FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022
Gavin McDonald
-
Glue is serverless? how?
Sid
-
Follow up on Jira Issue 39549
Chenyang Zhang
-
Need help with the configuration for AWS glue jobs
Sid
-
[Java 17] --add-exports required?
Greg Kopff
-
StructuredStreaming - read from Kafka, writing data into Mongo every 10 minutes
karan alang
-
repartition(n) should be deprecated/alerted
Igor Berman
-
[Spark Dataframe] How to load compressed file? (lz4, snappy)
HelloWorld
-
Will it lead to OOM error?
Sid
-
Spark Doubts
Sid
-
spark-submit on kubernetes
Michaela Bogiages
-
Spark Summit Europe
Gowran, Declan
-
How to guarantee dataset is split over unique partitions (partitioned by a column value)
DESCOTTE Loic - externe
-
How reading works?
Sid
-
input file size
mbreuer
-
how to properly filter a dataset by dates ?
marc nicole
-
How to update TaskMetrics from Python?
Shay Elbaz
-
Spark Structured streaming(batch mode) - running dependent jobs concurrently
karan alang
-
How to recognize and get the min of a date/string column in Java?
marc nicole
-
Stickers and Swag
Xiao Li
-
Redesign approach for hitting the APIs using PySpark
Sid
-
[no subject]
Rodrigo
-
Spark streaming / confluent Kafka- messages are empty
KhajaAsmath Mohammed
-
API Problem
Sid
-
Retrieve the count of spark nodes
Poorna Murali
-
to find Difference of locations in Spark Dataframe rows
Chetan Khatri
-
How the data is distributed
Sid
-
Structured streaming with protobuf proto3 schema registry
Kiran Biswal
-
partitionBy creating lot of small files
Nikhil Goyal
-
How to convert a Dataset<Row> to a Dataset<String>?
marc nicole
-
PartitionBy and SortWithinPartitions
Nikhil Goyal
-
approx_count_distinct in spark always return 1
marc nicole
-
Does adaptive auto broadcast respect spark.sql.autoBroadcastJoinThreshold
Henry Quan
-
What's the expected Spark 3.1.4 release date ?
Sandeep Vinayak
-
Kotlin API for Apache Spark feedback
finkel
-
Unable to format timestamp values in pyspark
Sid
-
Unable to convert double values
Sid
-
k-anonymity with Spark in Java
marc nicole
-
Issues getting Apache Spark
Martin, Michael
-
java.lang.NoSuchMethodError: org.apache.hadoop.hive.common.FileUtils.mkdir --> Spark to Hive
Prasanth M Sasidharan
-
Complexity with the data
Sid
-
[SPARK SQL] Spark Thrift server, It is not releasing memory.
Ramakrishna Chilaka
-
GCP Dataproc - adding multiple packages(kafka, mongodb) while submitting spark jobs not working
karan alang
-
Re: Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes
Ranadip Chatterjee
-
Spark Push-Based Shuffle causing multiple stage failures
Han Altae-Tran
-
how to add a column for percent
wilson
-
Problem with implementing the Datasource V2 API for Salesforce
Rohit Pant
-
Final reminder: ApacheCon North America call for presentations closing soon
Rich Bowen
-
[SQL] Why does a small two-source JDBC query take ~150-200ms with all optimizations (AQE, CBO, pushdown, Kryo, unsafe) enabled? (v3.4.0-SNAPSHOT)
Gavin Ray
-
Spark 3 migration question
Jason Xu
-
What does Apache Spark do?
Turritopsis Dohrnii Teo En Ming
-
Stopping streaming after the write commit and before the read commit?
kineret M
-
A scene with unstable Spark performance
Bowen Song