user
Thread
Date
Earlier messages
Later messages
Messages by Thread
Dstream HasOffsetRanges equivalent in Structured streaming
Anil Dasari
Re: Dstream HasOffsetRanges equivalent in Structured streaming
ashok34...@yahoo.com.INVALID
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Mich Talebzadeh
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Tathagata Das
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Anil Dasari
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Tathagata Das
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Anil Dasari
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Anil Dasari
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Anil Dasari
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Mich Talebzadeh
Re: Dstream HasOffsetRanges equivalent in Structured streaming
Mich Talebzadeh
Re: EXT: Dual Write to HDFS and MinIO in faster way
Prem Sahoo
Re: Re: EXT: Dual Write to HDFS and MinIO in faster way
eab...@163.com
Re: Re: EXT: Dual Write to HDFS and MinIO in faster way
Prem Sahoo
Re: Re: EXT: Dual Write to HDFS and MinIO in faster way
Gera Shegalov
Re: Re: EXT: Dual Write to HDFS and MinIO in faster way
Subhasis Mukherjee
A handy tool called spark-column-analyser
Mich Talebzadeh
Re: A handy tool called spark-column-analyser
Mich Talebzadeh
Re: A handy tool called spark-column-analyser
ashok34...@yahoo.com.INVALID
Request for Assistance: Adding User Authentication to Apache Spark Application
NIKHIL RAJ SHRIVASTAVA
How to provide a Zstd "training mode" dictionary object
Saha, Daniel
Query Regarding UDF Support in Spark Connect with Kubernetes as Cluster Manager
Nagatomi Yasukazu
Display a warning in EMR welcome screen
Abhishek Basu
Spark 3.5.x on Java 21?
Stephen Coy
Spark not creating staging dir for insertInto partitioned table
Sanskar Modi
[Spark Streaming]: Save the records that are dropped by watermarking in spark structured streaming
Nandha Kumar
Re: [Spark Streaming]: Save the records that are dropped by watermarking in spark structured streaming
Mich Talebzadeh
Spark Materialized Views: Improve Query Performance and Data Management
Mich Talebzadeh
Help needed optimize spark history server performance
Vikas Tharyani
Issue with Materialized Views in Spark SQL
Mich Talebzadeh
Re: Issue with Materialized Views in Spark SQL
Walaa Eldin Moustafa
Re: Issue with Materialized Views in Spark SQL
Jungtaek Lim
Re: Issue with Materialized Views in Spark SQL
Mich Talebzadeh
Re: Issue with Materialized Views in Spark SQL
Mich Talebzadeh
********Spark streaming issue to Elastic data**********
Karthick Nk
Re: ********Spark streaming issue to Elastic data**********
Mich Talebzadeh
Re: ********Spark streaming issue to Elastic data**********
Karthick Nk
Re: ********Spark streaming issue to Elastic data**********
Mich Talebzadeh
Traceback is missing content in pyspark when invoked with UDF
Indivar Mishra
spark.sql.shuffle.partitions=auto
second_co...@yahoo.com.INVALID
Re: spark.sql.shuffle.partitions=auto
Mich Talebzadeh
Re: Python for the kids and now PySpark
Farshid Ashouri
Re: Python for the kids and now PySpark
Meena Rajani
[Release Question]: Estimate on 3.5.2 release?
Paul Gerver
[SparkListener] Accessing classes loaded via the '--packages' option
Damien Hawes
DataFrameReader: timestampFormat default value
keen
[spark-graphframes]: Generating incorrect edges
Nijland, J.G.W. (Jelle, Student M-CS)
Re: [spark-graphframes]: Generating incorrect edges
Mich Talebzadeh
Re: [spark-graphframes]: Generating incorrect edges
Nijland, J.G.W. (Jelle, Student M-CS)
Re: [spark-graphframes]: Generating incorrect edges
Mich Talebzadeh
Re: [spark-graphframes]: Generating incorrect edges
Nijland, J.G.W. (Jelle, Student M-CS)
Re: [spark-graphframes]: Generating incorrect edges
Stephen Coy
Re: [spark-graphframes]: Generating incorrect edges
Mich Talebzadeh
Re: [spark-graphframes]: Generating incorrect edges
Nijland, J.G.W. (Jelle, Student M-CS)
How to add MaxDOP option in spark mssql JDBC
Elite
RE: How to add MaxDOP option in spark mssql JDBC
Appel, Kevin
Re:RE: How to add MaxDOP option in spark mssql JDBC
Elite
How to use Structured Streaming in Spark SQL
????
How to access the internal hidden columns of table by spark jdbc
casel.chen
Accounting the impact of failures in spark jobs
Faiz Halde
StreamingQueryListener integration with Spark native metric sink (JmxSink)
Mason Chen
[ANNOUNCE] Apache Spark 3.4.3 released
Dongjoon Hyun
[Spark SQL][How-To] Remove builtin function support from Spark
Matthew McMillian
[Spark SQL][How-To] Remove builtin function support from Spark
Matthew McMillian
should OutputCommitCoordinator fail stages for authorized committer failures when using s3a optimized committers?
Dylan McClelland
[Spark SQL] xxhash64 default seed of 42 confusion
Igor Calabria
auto create event log directory if not exist
second_co...@yahoo.com.INVALID
Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Mich Talebzadeh
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Mich Talebzadeh
Re: Spark streaming job for kafka transaction does not consume read_committed messages correctly.
Kidong Lee
Spark column headings, camelCase or snake case?
Mich Talebzadeh
[Spark SQL]: Source code for PartitionedFile
Ashley McManamon
Re: [Spark SQL]: Source code for PartitionedFile
Mich Talebzadeh
Re: [Spark SQL]: Source code for PartitionedFile
Ashley McManamon
How to get db related metrics when use spark jdbc to read db table?
casel.chen
Re: How to get db related metrics when use spark jdbc to read db table?
Mich Talebzadeh
Re: How to get db related metrics when use spark jdbc to read db table?
Femi Anthony
Spark UDAF in examples fail with not serializable error
Owen Bell
Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError?
Baran, Mert
Re: Idiomatic way to rate-limit streaming sources to avoid OutOfMemoryError?
Mich Talebzadeh
Example UDAF fails with "not serializable" exception
Owen Bell
External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Bjørn Jørgensen
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Vakaris Baškirov
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
roryqi
Re: External Spark shuffle service for k8s
Vakaris Baškirov
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Arun Ravi
Re: External Spark shuffle service for k8s
Bjørn Jørgensen
Re: External Spark shuffle service for k8s
Bjørn Jørgensen
Re: External Spark shuffle service for k8s
Cheng Pan
Re: External Spark shuffle service for k8s
Mich Talebzadeh
Re: External Spark shuffle service for k8s
Enrico Minack
Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning
Tahj Anderson
Clarification on what "[id=#]" refers to in Physical Plan Exchange hashpartitioning
Tahj Anderson
Participate in the ASF 25th Anniversary Campaign
Brian Proffitt
[Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Aaron Grubb
Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
Re: [EXTERNAL] Re: [Spark]: Spark / Iceberg / hadoop-aws compatibility matrix
Oxlade, Dan
[Spark SQL] How can I use .sql() in conjunction with watermarks?
Chloe He
Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Mich Talebzadeh
RE: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Chloe He
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Mich Talebzadeh
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
刘唯
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
刘唯
Re: Re: [Spark SQL] How can I use .sql() in conjunction with watermarks?
Mich Talebzadeh
Apache Spark integration with Spring Boot 3.0.0+
Szymon Kasperkiewicz
Community Over Code NA 2024 Travel Assistance Applications now open!
Gavin McDonald
[DISCUSS] MySQL version support policy
Cheng Pan
Re: [DISCUSS] MySQL version support policy
Dongjoon Hyun
Is one Spark partition mapped to one and only Spark Task ?
Sreyan Chakravarty
Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering
Mich Talebzadeh
Re: Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering
Mich Talebzadeh
Bug in org.apache.spark.util.sketch.BloomFilter
Nathan Conroy
[no subject]
Рамик И
Re:
Mich Talebzadeh
Announcing the Community Over Code 2024 Streaming Track
James Hughes
[ANNOUNCE] Apache Kyuubi released 1.9.0
Binjie Yang
pyspark - Use Spark to generate a large dataset on the fly
Sreyan Chakravarty
pyspark - Use Spark to generate a large dataset on the fly
Sreyan Chakravarty
A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
ashok34...@yahoo.com.INVALID
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Parsian, Mahmoud
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Hyukjin Kwon
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Code Tutelage
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Deepak Sharma
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Bjørn Jørgensen
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Reynold Xin
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Joris Billen
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Mich Talebzadeh
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Varun Shah
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Farshid Ashouri
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Kiran Kumar Dusi
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Jay Han
Re: A proposal for creating a Knowledge Sharing Hub for Apache Spark Community
Winston Lai
[GraphX]: Prevent recomputation of DAG
Marek Berith
Re: [GraphX]: Prevent recomputation of DAG
Mich Talebzadeh
Python library that generates fake data using Faker
Mich Talebzadeh
Requesting further assistance with Spark Scala code coverage
里昂
pyspark - Where are Dataframes created from Python objects stored?
Sreyan Chakravarty
Re: pyspark - Where are Dataframes created from Python objects stored?
Mich Talebzadeh
Re: pyspark - Where are Dataframes created from Python objects stored?
Sreyan Chakravarty
Re: pyspark - Where are Dataframes created from Python objects stored?
Mich Talebzadeh
Re: pyspark - Where are Dataframes created from Python objects stored?
Sreyan Chakravarty
Re: pyspark - Where are Dataframes created from Python objects stored?
Varun Shah
Data ingestion into elastic failing using pyspark
Karthick Nk
Bug in How to Monitor Streaming Queries in PySpark
Mich Talebzadeh
Re: Bug in How to Monitor Streaming Queries in PySpark
刘唯
Re: Bug in How to Monitor Streaming Queries in PySpark
刘唯
Re: Bug in How to Monitor Streaming Queries in PySpark
Mich Talebzadeh
Re: Bug in How to Monitor Streaming Queries in PySpark
刘唯
Re: Bug in How to Monitor Streaming Queries in PySpark
Mich Talebzadeh
Spark on Kubenets, execute dataset.show raise exceptions
BODY NO
Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled
sharad mishra
Spark-UI stages and other tabs not accessible in standalone mode when reverse-proxy is enabled
sharad mishra
Creating remote tables using PySpark
Tom Barber
Re: Creating remote tables using PySpark
Tom Barber
Re: Creating remote tables using PySpark
Tom Barber
Re: Creating remote tables using PySpark
Mich Talebzadeh
Dark mode logo
Mike Drob
S3 committer for dynamic partitioning
Nikhil Goyal
It seems --py-files only takes the first two arguments. Can someone please confirm?
Pedro, Chuck
Re: It seems --py-files only takes the first two arguments. Can someone please confirm?
Mich Talebzadeh
Re: It seems --py-files only takes the first two arguments. Can someone please confirm?
Mich Talebzadeh
Working with a text file that is both compressed by bz2 followed by zip in PySpark
Mich Talebzadeh
pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Mich Talebzadeh
Re: pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Damien Hawes
Re: pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Mich Talebzadeh
Re: pyspark dataframe join with two different data type
Karthick Nk
Re: pyspark dataframe join with two different data type
Karthick Nk
[ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re:[ANNOUNCE] Apache Spark 3.5.1 released
beliefer
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Xinrong Meng
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Prem Sahoo
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Peter Toth
Re: [ANNOUNCE] Apache Spark 3.5.1 released
John Zhuge
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Hyukjin Kwon
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Re: [ANNOUNCE] Apache Spark 3.5.1 released
yangjie01
答复: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
答复: [ANNOUNCE] Apache Spark 3.5.1 released
Pan,Bingkun
Re: [ANNOUNCE] Apache Spark 3.5.1 released
Jungtaek Lim
Earlier messages
Later messages