user

Messages by Thread

- Re: [ANNOUNCE] Apache Spark 3.2.3 released Dongjoon Hyun
- Re: [ANNOUNCE] Apache Spark 3.2.3 released huaxin gao
- Re: [ANNOUNCE] Apache Spark 3.2.3 released L. C. Hsieh
Create Jira account Gerben van der Huizen
- Re: Create Jira account Sean Owen
Implement custom datasource (writer) for Spark3 guenterh.lists
[PySpark] Join using condition where each record may be joined multiple times Oliver Ruebenacker
- Re: [PySpark] Join using condition where each record may be joined multiple times Artemis User
- Re: [PySpark] Join using condition where each record may be joined multiple times Oliver Ruebenacker
[PySpark] [applyInPandas] Regression Bug: Cogroup in pandas drops columns from the first dataframe Michael Bílý
Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes Gnana Kumar
- Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes Sean Owen
- Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes Sean Owen
- Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes Gnana Kumar
- Re: Unable to run Spark Job(3.3.2 SNAPSHOT) with Volcano scheduler in Kubernetes Bjørn Jørgensen
Creating a Spark 3 Connector Mitch Shepherd
- Re: Creating a Spark 3 Connector Bjørn Jørgensen
- Re: Creating a Spark 3 Connector Jungtaek Lim
- Spark Partitions Size control vijay khatri
Stack Overflow Question [email protected]
Unable to use GPU with pyspark in windows Vajiha Begum S A
- Re: Unable to use GPU with pyspark in windows Sean Owen
[sparklyR] broadcast table for temporary table -> can you compute statistics for temporary table? Joris Billen
Driver takes long time to finish once job ends Nikhil Goyal
- Re: EXT: Driver takes long time to finish once job ends Vibhor Gupta
- Re: Driver takes long time to finish once job ends Pralabh Kumar
- Re: Driver takes long time to finish once job ends Pralabh Kumar
CVE-2022-33891 mitigation Andrew Pomponio
- Re: CVE-2022-33891 mitigation Sean Owen
- Re: CVE-2022-33891 mitigation Kostya Kortchinsky
Dataproc serverless for Spark Mich Talebzadeh
- Re: Dataproc serverless for Spark Stephen Boesch
- Re: Dataproc serverless for Spark Mich Talebzadeh
- Re: Dataproc serverless for Spark Holden Karau
- Re: Dataproc serverless for Spark Mich Talebzadeh
Spark performance on small dataset Prarthi Jain
[Spark SQL]: Is it possible that spark SQL appends "SELECT 1 " to the query Ramakrishna Rayudu
- Re: [Spark SQL]: Is it possible that spark SQL appends "SELECT 1 " to the query Sean Owen
- Re: [Spark SQL]: Is it possible that spark SQL appends "SELECT 1 " to the query Sean Owen
- Re: [Spark SQL]: Is it possible that spark SQL appends "SELECT 1 " to the query Ramakrishna Rayudu
- Re: [Spark SQL]: Is it possible that spark SQL appends "SELECT 1 " to the query Sean Owen
- Re: [Spark SQL]: Is it possible that spark SQL appends "SELECT 1 " to the query Sean Owen
- pyspark read.csv() doesn't respect locale when reading float Weiand, Markus
- Re: [Spark SQL]: Is it possible that spark SQL appends "SELECT 1 " to the query Sean Owen
VolcanoFeatureStep( Custom Scheduler ) not found in Spark 3.3.1 archive Gnana Kumar
- Re: VolcanoFeatureStep( Custom Scheduler ) not found in Spark 3.3.1 archive Chris Nauroth
- Re: VolcanoFeatureStep( Custom Scheduler ) not found in Spark 3.3.1 archive Gnana Kumar
[ANNOUNCE] Apache Kyuubi (Incubating) released 1.6.1-incubating Shaoyun Chen
Registering native UDF in PySpark Pavel Penkov
Pyspark ML model Save Error Vajiha Begum S A
- Re: Pyspark ML model Save Error Artemis User
- Re: Pyspark ML model Save Error Raja bhupati
sequence file write Shrikant Prasad
- Re: sequence file write Jie Han
- Re: sequence file write Shrikant Prasad
- Re: sequence file write Jie Han
Spark Structured Streaming Duplicate in ForEachBatch with BatchId Vedant Shirodkar
[Spark Sql] Global Setting for Case-Insensitive String Compare Patrick Tucci
- RE: [Spark Sql] Global Setting for Case-Insensitive String Compare Patrick Tucci
- Re: [Spark Sql] Global Setting for Case-Insensitive String Compare Andrew Melo
- RE: Re: [Spark Sql] Global Setting for Case-Insensitive String Compare Patrick Tucci
Spark Scala Contract Opportunity @USA sri hari kali charan Tummala
- Re: Spark Scala Contract Opportunity @USA Stephen Boesch
cannot write spark log to s3a [email protected]
- Re: cannot write spark log to s3a Chris Nauroth
[Spark Core] Adaptive dynamic partition pruning hajyoussef amine
- Re: [Spark Core] Adaptive dynamic partition pruning Jie Han
- Re: [Spark Core] Adaptive dynamic partition pruning hajyoussef amine
- Re: [Spark Core] Adaptive dynamic partition pruning Jie Han
- Re: [Spark Core] Adaptive dynamic partition pruning hajyoussef amine
- Re: [Spark Core] Adaptive dynamic partition pruning Jie Han
Offline elastic index creation Vibhor Gupta
- Re: Offline elastic index creation Debasish Das
[Spark Version] Which version should I choose? zzuly2010
ClassCastException while reading parquet data via Hive metastore Naresh Peshwe
- Re: ClassCastException while reading parquet data via Hive metastore Evy M
- Re: ClassCastException while reading parquet data via Hive metastore Naresh Peshwe
- Re: ClassCastException while reading parquet data via Hive metastore Evy M
- Re: ClassCastException while reading parquet data via Hive metastore Naresh Peshwe
Stage level scheduling - lower the number of executors when using GPUs Shay Elbaz
- Re: Stage level scheduling - lower the number of executors when using GPUs Artemis User
- Re: [EXTERNAL] Re: Stage level scheduling - lower the number of executors when using GPUs Shay Elbaz
- Re: [EXTERNAL] Re: Stage level scheduling - lower the number of executors when using GPUs Artemis User
- Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs Shay Elbaz
- Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs bo yang
- Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs Sean Owen
- Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs Tom Graves
- Re: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs Shay Elbaz
- Re: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs Tom Graves
- Re: [EXTERNAL] Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs Shay Elbaz
- Re: [EXTERNAL] Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs ayan guha
- Re: [EXTERNAL] Re: Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs Shay Elbaz
- Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs Artemis User
- Re: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs Shay Elbaz
[*IMPORTANT*] update Streaming Query Statistics url Priyanshi Shahu
should one every make a spark streaming job in pyspark Joris Billen
- Re: should one every make a spark streaming job in pyspark Mich Talebzadeh
- Re: Re: should one every make a spark streaming job in pyspark Lingzhe Sun
Ctrl - left and right now working in Spark Shell in Windows 10 Salil Surendran
- Re: Ctrl - left and right now working in Spark Shell in Windows 10 Sean Owen
spark - local question 张健BJ
- Re: spark - local question Sean Owen
- Re: spark - local question Bjørn Jørgensen
- Re: spark - local question Bjørn Jørgensen
How to find final status (Driver's) for an application Violet Vin
- Re: How to find final status (Driver's) for an application Artemis User
Dynamic Scaling without Kubernetes Artemis User
- Re: Dynamic Scaling without Kubernetes Holden Karau
- Re: Dynamic Scaling without Kubernetes Artemis User
- Re: Dynamic Scaling without Kubernetes Mich Talebzadeh
Running 30 Spark applications at the same time is slower than one on average [email protected]
- Re: Running 30 Spark applications at the same time is slower than one on average Sean Owen
- Re: Running 30 Spark applications at the same time is slower than one on average Artemis User
- Re: Running 30 Spark applications at the same time is slower than one on average Sean Owen
[ANNOUNCE] Apache Spark 3.3.1 released Yuming Wang
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Dongjoon Hyun
- Re: [ANNOUNCE] Apache Spark 3.3.1 released L. C. Hsieh
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Hyukjin Kwon
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Maxim Gekk
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Yang,Jie(INF)
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Jacek Laskowski
- Re: [ANNOUNCE] Apache Spark 3.3.1 released beliefer
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Chao Sun
The Dataset unit test is much slower than the RDD unit test (in Scala) Tanin Na Nakorn
- Re: The Dataset unit test is much slower than the RDD unit test (in Scala) Enrico Minack
- Re: The Dataset unit test is much slower than the RDD unit test (in Scala) Cheng Pan
Dynamic allocation on K8 Nikhil Goyal
- Re: Dynamic allocation on K8 Shrikant Prasad
Prometheus with spark Raj ks
- Re: Prometheus with spark Jacek Laskowski
- Re: Prometheus with spark Raja bhupati
- Re: Prometheus with spark Denny Lee
[PySpark, Spark Streaming] Bug in timestamp handling in Structured Streaming? [email protected]
Spark partitioned By venkatesh bandaru
pyspark connect to spark thrift server port [email protected]
- Re: pyspark connect to spark thrift server port Artemis User
- Re: pyspark connect to spark thrift server port [email protected]
- Re: pyspark connect to spark thrift server port Artemis User
Encoded data retrieved when reading Parquet file Nipuna Shantha
- RE: Encoded data retrieved when reading Parquet file Nipuna Shantha
How to use neo4j cypher/opencypher to query spark RDD/graphdb ERSyrfw212oe
- Re: How to use neo4j cypher/opencypher to query spark RDD/graphdb Artemis User
spark on kubernetes Mohammad Abdollahzade Arani
- Re: spark on kubernetes Qian Sun
- Re: spark on kubernetes Qian Sun
- Spark on Kubernetes Tarun raghav
- Re: Spark on Kubernetes Mich Talebzadeh
[Feature Request] make unix_micros() and unix_millis() available in PySpark (pyspark.sql.functions) Martin
- Re: [Feature Request] make unix_micros() and unix_millis() available in PySpark (pyspark.sql.functions) Hyukjin Kwon
Apache Spark Operator for Kubernetes? Clayton Wohl
- Re: Apache Spark Operator for Kubernetes? Artemis User
- RE: Apache Spark Operator for Kubernetes? Jim Halfpenny
[SparkListener] Calculating the total amount of re-computations / waste Faiz Halde
- Re: [SparkListener] Calculating the total amount of re-computations / waste Emil Ejbyfeldt
Executor heartbeats on Kubernetes Kristopher Kane
- Re: Executor heartbeats on Kubernetes Qian SUN
Efficiently updating running sums only on new data Greg Kopff
- Re: Efficiently updating running sums only on new data Artemis User
- Re: Efficiently updating running sums only on new data Igor Calabria
Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql？ Chartist
- Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql？ Sadha Chilukoori
- Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql？ Chartist
As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Oliver Plohmann
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Никита Романов
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Sean Owen
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Henrik Park
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Sean Owen
[Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release? phoebe chen
- Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release? Sean Owen
- Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release? Bjørn Jørgensen
Converting None/Null into json in pyspark Karthick Nk
- Re: Converting None/Null into json in pyspark Yeachan Park
- Re: Converting None/Null into json in pyspark Karthick Nk
- Re: Converting None/Null into json in pyspark Yeachan Park
Reading too many files Sachit Murarka
- Re: Reading too many files Sid
- Re: Reading too many files Henrik Pang
- Re: Reading too many files Enrico Minack
- Re: Reading too many files Artemis User
- Spike on number of tasks - dynamic allocation murat migdisoglu
- Re: Spike on number of tasks - dynamic allocation Mich Talebzadeh
- Re: Spike on number of tasks - dynamic allocation murat migdisoglu
- Re: Spike on number of tasks - dynamic allocation Mich Talebzadeh
WARN ProcfsMetricsGetter: Exception Surya Gopisetty
- Re: WARN ProcfsMetricsGetter: Exception Henrik Pang
Spark ML VarianceThresholdSelector Unexpected Results 姜鑫
- Re: Spark ML VarianceThresholdSelector Unexpected Results Sean Owen
- Re: Spark ML VarianceThresholdSelector Unexpected Results 姜鑫
Help with Shuffle Read performance Igor Calabria
- Re: Help with Shuffle Read performance Gourav Sengupta
- Re: Help with Shuffle Read performance Tufan Rakshit
- Re: Help with Shuffle Read performance Vladimir Prus
- Re: Help with Shuffle Read performance Igor Calabria
- Re: Help with Shuffle Read performance Gourav Sengupta
- Re: Help with Shuffle Read performance Leszek Reimus
- Re: Help with Shuffle Read performance Gourav Sengupta
- Re: Help with Shuffle Read performance Sungwoo Park
- Re: Help with Shuffle Read performance Leszek Reimus
- Re: Help with Shuffle Read performance Artemis User
- Re: Help with Shuffle Read performance Igor Calabria