user

Messages by Thread

- Re: should one every make a spark streaming job in pyspark Mich Talebzadeh
- Re: Re: should one every make a spark streaming job in pyspark Lingzhe Sun
Ctrl - left and right now working in Spark Shell in Windows 10 Salil Surendran
- Re: Ctrl - left and right now working in Spark Shell in Windows 10 Sean Owen
spark - local question 张健BJ
- Re: spark - local question Sean Owen
- Re: spark - local question Bjørn Jørgensen
- Re: spark - local question Bjørn Jørgensen
How to find final status (Driver's) for an application Violet Vin
- Re: How to find final status (Driver's) for an application Artemis User
Dynamic Scaling without Kubernetes Artemis User
- Re: Dynamic Scaling without Kubernetes Holden Karau
- Re: Dynamic Scaling without Kubernetes Artemis User
- Re: Dynamic Scaling without Kubernetes Mich Talebzadeh
Running 30 Spark applications at the same time is slower than one on average eab...@163.com
- Re: Running 30 Spark applications at the same time is slower than one on average Sean Owen
- Re: Running 30 Spark applications at the same time is slower than one on average Artemis User
- Re: Running 30 Spark applications at the same time is slower than one on average Sean Owen
[ANNOUNCE] Apache Spark 3.3.1 released Yuming Wang
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Dongjoon Hyun
- Re: [ANNOUNCE] Apache Spark 3.3.1 released L. C. Hsieh
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Hyukjin Kwon
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Maxim Gekk
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Yang,Jie(INF)
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Jacek Laskowski
- Re:[ANNOUNCE] Apache Spark 3.3.1 released beliefer
- Re: [ANNOUNCE] Apache Spark 3.3.1 released Chao Sun
The Dataset unit test is much slower than the RDD unit test (in Scala) Tanin Na Nakorn
- Re: The Dataset unit test is much slower than the RDD unit test (in Scala) Enrico Minack
- Re: The Dataset unit test is much slower than the RDD unit test (in Scala) Cheng Pan
Dynamic allocation on K8 Nikhil Goyal
- Re: Dynamic allocation on K8 Shrikant Prasad
Prometheus with spark Raj ks
- Re: Prometheus with spark Jacek Laskowski
- Re: Prometheus with spark Raja bhupati
- Re: Prometheus with spark Denny Lee
[PySpark, Spark Streaming] Bug in timestamp handling in Structured Streaming? kai-michael.roes...@sap.com.INVALID
Spark partitioned By venkatesh bandaru
pyspark connect to spark thrift server port second_co...@yahoo.com.INVALID
- Re: pyspark connect to spark thrift server port Artemis User
- Re: pyspark connect to spark thrift server port second_co...@yahoo.com.INVALID
- Re: pyspark connect to spark thrift server port Artemis User
Encoded data retrieved when reading Parquet file Nipuna Shantha
- RE: Encoded data retrieved when reading Parquet file Nipuna Shantha
How to use neo4j cypher/opencypher to query spark RDD/graphdb ERSyrfw212oe
- Re: How to use neo4j cypher/opencypher to query spark RDD/graphdb Artemis User
spark on kubernetes Mohammad Abdollahzade Arani
- Re: spark on kubernetes Qian Sun
- Re: spark on kubernetes Qian Sun
- Spark on Kubernetes Tarun raghav
- Re: Spark on Kubernetes Mich Talebzadeh
[Feature Request] make unix_micros() and unix_millis() available in PySpark (pyspark.sql.functions) Martin
- Re: [Feature Request] make unix_micros() and unix_millis() available in PySpark (pyspark.sql.functions) Hyukjin Kwon
Apache Spark Operator for Kubernetes? Clayton Wohl
- Re: Apache Spark Operator for Kubernetes? Artemis User
- RE: Apache Spark Operator for Kubernetes? Jim Halfpenny
[SparkListener] Calculating the total amount of re-computations / waste Faiz Halde
- Re: [SparkListener] Calculating the total amount of re-computations / waste Emil Ejbyfeldt
Executor heartbeats on Kubernetes Kristopher Kane
- Re: Executor heartbeats on Kubernetes Qian SUN
Efficiently updating running sums only on new data Greg Kopff
- Re: Efficiently updating running sums only on new data Artemis User
- Re: Efficiently updating running sums only on new data Igor Calabria
Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql？ Chartist
- Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql？ Sadha Chilukoori
- Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql？ Chartist
As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Oliver Plohmann
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Никита Романов
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Sean Owen
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Henrik Park
- Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3? Sean Owen
[Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release? phoebe chen
- Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release? Sean Owen
- Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release? Bjørn Jørgensen
Converting None/Null into json in pyspark Karthick Nk
- Re: Converting None/Null into json in pyspark Yeachan Park
- Re: Converting None/Null into json in pyspark Karthick Nk
- Re: Converting None/Null into json in pyspark Yeachan Park
Reading too many files Sachit Murarka
- Re: Reading too many files Sid
- Re: Reading too many files Henrik Pang
- Re: Reading too many files Enrico Minack
- Re: Reading too many files Artemis User
- Spike on number of tasks - dynamic allocation murat migdisoglu
- Re: Spike on number of tasks - dynamic allocation Mich Talebzadeh
- Re: Spike on number of tasks - dynamic allocation murat migdisoglu
- Re: Spike on number of tasks - dynamic allocation Mich Talebzadeh
WARN ProcfsMetricsGetter: Exception Surya Gopisetty
- Re: WARN ProcfsMetricsGetter: Exception Henrik Pang
Spark ML VarianceThresholdSelector Unexpected Results 姜鑫
- Re: Spark ML VarianceThresholdSelector Unexpected Results Sean Owen
- Re: Spark ML VarianceThresholdSelector Unexpected Results 姜鑫
Help with Shuffle Read performance Igor Calabria
- Re: Help with Shuffle Read performance Gourav Sengupta
- Re: Help with Shuffle Read performance Tufan Rakshit
- Re: Help with Shuffle Read performance Vladimir Prus
- Re: Help with Shuffle Read performance Igor Calabria
- Re: Help with Shuffle Read performance Gourav Sengupta
- Re: Help with Shuffle Read performance Leszek Reimus
- Re: Help with Shuffle Read performance Gourav Sengupta
- Re: Help with Shuffle Read performance Sungwoo Park
- Re: Help with Shuffle Read performance Leszek Reimus
- Re: Help with Shuffle Read performance Artemis User
- Re: Help with Shuffle Read performance Igor Calabria
depolying stage-level scheduling for Spark SQL and how to expose RDD code from Spark SQL? Chenghao Lyu
Does 'Stage cancelled because SparkContext was shut down' is a error lk_spark
[Spark Kubernetes] Question about Configurability of Labeling Driver Service Shiqi Sun
- Re: [Spark Kubernetes] Question about Configurability of Labeling Driver Service Shiqi Sun
Kyro Serializer not getting set : Spark3 rajat kumar
- Re: Kyro Serializer not getting set : Spark3 Qian SUN
- Re: Kyro Serializer not getting set : Spark3 rajat kumar
HELP, Populating an empty pyspark dataframe with auto-generated dates Jamie Arodi
Query regarding Proleptic Gregorian Calendar Spark3 Sachit Murarka
- Re: Query regarding Proleptic Gregorian Calendar Spark3 Sachit Murarka
Error - Spark STREAMING Akash Vellukai
- Re: Error - Spark STREAMING Anupam Singh
Re: Issue with SparkContext Bjørn Jørgensen
- Re: Issue with SparkContext javacaoyu
NoClassDefError and SparkSession should only be created and accessed on the driver. rajat kumar
- 答复: NoClassDefError and SparkSession should only be created and accessed on the driver. Xiao, Alton
- Re: NoClassDefError and SparkSession should only be created and accessed on the driver. rajat kumar
- Re: NoClassDefError and SparkSession should only be created and accessed on the driver. Paul Rogalinski
Spark Structured Streaming - stderr getting filled up karan alang
- Re: Spark Structured Streaming - stderr getting filled up karan alang
- Re: Spark Structured Streaming - stderr getting filled up karan alang
[how to]RDD using JDBC data source in PySpark javaca...@163.com
- 答复: [how to]RDD using JDBC data source in PySpark Xiao, Alton
- 回复: 答复: [how to]RDD using JDBC data source in PySpark javaca...@163.com
- Re: 答复: [how to]RDD using JDBC data source in PySpark Bjørn Jørgensen
- Re: Re: [how to]RDD using JDBC data source in PySpark javaca...@163.com
- Re: Re: [how to]RDD using JDBC data source in PySpark Bjørn Jørgensen
- Re: 答复: [how to]RDD using JDBC data source in PySpark Sean Owen
Driver throws exception every few hours Kiran Biswal
[Spark Core] Joining Same DataFrame Multiple Times Results in Column not getting dropped Shahban Riaz
[Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Enrico Minack
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Enrico Minack
- Re: [Spark Internals]: Is sort order preserved after partitioned write? Swetha Baskaran
Big Data Contract Roles ? sri hari kali charan Tummala
Splittable or not? Sid
- Re: Splittable or not? Amit Joshi
- Re: Splittable or not? Sid
- Re: Splittable or not? Enrico Minack
- Re: Splittable or not? Sid
- Re: Splittable or not? Jack Goodson
Network time out property is not getting set in Spark Sachit Murarka
- Re: EXT: Network time out property is not getting set in Spark Vibhor Gupta
- Re: EXT: Network time out property is not getting set in Spark Sachit Murarka
Long running task in spark rajat kumar
- Re: Long running task in spark Sid
[SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage akshit marwah
- Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage Artemis User
- Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage Adam Binford
Dynamic shuffle partitions in a single job Vibhor Gupta
- Re: Dynamic shuffle partitions in a single job Anupam Singh
- RE: [EXTERNAL] Re: Dynamic shuffle partitions in a single job Kapil Kumar Singh
Spark SQL Mayur Benodekar
- Re: Spark SQL Gourav Sengupta
- Re: Spark SQL Mayur Benodekar
- Re: Spark SQL Gourav Sengupta
- Re: EXT: Re: Spark SQL Vibhor Gupta
Pipelined execution in Spark (???) Sungwoo Park
- Re: Pipelined execution in Spark (???) Russell Jurney
- Re: Pipelined execution in Spark (???) Sungwoo Park
- Re: Pipelined execution in Spark (???) Sean Owen
- Re: Pipelined execution in Spark (???) Sungwoo Park
- Re: Pipelined execution in Spark (???) Russell Jurney
- Re: Pipelined execution in Spark (???) Gourav Sengupta
- Re: Pipelined execution in Spark (???) Russell Jurney
- Re: Pipelined execution in Spark (???) Russell Jurney
Spark equivalent to hdfs groups phiroc
- Re: Spark equivalent to hdfs groups Sean Owen
- Re: Spark equivalent to hdfs groups phiroc
- Re: Spark equivalent to hdfs groups Sean Owen
- Re: Spark equivalent to hdfs groups phiroc
Spark Structured Streaming - unable to change max.poll.records (showing as 1) karan alang
[ANNOUNCE] Apache Kyuubi (Incubating) released 1.6.0-incubating Nicholas Jiang
Error in Spark in Jupyter Notebook Mamata Shee
- Re: Error in Spark in Jupyter Notebook Sean Owen
Apache Spark - How to concert DataFrame json string to structured element and using schema_of_json M Singh
Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Holden Karau
- Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Holden Karau
- Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
- Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
- Re: Jupyter notebook on Dataproc versus GKE Mich Talebzadeh
- Re: Jupyter notebook on Dataproc versus GKE Holden Karau
- Re: Jupyter notebook on Dataproc versus GKE Bjørn Jørgensen
Spark Issue with Istio in Distributed Mode Deepak Sharma
- Re: Spark Issue with Istio in Distributed Mode Deepak Sharma
- Re: Spark Issue with Istio in Distributed Mode Deepak Sharma
Data Type Issue while upgrading to Spark3 rajat kumar
Creating Custom Broadcast Join Murali S
- ERROR MicroBatchExecution Ravi Chandran
running pyspark on kubernetes - no space left on device Manoj GEORGE
- Re: running pyspark on kubernetes - no space left on device Matt Proetsch
- Re: running pyspark on kubernetes - no space left on device Qian SUN