user
Thread
Date
Earlier messages
Later messages
Messages by Thread
Re: should one every make a spark streaming job in pyspark
Mich Talebzadeh
Re: Re: should one every make a spark streaming job in pyspark
Lingzhe Sun
Ctrl - left and right now working in Spark Shell in Windows 10
Salil Surendran
Re: Ctrl - left and right now working in Spark Shell in Windows 10
Sean Owen
spark - local question
张健BJ
Re: spark - local question
Sean Owen
Re: spark - local question
Bjørn Jørgensen
Re: spark - local question
Bjørn Jørgensen
How to find final status (Driver's) for an application
Violet Vin
Re: How to find final status (Driver's) for an application
Artemis User
Dynamic Scaling without Kubernetes
Artemis User
Re: Dynamic Scaling without Kubernetes
Holden Karau
Re: Dynamic Scaling without Kubernetes
Artemis User
Re: Dynamic Scaling without Kubernetes
Mich Talebzadeh
Running 30 Spark applications at the same time is slower than one on average
eab...@163.com
Re: Running 30 Spark applications at the same time is slower than one on average
Sean Owen
Re: Running 30 Spark applications at the same time is slower than one on average
Artemis User
Re: Running 30 Spark applications at the same time is slower than one on average
Sean Owen
[ANNOUNCE] Apache Spark 3.3.1 released
Yuming Wang
Re: [ANNOUNCE] Apache Spark 3.3.1 released
Dongjoon Hyun
Re: [ANNOUNCE] Apache Spark 3.3.1 released
L. C. Hsieh
Re: [ANNOUNCE] Apache Spark 3.3.1 released
Hyukjin Kwon
Re: [ANNOUNCE] Apache Spark 3.3.1 released
Maxim Gekk
Re: [ANNOUNCE] Apache Spark 3.3.1 released
Yang,Jie(INF)
Re: [ANNOUNCE] Apache Spark 3.3.1 released
Jacek Laskowski
Re:[ANNOUNCE] Apache Spark 3.3.1 released
beliefer
Re: [ANNOUNCE] Apache Spark 3.3.1 released
Chao Sun
The Dataset unit test is much slower than the RDD unit test (in Scala)
Tanin Na Nakorn
Re: The Dataset unit test is much slower than the RDD unit test (in Scala)
Enrico Minack
Re: The Dataset unit test is much slower than the RDD unit test (in Scala)
Cheng Pan
Dynamic allocation on K8
Nikhil Goyal
Re: Dynamic allocation on K8
Shrikant Prasad
Prometheus with spark
Raj ks
Re: Prometheus with spark
Jacek Laskowski
Re: Prometheus with spark
Raja bhupati
Re: Prometheus with spark
Denny Lee
[PySpark, Spark Streaming] Bug in timestamp handling in Structured Streaming?
kai-michael.roes...@sap.com.INVALID
Spark partitioned By
venkatesh bandaru
pyspark connect to spark thrift server port
second_co...@yahoo.com.INVALID
Re: pyspark connect to spark thrift server port
Artemis User
Re: pyspark connect to spark thrift server port
second_co...@yahoo.com.INVALID
Re: pyspark connect to spark thrift server port
Artemis User
Encoded data retrieved when reading Parquet file
Nipuna Shantha
RE: Encoded data retrieved when reading Parquet file
Nipuna Shantha
How to use neo4j cypher/opencypher to query spark RDD/graphdb
ERSyrfw212oe
Re: How to use neo4j cypher/opencypher to query spark RDD/graphdb
Artemis User
spark on kubernetes
Mohammad Abdollahzade Arani
Re: spark on kubernetes
Qian Sun
Re: spark on kubernetes
Qian Sun
Spark on Kubernetes
Tarun raghav
Re: Spark on Kubernetes
Mich Talebzadeh
[Feature Request] make unix_micros() and unix_millis() available in PySpark (pyspark.sql.functions)
Martin
Re: [Feature Request] make unix_micros() and unix_millis() available in PySpark (pyspark.sql.functions)
Hyukjin Kwon
Apache Spark Operator for Kubernetes?
Clayton Wohl
Re: Apache Spark Operator for Kubernetes?
Artemis User
RE: Apache Spark Operator for Kubernetes?
Jim Halfpenny
[SparkListener] Calculating the total amount of re-computations / waste
Faiz Halde
Re: [SparkListener] Calculating the total amount of re-computations / waste
Emil Ejbyfeldt
Executor heartbeats on Kubernetes
Kristopher Kane
Re: Executor heartbeats on Kubernetes
Qian SUN
Efficiently updating running sums only on new data
Greg Kopff
Re: Efficiently updating running sums only on new data
Artemis User
Re: Efficiently updating running sums only on new data
Igor Calabria
Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql?
Chartist
Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql?
Sadha Chilukoori
Re: Why the same INSERT OVERWRITE sql , final table file produced by spark sql is larger than hive sql?
Chartist
As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Oliver Plohmann
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Никита Романов
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Sean Owen
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Henrik Park
Re: As a Scala newbie starting to work with Spark does it make more sense to learn Scala 2 or Scala 3?
Sean Owen
[Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release?
phoebe chen
Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release?
Sean Owen
Re: [Spark Core][Release]Can we consider add SPARK-39725 into 3.3.1 or 3.3.2 release?
Bjørn Jørgensen
Converting None/Null into json in pyspark
Karthick Nk
Re: Converting None/Null into json in pyspark
Yeachan Park
Re: Converting None/Null into json in pyspark
Karthick Nk
Re: Converting None/Null into json in pyspark
Yeachan Park
Reading too many files
Sachit Murarka
Re: Reading too many files
Sid
Re: Reading too many files
Henrik Pang
Re: Reading too many files
Enrico Minack
Re: Reading too many files
Artemis User
Spike on number of tasks - dynamic allocation
murat migdisoglu
Re: Spike on number of tasks - dynamic allocation
Mich Talebzadeh
Re: Spike on number of tasks - dynamic allocation
murat migdisoglu
Re: Spike on number of tasks - dynamic allocation
Mich Talebzadeh
WARN ProcfsMetricsGetter: Exception
Surya Gopisetty
Re: WARN ProcfsMetricsGetter: Exception
Henrik Pang
Spark ML VarianceThresholdSelector Unexpected Results
姜鑫
Re: Spark ML VarianceThresholdSelector Unexpected Results
Sean Owen
Re: Spark ML VarianceThresholdSelector Unexpected Results
姜鑫
Help with Shuffle Read performance
Igor Calabria
Re: Help with Shuffle Read performance
Gourav Sengupta
Re: Help with Shuffle Read performance
Tufan Rakshit
Re: Help with Shuffle Read performance
Vladimir Prus
Re: Help with Shuffle Read performance
Igor Calabria
Re: Help with Shuffle Read performance
Gourav Sengupta
Re: Help with Shuffle Read performance
Leszek Reimus
Re: Help with Shuffle Read performance
Gourav Sengupta
Re: Help with Shuffle Read performance
Sungwoo Park
Re: Help with Shuffle Read performance
Leszek Reimus
Re: Help with Shuffle Read performance
Artemis User
Re: Help with Shuffle Read performance
Igor Calabria
depolying stage-level scheduling for Spark SQL and how to expose RDD code from Spark SQL?
Chenghao Lyu
Does 'Stage cancelled because SparkContext was shut down' is a error
lk_spark
[Spark Kubernetes] Question about Configurability of Labeling Driver Service
Shiqi Sun
Re: [Spark Kubernetes] Question about Configurability of Labeling Driver Service
Shiqi Sun
Kyro Serializer not getting set : Spark3
rajat kumar
Re: Kyro Serializer not getting set : Spark3
Qian SUN
Re: Kyro Serializer not getting set : Spark3
rajat kumar
HELP, Populating an empty pyspark dataframe with auto-generated dates
Jamie Arodi
Query regarding Proleptic Gregorian Calendar Spark3
Sachit Murarka
Re: Query regarding Proleptic Gregorian Calendar Spark3
Sachit Murarka
Error - Spark STREAMING
Akash Vellukai
Re: Error - Spark STREAMING
Anupam Singh
Re: Issue with SparkContext
Bjørn Jørgensen
Re: Issue with SparkContext
javacaoyu
NoClassDefError and SparkSession should only be created and accessed on the driver.
rajat kumar
答复: NoClassDefError and SparkSession should only be created and accessed on the driver.
Xiao, Alton
Re: NoClassDefError and SparkSession should only be created and accessed on the driver.
rajat kumar
Re: NoClassDefError and SparkSession should only be created and accessed on the driver.
Paul Rogalinski
Spark Structured Streaming - stderr getting filled up
karan alang
Re: Spark Structured Streaming - stderr getting filled up
karan alang
Re: Spark Structured Streaming - stderr getting filled up
karan alang
[how to]RDD using JDBC data source in PySpark
javaca...@163.com
答复: [how to]RDD using JDBC data source in PySpark
Xiao, Alton
回复: 答复: [how to]RDD using JDBC data source in PySpark
javaca...@163.com
Re: 答复: [how to]RDD using JDBC data source in PySpark
Bjørn Jørgensen
Re: Re: [how to]RDD using JDBC data source in PySpark
javaca...@163.com
Re: Re: [how to]RDD using JDBC data source in PySpark
Bjørn Jørgensen
Re: 答复: [how to]RDD using JDBC data source in PySpark
Sean Owen
Driver throws exception every few hours
Kiran Biswal
[Spark Core] Joining Same DataFrame Multiple Times Results in Column not getting dropped
Shahban Riaz
[Spark Internals]: Is sort order preserved after partitioned write?
Swetha Baskaran
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Enrico Minack
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Swetha Baskaran
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Enrico Minack
Re: [Spark Internals]: Is sort order preserved after partitioned write?
Swetha Baskaran
Big Data Contract Roles ?
sri hari kali charan Tummala
Splittable or not?
Sid
Re: Splittable or not?
Amit Joshi
Re: Splittable or not?
Sid
Re: Splittable or not?
Enrico Minack
Re: Splittable or not?
Sid
Re: Splittable or not?
Jack Goodson
Network time out property is not getting set in Spark
Sachit Murarka
Re: EXT: Network time out property is not getting set in Spark
Vibhor Gupta
Re: EXT: Network time out property is not getting set in Spark
Sachit Murarka
Long running task in spark
rajat kumar
Re: Long running task in spark
Sid
[SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage
akshit marwah
Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage
Artemis User
Re: [SPARK STRUCTURED STREAMING] : Rocks DB uses off-heap usage
Adam Binford
Dynamic shuffle partitions in a single job
Vibhor Gupta
Re: Dynamic shuffle partitions in a single job
Anupam Singh
RE: [EXTERNAL] Re: Dynamic shuffle partitions in a single job
Kapil Kumar Singh
Spark SQL
Mayur Benodekar
Re: Spark SQL
Gourav Sengupta
Re: Spark SQL
Mayur Benodekar
Re: Spark SQL
Gourav Sengupta
Re: EXT: Re: Spark SQL
Vibhor Gupta
Pipelined execution in Spark (???)
Sungwoo Park
Re: Pipelined execution in Spark (???)
Russell Jurney
Re: Pipelined execution in Spark (???)
Sungwoo Park
Re: Pipelined execution in Spark (???)
Sean Owen
Re: Pipelined execution in Spark (???)
Sungwoo Park
Re: Pipelined execution in Spark (???)
Russell Jurney
Re: Pipelined execution in Spark (???)
Gourav Sengupta
Re: Pipelined execution in Spark (???)
Russell Jurney
Re: Pipelined execution in Spark (???)
Russell Jurney
Spark equivalent to hdfs groups
phiroc
Re: Spark equivalent to hdfs groups
Sean Owen
Re: Spark equivalent to hdfs groups
phiroc
Re: Spark equivalent to hdfs groups
Sean Owen
Re: Spark equivalent to hdfs groups
phiroc
Spark Structured Streaming - unable to change max.poll.records (showing as 1)
karan alang
[ANNOUNCE] Apache Kyuubi (Incubating) released 1.6.0-incubating
Nicholas Jiang
Error in Spark in Jupyter Notebook
Mamata Shee
Re: Error in Spark in Jupyter Notebook
Sean Owen
Apache Spark - How to concert DataFrame json string to structured element and using schema_of_json
M Singh
Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Holden Karau
Re: Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Holden Karau
Re: Jupyter notebook on Dataproc versus GKE
Bjørn Jørgensen
Re: Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Bjørn Jørgensen
Re: Jupyter notebook on Dataproc versus GKE
Mich Talebzadeh
Re: Jupyter notebook on Dataproc versus GKE
Holden Karau
Re: Jupyter notebook on Dataproc versus GKE
Bjørn Jørgensen
Spark Issue with Istio in Distributed Mode
Deepak Sharma
Re: Spark Issue with Istio in Distributed Mode
Deepak Sharma
Re: Spark Issue with Istio in Distributed Mode
Deepak Sharma
Data Type Issue while upgrading to Spark3
rajat kumar
Creating Custom Broadcast Join
Murali S
ERROR MicroBatchExecution
Ravi Chandran
running pyspark on kubernetes - no space left on device
Manoj GEORGE
Re: running pyspark on kubernetes - no space left on device
Matt Proetsch
Re: running pyspark on kubernetes - no space left on device
Qian SUN
Earlier messages
Later messages