Columnar Statisics

2015-11-13 Thread sara mustafa
Hi, I am using Spark 1.5.2 and I notice the existence of the class org.apache.spark.sql.columnar.ColumnStatisticsSchema, How can I use it to calculate column statistics of a DataFrame? Thanks, -- View this message in context:

deploying Spark on standalone cluster

2015-03-14 Thread sara mustafa
Hi, I am trying to deploy spark on standalone cluster of two machines on for master node and one for worker node. i have defined the two machines in conf/slaves file and also i /etc/hosts, when i tried to run the cluster the worker node is running but the master node failed to run and throw this

Store the shuffled files in memory using Tachyon

2015-03-06 Thread sara mustafa
Hi all, Is it possible to store Spark shuffled files on a distributed memory like Tachyon instead of spilling them to disk? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Store-the-shuffled-files-in-memory-using-Tachyon-tp21944.html Sent from the Apache