Is it possible to send CSVSink metrics to HDFS

johnbutcher Thu, 14 Jul 2016 04:10:27 -0700

Hi,

(first ever post)


I experimenting with a Cloudera CDH5 cluster with Spark 1.5.0.
Have tried enabling the CSVSink metrics which seems to work to linux
directories such as /tmp.
However, I'm getting errors when trying to send to an HDFS directory.
Is it possible to use HDFS?

Error message from spark-submit:

16/07/14 11:43:39.489 WARN CsvReporter: Error writing to
application_1466718205476_2664.driver.jvm.total.max
java.io.IOException: No such file or directory

Code extract:

    val metrics = "hdfs://moonshot-ha-nameservice/user/jab31/metrics/"
    val conf = new SparkConf()
                   .setAppName("John's Evil Experiments")
                   .set("spark.metrics.conf.*.sink.csv.class",
"org.apache.spark.metrics.sink.CsvSink")
                   .set("spark.metrics.conf.*.sink.csv.period","1")
                   .set("spark.metrics.conf.*.sink.csv.unit","seconds")
                   .set("spark.metrics.conf.*.sink.csv.directory", metrics)
                   .set("spark.metrics.conf.worker.sink.csv.period","1")
                   .set("spark.metrics.conf.worker.sink.csv.unit","seconds")    
                
.set("spark.metrics.conf.master.source.jvm.class","org.apache.spark.metrics.source.JvmSource")
              
.set("spark.metrics.conf.worker.source.jvm.class","org.apache.spark.metrics.source.JvmSource")
.set("spark.metrics.conf.driver.source.jvm.class","org.apache.spark.metrics.source.JvmSource")
                
.set("spark.metrics.conf.executor.source.jvm.class","org.apache.spark.metrics.source.JvmSource")

HDFS directory:

# hadoop fs -ls hdfs://moonshot-ha-nameservice/user/jab31/metrics/
Found 1 items
-rw-r--r--   3 jab31 msc          4 2016-07-14 11:42
hdfs://moonshot-ha-nameservice/user/jab31/metrics/test.txt

Regards,

John





--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Is-it-possible-to-send-CSVSink-metrics-to-HDFS-tp27335.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Is it possible to send CSVSink metrics to HDFS

Reply via email to