Re: Akka disassociation on Java SE Embedded
How do you increase the Spark block manager timeout? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Akka-disassociation-on-Java-SE-Embedded-tp6266p16176.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org
Re: Sorting a Sequence File
Here is the code in question:

    import org.apache.hadoop.io.Text
    import org.apache.hadoop.io.compress.DefaultCodec

    // read in the Hadoop sequence file to sort
    val file = sc.sequenceFile(input, classOf[Text], classOf[Text])

    // this is the step we would like to avoid: it maps the Hadoop Text
    // input to Strings so that sortByKey will run
    val converted = file.map { case (k, v) => (k.toString, v.toString) }

    // perform the sort on the converted data
    val sortedOutput = converted.sortByKey(true, 1)

    // write out the results
    sortedOutput.saveAsSequenceFile(output, Some(classOf[DefaultCodec]))

-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Sorting-a-Sequence-File-tp15633p15647.html
Re: K-means faster on Mahout than on Spark
What is the size of your vectors? Mine is set to 20. I am seeing slow results as well, with 5 iterations and 200,000,000 elements. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/K-means-faster-on-Mahout-then-on-Spark-tp3195p15168.html
Re: Benchmarking Spark with YCSB
Thanks, Jay. I honestly think I just had a senior moment or something. I was getting HiBench and YCSB confused. Has anyone attempted to port HiBench to Spark? HiBench performs a lot of map/reduce work, and it would be a very interesting comparison for us. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Benchmarking-Spark-with-YCSB-tp5813p5953.html
1.0.0 Release Date?
Can anyone comment on the anticipated date, or the worst-case timeframe, for the release of Spark 1.0.0? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/1-0-0-Release-Date-tp5664.html