Re: Akka disassociation on Java SE Embedded

2014-10-10 Thread bhusted
How do you increase the spark block manager timeout?



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Akka-disassociation-on-Java-SE-Embedded-tp6266p16176.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



Re: Sorting a Sequence File

2014-10-02 Thread bhusted
Here is the code in question

//read in the hadoop sequence file to sort
 val file = sc.sequenceFile(input, classOf[Text], classOf[Text])

//this is the code we would like to avoid that maps the Hadoop Text Input to
Strings so the sortyByKey will run
 file.map{ case (k,v) = (k.toString(), v.toString())} 

//perform the sort on the converted data
val sortedOutput = file.sortByKey(true, 1)

//write out the results
sortedOutput.saveAsSequenceFile(output, Some(classOf[DefaultCodec]))



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Sorting-a-Sequence-File-tp15633p15647.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



Re: K-means faster on Mahout then on Spark

2014-09-25 Thread bhusted
What is the size of your vector mine is set to 20? I am seeing slow results
as well with iteration=5, # of elements 200,000,000. 







--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/K-means-faster-on-Mahout-then-on-Spark-tp3195p15168.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org



Re: Benchmarking Spark with YCSB

2014-05-17 Thread bhusted
Thanks Jay.  I honestly think I just had a senior moment or something.  I was
getting HiBench and YCSB confused.  Has anyone attempted to port HiBench to
using Spark?  HiBench performs a lot of map/reduce and it would be a very
interesting comparison for us.



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Benchmarking-Spark-with-YCSB-tp5813p5953.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.


1.0.0 Release Date?

2014-05-14 Thread bhusted
Can anyone comment on the anticipated date or worse case timeframe for when
Spark 1.0.0 will be released?



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/1-0-0-Release-Date-tp5664.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.