Re: How to compute RDD[(String, Set[String])] that include large Set

2015-01-20 Thread jagaximo
... was able to adopt this approach. Thanks! -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/How-to-compute-RDD-String-Set-String-that-include-large-Set-tp21248p21275.html Sent from the Apache Spark User List mailing list archive at Nabble.com.

Re: How to compute RDD[(String, Set[String])] that include large Set

2015-01-20 Thread Kevin (Sangwoo) Kim

How to compute RDD[(String, Set[String])] that include large Set

2015-01-19 Thread jagaximo
... but when I create an RDD inside an RDD, a NullPointerException occurs. Maybe this operation is impossible. What might be the issue, and what is a possible solution? Please lend me your wisdom.

Re: How to compute RDD[(String, Set[String])] that include large Set

2015-01-19 Thread Pankaj Narang
What happens if you call counted.collect instead of counted.saveAsTextFile("/path/to/save/dir")? If you still face the same issue, please paste the stack trace here.

Re: How to compute RDD[(String, Set[String])] that include large Set

2015-01-19 Thread Kevin Jung

Re: How to compute RDD[(String, Set[String])] that include large Set

2015-01-19 Thread Kevin (Sangwoo) Kim
hoge RDD to make reduceByKey collect only counts of keys.

Re: How to compute RDD[(String, Set[String])] that include large Set

2015-01-19 Thread jagaximo
What I want is a unique count for each key. Using map() or countByKey() does not give a unique count, because duplicate strings are likely to be counted...
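For reference, a unique count per key can be sketched roughly as below without building a Set per key. This is only a minimal sketch, not the code from the thread: the object name, the `uniqueCounts` helper, the sample data, and the local master setting are all assumptions made for illustration. It relies on `distinct()`, `mapValues` and `reduceByKey` from the RDD API, and assumes a Spark version where the pair-RDD implicits are picked up automatically (older releases may need `import org.apache.spark.SparkContext._`).

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

object UniqueCountPerKey {
  // Hypothetical helper: counts the distinct values seen for each key.
  def uniqueCounts(pairs: RDD[(String, String)]): RDD[(String, Long)] =
    pairs
      .distinct()           // drop duplicate (key, value) pairs
      .mapValues(_ => 1L)   // each remaining pair contributes one
      .reduceByKey(_ + _)   // sum the ones per key

  def main(args: Array[String]): Unit = {
    // Local master and app name are assumptions for a standalone run.
    val conf = new SparkConf().setAppName("unique-count-per-key").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // Made-up sample data containing duplicate strings per key.
      val pairs = sc.parallelize(Seq(
        ("a", "x"), ("a", "x"), ("a", "y"),
        ("b", "z"), ("b", "z")
      ))
      // Prints (a,2) and (b,1).
      uniqueCounts(pairs).collect().sortBy(_._1).foreach(println)
    } finally {
      sc.stop()
    }
  }
}
```

Because only one counter per key is reduced, no large Set has to be held in memory for any single key; the cost moves to the shuffle done by `distinct()`.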

Re: How to compute RDD[(String, Set[String])] that include large Set

2015-01-19 Thread Kevin (Sangwoo) Kim
() or countByKey() does not give a unique count, because duplicate strings are likely to be counted...