Re: Sending large objects to specific RDDs

2016-01-17 Thread Daniel Imberman
dIndexes:RDD[(Int, InvIndex)] = a.reduceByKey(generateInvertedIndex) vectors:RDD.mapPartitions{ iter => val invIndex = invertedIndexes(samePartitionKey)

Re: Sending large objects to specific RDDs

2016-01-16 Thread Daniel Imberman
ight get too complicated and become problematic) Any thoughts on how I could attack this issue would be highly appreciated. thank you for your help! -- View this message in context: http://apache-spark-us

Re: Sending large objects to specific RDDs

2016-01-16 Thread Ted Yu
t;>>> iter.map(invIndex.calculateSimilarity(_)) >>>>>> ) >>>>>> } >>>>>> >>>>>> How could I go about setting up the Partition such that the specific >>>>>> data >>>>>> st

Re: Sending large objects to specific RDDs

2016-01-15 Thread Ted Yu
l values (which would happen if I were to make a broadcast variable). One thought I have been having is to store the objects in HDFS but I'm not sure if that would be a suboptimal solution (It seems l

Re: Sending large objects to specific RDDs

2016-01-14 Thread Daniel Imberman
on (It seems like it could slow down the process a lot) Another thought I am currently exploring is whether there is some way I can create a custom Partition or Partitioner that could hol

Sending large objects to specific RDDs

2016-01-13 Thread Daniel Imberman
hly appreciated. thank you for your help! -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Sending-large-objects-to-specific-RDDs-tp25967.html Sent from the Apache Spark User List mailing list archive at Nabble.

Re: Sending large objects to specific RDDs

2016-01-13 Thread Ted Yu
ted and become problematic) Any thoughts on how I could attack this issue would be highly appreciated. thank you for your help! -- View this message in context: http://apache-spark-user-li

Re: Sending large objects to specific RDDs

2016-01-13 Thread Daniel Imberman
artition or Partitioner that could hold the data structure (Although that might get too complicated and become problematic) Any thoughts on how I could attack this issue w