Or Spark on HBase ) http://blog.cloudera.com/blog/2014/12/new-in-cloudera-labs-sparkonhbase/
-- Ruslan Dautkhanov On Tue, Jul 14, 2015 at 7:07 PM, Ted Yu <yuzhih...@gmail.com> wrote: > bq. that is, key-value stores > > Please consider HBase for this purpose :-) > > On Tue, Jul 14, 2015 at 5:55 PM, Tathagata Das <t...@databricks.com> > wrote: > >> I do not recommend using IndexRDD for state management in Spark >> Streaming. What it does not solve out-of-the-box is checkpointing of >> indexRDDs, which important because long running streaming jobs can lead to >> infinite chain of RDDs. Spark Streaming solves it for the updateStateByKey >> operation which you can use, which gives state management capabilities. >> Though for most flexible arbitrary look up of stuff, its better to use a >> dedicated system that is designed and optimized for long term storage of >> data, that is, key-value stores, databases, etc. >> >> On Tue, Jul 14, 2015 at 5:44 PM, Ted Yu <yuzhih...@gmail.com> wrote: >> >>> Please take a look at SPARK-2365 which is in progress. >>> >>> On Tue, Jul 14, 2015 at 5:18 PM, swetha <swethakasire...@gmail.com> >>> wrote: >>> >>>> Hi, >>>> >>>> Is IndexedRDD available in Spark 1.4.0? We would like to use this in >>>> Spark >>>> Streaming to do lookups/updates/deletes in RDDs using keys by storing >>>> them >>>> as key/value pairs. >>>> >>>> Thanks, >>>> Swetha >>>> >>>> >>>> >>>> -- >>>> View this message in context: >>>> http://apache-spark-user-list.1001560.n3.nabble.com/Is-IndexedRDD-available-in-Spark-1-4-0-tp23841.html >>>> Sent from the Apache Spark User List mailing list archive at Nabble.com. >>>> >>>> --------------------------------------------------------------------- >>>> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org >>>> For additional commands, e-mail: user-h...@spark.apache.org >>>> >>>> >>> >> >