Hi Mich,

Yes, Alluxio is commonly used to cache and share Spark RDDs and DataFrames among different applications and contexts. The data typically stays in memory, but with Alluxio's tiered storage, the "colder" data can be evicted to other media, such as SSDs and HDDs. Here is a blog post discussing Spark RDDs and Alluxio: https://www.alluxio.com/blog/effective-spark-rdds-with-alluxio
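As a rough sketch of the sharing pattern described above: one application writes an RDD or DataFrame to an alluxio:// path, and a second application with its own SparkContext reads it back. The host name alluxio-master, the path /shared/events, and all helper function names are hypothetical and chosen for illustration; 19998 is the default Alluxio master port, and the Alluxio client jar is assumed to be on Spark's classpath. The Spark calls used (saveAsTextFile, textFile, write.parquet, read.parquet) are the standard PySpark APIs.

```python
# Sketch only: sharing Spark data between applications through Alluxio.
# Assumed, not from this thread: host "alluxio-master", path "/shared/events",
# and the helper names below. Spark objects are passed in by the caller.

def alluxio_uri(path, host="alluxio-master", port=19998):
    """Build an alluxio:// URI for an absolute path in the Alluxio namespace."""
    if not path.startswith("/"):
        raise ValueError("path must be absolute, e.g. /shared/events")
    return f"alluxio://{host}:{port}{path}"


def publish_rdd(rdd, path):
    """Application A: write an RDD to Alluxio so other applications can read it."""
    uri = alluxio_uri(path)
    rdd.saveAsTextFile(uri)  # standard RDD call; errors if the target exists
    return uri


def load_rdd(sc, path):
    """Application B: read the shared data using its own SparkContext."""
    return sc.textFile(alluxio_uri(path))


def publish_dataframe(df, path):
    """DataFrame variant: Parquet stores the schema alongside the data."""
    uri = alluxio_uri(path)
    df.write.parquet(uri)
    return uri


def load_dataframe(spark, path):
    """Read a shared DataFrame back through a SparkSession."""
    return spark.read.parquet(alluxio_uri(path))
```

In this arrangement the two applications never share a SparkContext; they only agree on the Alluxio path, which is why the data survives the writing application shutting down.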
Alluxio also has the concept of an "Under filesystem", which lets you access your existing data across different storage systems through a single namespace. Here is more information about the unified namespace abilities: http://www.alluxio.org/docs/master/en/Unified-and-Transparent-Namespace.html

Hope that helps,
Gene

On Thu, Oct 27, 2016 at 3:39 AM, Mich Talebzadeh <mich.talebza...@gmail.com> wrote:

> Thanks Chanh,
>
> Can it share RDDs?
>
> Personally I have not used either Alluxio or Ignite.
>
> 1. Are there major differences between these two?
> 2. Have you tried Alluxio for sharing Spark RDDs, and if so do you have
>    any experience you can kindly share?
>
> Regards
>
> Dr Mich Talebzadeh
>
> LinkedIn: https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
>
> http://talebzadehmich.wordpress.com
>
> *Disclaimer:* Use it at your own risk. Any and all responsibility for any
> loss, damage or destruction of data or any other property which may arise
> from relying on this email's technical content is explicitly disclaimed.
> The author will in no case be liable for any monetary damages arising from
> such loss, damage or destruction.
>
> On 27 October 2016 at 11:29, Chanh Le <giaosu...@gmail.com> wrote:
>
>> Hi Mich,
>> Alluxio is the good option to go.
>>
>> Regards,
>> Chanh
>>
>> On Oct 27, 2016, at 5:28 PM, Mich Talebzadeh <mich.talebza...@gmail.com>
>> wrote:
>>
>> There was a mention of using Zeppelin to share RDDs with many users. From
>> the notes on Zeppelin it appears that this is sharing UI and I am not sure
>> how easy it is going to be changing the result set with different users
>> modifying say sql queries.
>>
>> There is also the idea of caching RDDs with something like Apache Ignite.
>> Has anyone really tried this? Will that work with multiple applications?
>>
>> It looks feasible as RDDs are immutable and so are registered tempTables
>> etc.
>>
>> Thanks
>>
>> Dr Mich Talebzadeh