If you are wanting to share RDDs it might be a good idea to check out Tachyon / Alluxio.
For the Thrift server, I believe the datasets are located in your Spark cluster as RDDs and you just communicate with it via the Thrift JDBC Distributed Query Engine connector. 2016-05-17 5:12 GMT+10:00 Michael Segel <msegel_had...@hotmail.com>: > For one use case.. we were considering using the thrift server as a way to > allow multiple clients access shared RDDs. > > Within the Thrift Context, we create an RDD and expose it as a hive table. > > The question is… where does the RDD exist. On the Thrift service node > itself, or is that just a reference to the RDD which is contained with > contexts on the cluster? > > > Thx > > -Mike > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > >