I'm just using PMML. I haven't hit any limitation of its expressiveness, for the model types is supports. I don't think there is a point in defining a new format for models, excepting that PMML can get very big. Still, just compressing the XML gets it down to a manageable size for just about any realistic model.*
I can imagine some kind of translation from PMML-in-XML to PMML-in-something-else that is more compact. I've not seen anyone do this. * there still aren't formats for factored matrices and probably won't ever quite be, since they're just too large for a file format. On Tue, Nov 18, 2014 at 5:34 AM, Manish Amde <manish...@gmail.com> wrote: > Hi Charles, > > I am not aware of other storage formats. Perhaps Sean or Sandy can elaborate > more given their experience with Oryx. > > There is work by Smola et al at Google that talks about large scale model > update and deployment. > https://www.usenix.org/conference/osdi14/technical-sessions/presentation/li_mu > > -Manish > --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org