MADLib (http://madlib.net/) was designed to bring large-scale ML techniques to a relational database, primarily PostgreSQL. MLlib, by contrast, assumes the data exists in some Spark-compatible data format.
I would suggest you pick the library that matches your data platform first.

DISCLAIMER: I am the original author of MADLib, though EMC/Pivotal assumed ownership rather quickly.

~~~~~~
May All Your Sequences Converge

On Dec 14, 2014, at 6:26 AM, "Venkat, Ankam" <ankam.ven...@centurylink.com> wrote:

> Can somebody throw light on MLlib vs Madlib?
>
> Which is better for machine learning? and are there any specific use case
> scenarios MLlib or Madlib will shine in?
>
> Regards,
> Venkat Ankam