Re: Spark history server running on Mongo

Ivan Sadikov Tue, 18 Jul 2017 09:45:04 -0700

Thanks for the JIRA ticket reference! Frankly, I was aware of this work, but
didn't know that there was an API for storage implementations.


Will try exploring that as well, thanks!

On Wed, 19 Jul 2017 at 4:18 AM, Marcelo Vanzin <van...@cloudera.com> wrote:

> See SPARK-18085. That has many of the same goals re: SHS resource
> usage, and also provides a (currently non-public) API where you could
> just create a MongoDB implementation if you want.
>
> On Tue, Jul 18, 2017 at 12:56 AM, Ivan Sadikov <ivan.sadi...@gmail.com>
> wrote:
> > Hello everyone!
> >
> > I have been working on a Spark history server that uses MongoDB as a
> > datastore for processed events, iterating on the idea that the Spree
> > project uses for the Spark UI. The project was originally designed to
> > improve on the standalone history server with a reduced memory footprint.
> >
> > Project lives here: https://github.com/lightcopy/history-server
> >
> > These are very early days for the project, sort of pre-alpha (some
> > features are missing, and metrics in some failed-job cases are
> > questionable). The code is being tested on several 8 GB and 2 GB logs and
> > aims to lower resource usage, since we run the history server together
> > with several other systems.
> >
> > I would greatly appreciate any feedback on the repository (issues/pull
> > requests/suggestions/etc.). Thanks a lot!
> >
> >
> > Cheers,
> >
> > Ivan
> >
>
>
>
> --
> Marcelo
>
