Spark history server running on Mongo

Ivan Sadikov Tue, 18 Jul 2017 01:01:58 -0700

Hello everyone!

I have been working on Spark history server that uses MongoDB as a
datastore for processed events to iterate on idea that Spree project uses
for Spark UI. Project was originally designed to improve on standalone
history server with reduced memory footprint.


Project lives here: https://github.com/lightcopy/history-server

These are just very early days of the project, sort of pre-alpha (some
features are missing, and metrics in some failed jobs cases are
questionable). Code is being tested on several 8gb and 2gb logs and aims to
lower resource usage since we run history server together with several
other systems.

Would greatly appreciate any feedback on repository (issues/pull
requests/suggestions/etc.). Thanks a lot!


Cheers,

Ivan

Spark history server running on Mongo

Reply via email to