I have a standalone Spark cluster and have some jobs scheduled using crontab.
It works but I don't have all the real time monitoring to get emails or to control a flow for example. Thought about using the Spark "hidden" API to have a better control but seems the API is not officially documented and I don't see much talking about that on that web. Another option would be Oozie but looks like Oozie only works with Hadoop so I'd need to install it and change my architecture. Is there any other option you suggest? I'm using only open source versions (no dist) Thanks Get Outlook for iOS<https://aka.ms/o0ukef>