https://databricks.com/blog/2017/02/28/voice-facebook-using-apache-spark-large-scale-language-model-training.html?utm_campaign=Open%20Source&utm_content=47640295&utm_medium=social&utm_source=twitter
Always neglect to include the fact that spark has a complete copy of hive inside of it!
