Hi. I have a project I'm currently working on. The idea is to implement "scikit-learn" into Storm and integrate it with HDFS.
I've already implemented "scikit-learn". But, currently I'm using a text file to read and write. However, I need to use HDFS, but finding it hard to integrate with HDFS. Here is the link to github <https://github.com/kgzharas/StormTopologyTest>. (I only included files that I used, not whole project) Basically, I have a few questions if you don't mint to answer them 1) How to use HDFS to read and write? 2) Is my "scikit-learn" implementation correct? 3) How to create a Storm project? (Currently working in "storm-starter") These questions may sound a bit silly, but I really can't find a proper solution. Thank you for your attention to this matter. Sincerely, Zharas.