OK, your defeatism is my realism. Why has Negative Nancy intruded on this conversation?
I have a view into many large Hadoop users. The feedback from the minority that have tried Mahout is that it is inconsistent/unfinished ("a confederation of unrelated grad-school projects" as one put it), buggy, and hard to use except as a few copied snippets of code. Ouch! Only a handful that I'm aware of actually use it. Internally, there is a perception that there is no community attention to most of the code (see JIRA backlog). As a result -- software problems, community issues, little demand -- it is almost certainly not going to be in our next major packaging release, and was almost not in the current forthcoming one. Your Reality May Vary. This seems like yellow-flag territory for an Apache project though, if this is representative of a wider reality. So a conversation about whole other projects' worth of new functionality feels quite disconnected -- red-flag territory. To be constructive, here are four items that seem more important for something like "1.0.0" and are even a lot less work: - Use Hadoop .mapreduce API consistently - Standardize input output formats of all jobs - Remove use of deprecated code - Clear even a third of the open JIRA backlog (I still think it's fine to make different projects for quite different ideas. Hadoop has another ML project, and is about to have another other ML project. These good ideas might well better belong there. Here, I think there is a big need for shoring up if it's even going to survive to 1.0.) On Thu, Feb 27, 2014 at 5:25 PM, Sean Owen <sro...@gmail.com> wrote: > I think each of several > other of these points are probably on their own several times the amount of > work that has been put into this project over the past year so I'm > wondering if this close to realistic as a to do list for 1.0 of this > project. > That is means. I think that everything on this list is possible in relatively short order, but let's talk goals for a bit. What is missing here? What really doesn't matter?