[jira] Created: (MAHOUT-380) IllegalArgumentException from AbstractJDBCDataModel constructor which is extended by AbstractBooleanPrefJDBCDataModel

2010-04-16 Thread Rashmi Paliwal (JIRA)
IllegalArgumentException from AbstractJDBCDataModel constructor which is extended by AbstractBooleanPrefJDBCDataModel - Key: MAHOUT-380 URL:

[jira] Resolved: (MAHOUT-380) IllegalArgumentException from AbstractJDBCDataModel constructor which is extended by AbstractBooleanPrefJDBCDataModel

2010-04-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-380. -- Assignee: Sean Owen Fix Version/s: 0.4 Resolution: Fixed Oops! fixed

c# porting of mahout

2010-04-16 Thread pedram salehpoor
Hi, Is there any effort to port mahout to c# and .NET? I would like to take part in one if there is. Thanks in advance, Best, Pedram Salehpoor

Re: c# porting of mahout

2010-04-16 Thread Sean Owen
None that I'm aware of, and I might suggest it would be hard at the moment for several reasons: - The code is changing very rapidly - The code depends heavily on Java libraries, notably Hadoop, which makes porting difficult On Fri, Apr 16, 2010 at 10:31 AM, pedram salehpoor

Re: c# porting of mahout

2010-04-16 Thread pedram salehpoor
For Hadoop I was thinking about making them assemblies usable for c#. But ever changing code is a problem. Do currently new features are added or the next version is more bug cleaning? On Fri, Apr 16, 2010 at 2:04 PM, Sean Owen sro...@gmail.com wrote: None that I'm aware of, and I might

Re: c# porting of mahout

2010-04-16 Thread Sean Owen
Lots of both -- I imagine it will be changing rapidly for the rest of the year. On Fri, Apr 16, 2010 at 10:48 AM, pedram salehpoor pedram.salehp...@gmail.com wrote: For Hadoop I was thinking about making them assemblies usable for c#. But ever changing code is a problem. Do currently new

Re: c# porting of mahout

2010-04-16 Thread pedram salehpoor
I had to develop something like mahout in C# for my project so I wanted to try to integrate my efforts in something useful. So is there anything that I can do to make it useful for mahout? On Fri, Apr 16, 2010 at 2:25 PM, Sean Owen sro...@gmail.com wrote: Lots of both -- I imagine it will be

Re: c# porting of mahout

2010-04-16 Thread Steven Bourke
Hi Pedram, What specific area is your project in? (Machine learning, distributed computing etc.) On 16 Apr 2010, at 11:12, pedram salehpoor wrote: I had to develop something like mahout in C# for my project so I wanted to try to integrate my efforts in something useful. So is there

Re: Having some trouble with SequentialAccessSparseVector.DenseVector

2010-04-16 Thread Sean Owen
Actually it does all work. I wrote some tests that verify it. I think my first question about index and cur works out because both are set to 0 -- and 0 is correct as the starting value of an array offset and index. And in the other case I believe it's intended that the two values are the current

Re: c# porting of mahout

2010-04-16 Thread pedram salehpoor
Hi, Steven My project needed to use machine learning based on distributed computing to cope with lots of data. On Fri, Apr 16, 2010 at 2:47 PM, Steven Bourke steven.bou...@ucd.ie wrote: Hi Pedram, What specific area is your project in? (Machine learning, distributed computing etc.) On

[jira] Updated: (MAHOUT-379) SequentialAccessSparseVector.equals does not agree with AbstractVector.equivalent

2010-04-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-379: - Attachment: MAHOUT-379.patch This is a pre-patch, per discussion on the mailing list. Is this too much

[jira] Commented: (MAHOUT-379) SequentialAccessSparseVector.equals does not agree with AbstractVector.equivalent

2010-04-16 Thread Danny Leshem (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12857778#action_12857778 ] Danny Leshem commented on MAHOUT-379: - Sean, your patch neither fixes the original

Re: mahout/solr integration

2010-04-16 Thread Anthony
All, I have begun work on an integration of Apache Solr and Mahout, http://github.com/algoriffic/lsa4solr which is related to #MAHOUT-343 (https://issues.apache.org/jira/browse/MAHOUT-343 ). The implementation is in Clojure and interfaces with both the DistributedLanczosSolver and the

Re: mahout/solr integration

2010-04-16 Thread Jake Mannix
Great to see you on here, Anthony! Hey the-rest-of-you! You should definitely check out his lsa4solr code, it's mostly very nice thin Clojure wrappers around the DistributedRowMatrix and related code, which allows for using Clojure's REPL to play interactively with the matrix (which in itself is

Re: mahout/solr integration

2010-04-16 Thread Robin Anil
On Fri, Apr 16, 2010 at 7:52 PM, Anthony lsa4s...@gmail.com wrote: All, I have begun work on an integration of Apache Solr and Mahout, http://github.com/algoriffic/lsa4solr which is related to #MAHOUT-343 (https://issues.apache.org/jira/browse/MAHOUT-343 ). The implementation is in

Re: mahout/solr integration

2010-04-16 Thread Sean Owen
Clojure isn't my cup of tea but that's not important. It's an interesting question, how much belongs under the Mahout tent? There's a tradeoff between excluding useful extensions to the project on the one hand, and becoming a spare parts bin of code of varying levels of maturity and support. I'm

Re: mahout/solr integration

2010-04-16 Thread Jeff Eastman
On 4/16/10 10:05 AM, Sean Owen wrote: Clojure isn't my cup of tea but that's not important. It's an interesting question, how much belongs under the Mahout tent? There's a tradeoff between excluding useful extensions to the project on the one hand, and becoming a spare parts bin of code of

Re: mahout/solr integration

2010-04-16 Thread Robin Anil
I Java would have been nicer (Though I am saying that without knowing how well Clojure binaries can talk with Java ones and vice versa) Clojure is on the JVM, Robin! http://clojure.org/ - the j in the name should have given the hint! :) :) Yeah I saw that, I had this really bad

Re: mahout/solr integration

2010-04-16 Thread Jake Mannix
So here's my take: once we're a TLP (next month sometime?), it is a good time to start allowing subprojects or submodules which are scripting layers on top of Mahout - whether they are PigLatin, or Cascalog, JRuby, or Clojure. If it's JVM-based, especially, having code/scripts which are drivers

Re: mahout/solr integration

2010-04-16 Thread Grant Ingersoll
On Apr 16, 2010, at 2:21 PM, Jake Mannix wrote: So here's my take: once we're a TLP (next month sometime?), it is a good time to start allowing subprojects or submodules which are Submodules, yes, subprojects, not so much, unless the committers are the same. We can definitely release

Re: mahout/solr integration

2010-04-16 Thread Robin Anil
Hmm... this was a bit scattered of a response, but I'm really loathe to turn away a) nice hooks between Solr and Mahout, b) scripting-style wrappers which could expand our community, and c) simply new functionality. +1 I definitely don't want to turn it away, I want to ensure that

Re: mahout/solr integration

2010-04-16 Thread Jake Mannix
On Fri, Apr 16, 2010 at 11:31 AM, Robin Anil robin.a...@gmail.com wrote: Hmm... this was a bit scattered of a response, but I'm really loathe to turn away a) nice hooks between Solr and Mahout, b) scripting-style wrappers which could expand our community, and c) simply new

Re: mahout/solr integration

2010-04-16 Thread Jake Mannix
On Fri, Apr 16, 2010 at 11:26 AM, Grant Ingersoll gsing...@apache.org wrote: On Apr 16, 2010, at 2:21 PM, Jake Mannix wrote: So here's my take: once we're a TLP (next month sometime?), it is a good time to start allowing subprojects or submodules which are Submodules, yes, subprojects,

Re: mahout/solr integration

2010-04-16 Thread Sean Owen
On Fri, Apr 16, 2010 at 7:39 PM, Jake Mannix jake.man...@gmail.com wrote: I will start playing around with Anthony's github-based stuff, and see where a patch can be made. The question is where it would go? It's a fully functioning project already over on its own. I suppose that's my

Re: mahout/solr integration

2010-04-16 Thread Ted Dunning
At that point, they aren't really sub-projects, just multiple deliverable binaries. We have that already (collections, math, core) and the unification of Lucene and Solr was based on that as well. So it would be one project (mailing list, committer list) but multiple modules. On Fri, Apr 16,

Re: mahout/solr integration

2010-04-16 Thread Ted Dunning
I have a friend right now who needs a jruby integration layer. It is about how code is glued together. Java is lingua franca, but there are lots of sites that are using Groovy (in the finance world especially), Ruby (in the web 2.0 world) or cloJure (in the too-cool crowd). So I definitely see

Re: mahout/solr integration

2010-04-16 Thread Jake Mannix
On Fri, Apr 16, 2010 at 11:56 AM, Sean Owen sro...@gmail.com wrote: On Fri, Apr 16, 2010 at 7:39 PM, Jake Mannix jake.man...@gmail.com wrote: I will start playing around with Anthony's github-based stuff, and see where a patch can be made. The question is where it would go? It's a fully

Re: mahout/solr integration

2010-04-16 Thread Grant Ingersoll
On Apr 16, 2010, at 3:11 PM, Jake Mannix wrote: On Fri, Apr 16, 2010 at 11:56 AM, Sean Owen sro...@gmail.com wrote: On Fri, Apr 16, 2010 at 7:39 PM, Jake Mannix jake.man...@gmail.com wrote: I will start playing around with Anthony's github-based stuff, and see where a patch can be made.