What would this MatrixSuperView do?  Would ConcatenatedMatrix be a better name?

Sent from my iPhone

On Apr 12, 2013, at 1:26, Gokhan Capan <gkhn...@gmail.com> wrote:

> Ted,
> 
> How about a MatrixSuperView implements Matrix? (A MatrixView like 
> implementation)
> 
> 
> On Fri, Apr 12, 2013 at 2:28 AM, Gokhan Capan <gkhn...@gmail.com> wrote:
> So if I understood correctly, the algorithm still runs on matrix, and a 
> client still can pass a group of matrices.
>  
> Again it came to data preparation:)
> 
> I will refactor the implementation to run on single matrix, but provide tools 
> for turning the obvious client data into actual input to the algorithm.
> 
> Sent from my iPhone
> 
> On Apr 12, 2013, at 1:13, Ted Dunning <ted.dunn...@gmail.com> wrote:
> 
>> One easy thing to do is to build an adjoined matrix type that does the 
>> concatenation on the fly.
>> 
>> 
>> 
>> 
>> On Thu, Apr 11, 2013 at 1:43 PM, Gokhan Capan <gkhn...@gmail.com> wrote:
>> Yeah, it is simpler indeed.
>> 
>> I am going to think about alternative ways to make concatenation easier for 
>> clients. 
>> 
>> Thanks for your review
>> 
>> 
>> On Thu, Apr 11, 2013 at 10:45 PM, Robin Anil <robin.a...@gmail.com> wrote:
>> I would have folded them all as different feature ids in a single vector, 
>> makes things a lot simpler and faster.
>> 
>> Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc.
>> 
>> 
>> On Thu, Apr 11, 2013 at 11:19 AM, Gokhan Capan <gkhn...@gmail.com> wrote:
>> Hi Robin,
>> 
>> If you are asking why they are arrays, it is because to save clients from 
>> concatenating multiple matrices to create the input.
>> 
>> I am quoting from libFM paper: "For easier interpretation,
>> the features are grouped into indicators for the active user (blue), active 
>> item (red), other movies rated
>> by the same user (orange), the time in months (green), and the last movie 
>> rated (brown)." 
>> 
>> I thought a client would create multiple group of matrices, and he can just 
>> pass them all to the algorithm.
>> 
>> Then the wModel is w parameters, it is still array of vectors for me to keep 
>> the indexing consistent, and vModel is the V parameters.
>> 
>> Was that what you were asking?
>> 
>> 
>> On Thu, Apr 11, 2013 at 6:44 PM, Robin Anil <robin.a...@gmail.com> wrote:
>> Comments away. I was a bit confused by the use of Vector[] for w1 and 
>> Matrix[] for inputs.
>> 
>> Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc.
>> 
>> 
>> On Thu, Apr 11, 2013 at 10:00 AM, Gokhan Capan <gkhn...@gmail.com> wrote:
>> Ted,
>> Robin,
>> 
>> Although I did not test on a dataset yet, recently I've been implementing 
>> Factorization Machines with SGD optimization.
>> 
>> The initial implementation is at https://github.com/gcapan/mahout/tree/fm
>> 
>> Would you guys consider to take a look so I can make it better and running?
>> 
>> 
>> 
>> On Mon, Apr 1, 2013 at 8:45 PM, Nkechi Nnadi <nkechi.nn...@gmail.com> wrote:
>> Hello,
>> 
>> I'm long time lurker.  I would be interested in implementing these.  I
>> thought I would get my feet wet with contributing to wiki with tutorials
>> since I have used Mahout for recommendation and clustering in my
>> dissertation.  I have never contributed code before and I would love to
>> start now.
>> 
>> -Nkechi
>> 
>> 
>> On Sun, Mar 31, 2013 at 1:14 PM, Robin Anil <robin.a...@gmail.com> wrote:
>> 
>> > FMs work really well for a whole range of things. Having implemented them
>> > myself, I can extend my services as a reviewer if anyone is willing to
>> > start on it.
>> >
>> > Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc.
>> >
>> >
>> > On Sun, Mar 31, 2013 at 2:18 AM, Ted Dunning <ted.dunn...@gmail.com>
>> > wrote:
>> >
>> > > Relative to Dan's recent mention of SOM as possible new project, here are
>> > > slides from KDD Cup 2012 in which Stephen Rendle describes how he did
>> > using
>> > > a very straightforward implementation of Factorization Machines [1,2].
>> > >
>> > >
>> > > FMs are interesting in the context of Mahout because they can be used in
>> > a
>> > > wide variety of settings including recommendation and targeting and
>> > because
>> > > they have very good performance on a number of tasks.
>> > >
>> > > I should mention that Robin was the one who first mentioned FMs to me.
>> > >
>> > > The KDD 2012 competition [3] is of interest in any case because it
>> > provides
>> > > a large amount of realistic data for commercially important problems.
>> > >
>> > > [1]
>> > >
>> > >
>> > https://kaggle2.blob.core.windows.net/competitions/kddcup2012/2748/media/RendleSlides.pdf
>> > >
>> > > [2]
>> > >
>> > >
>> > https://kaggle2.blob.core.windows.net/competitions/kddcup2012/2748/media/Rendle.pdf
>> > >
>> > > [3] http://www.kddcup2012.org/
>> > >
>> >
>> 
>> 
>> 
>> -- 
>> Gokhan
>> 
>> 
>> 
>> 
>> -- 
>> Gokhan
>> 
>> 
>> 
>> 
>> -- 
>> Gokhan
>> 
> 
> 
> 
> -- 
> Gokhan

Reply via email to