Re: Seminal Random Forests speaker - FYI

2013-02-10 Thread Ted Dunning
The problem with "using Hindi pronunciation" is 3-fold. a) it is hard to get consistent answers about the Hindi pronunciation. This seems to be partly vexed by the assumption on the part of some of the people that I have asked that I want to know the way that native English speakers mis-pronounce

Re: Implicit preferences

2013-02-10 Thread Ted Dunning
On Sun, Feb 10, 2013 at 3:39 PM, Johannes Schulte < johannes.schu...@gmail.com> wrote: > ... > i am currently implementing a system of the same kind, LLR sparsified > "term"-cooccurrence vectors in lucene (since not a day goes by where i see > Ted praising this). > (turns red) > There are not o

Re: Implicit preferences

2013-02-10 Thread Ted Dunning
Yes you are learning two matrices but the cost scales with the input data size anyway so learning the two matrices costs the same as learning one matrix on the combined day set. You could argue that the search index is larger but the index is usually so small you don't even need more than one

Re: Implicit preferences

2013-02-10 Thread Ken Krugler
On Feb 10, 2013, at 2:39pm, Johannes Schulte wrote: > Hi, > > i am currently implementing a system of the same kind, LLR sparsified > "term"-cooccurrence vectors in lucene (since not a day goes by where i see > Ted praising this). > There are not only views and purchases, but also search terms,

Re: Implicit preferences

2013-02-10 Thread Johannes Schulte
Hi, i am currently implementing a system of the same kind, LLR sparsified "term"-cooccurrence vectors in lucene (since not a day goes by where i see Ted praising this). There are not only views and purchases, but also search terms, facets and a lot more textual information to be included in the co

Re: Implicit preferences

2013-02-10 Thread Sean Owen
Yeah I bet it does actually work well... but aren't you basically spending an extra step to make the "item-item" matrix, to relearn that bought-X and viewed-X go together? yeah you learn a lot more along the way, as this is item-based recommendation at heart. It seems like you could add back that k

Re: Seminal Random Forests speaker - FYI

2013-02-10 Thread Dawid Weiss
You may find some answers here? https://cwiki.apache.org/MAHOUT/mahoutname.html Dawid On Sun, Feb 10, 2013 at 6:27 PM, Daniel Donahoe wrote: > Dear Colleagues: > > > > As Vice Chair of the Utah section of the American Society of Mechanical > Engineers (ASME), I invited Professor Adele Cutler (Ut

Re: Implicit preferences

2013-02-10 Thread Ted Dunning
Actually treating the different interactions separately can lead to very good recommendations. The only issue is that the interactions are no longer dyadic. If you think about it, having two different kinds of interactions is like adjoining interaction matrices for the two different kinds of inte

Seminal Random Forests speaker - FYI

2013-02-10 Thread Daniel Donahoe
Dear Colleagues: As Vice Chair of the Utah section of the American Society of Mechanical Engineers (ASME), I invited Professor Adele Cutler (Utah State University, USU) to speak on Big Data. The ASME/UT executive committee agreed, and the meeting was hosted by the Mechanical Engineering Depart

Re: Implicit preferences

2013-02-10 Thread Sean Owen
I think you'd have to hack the code to not exclude previously-seen items, or at least, not of the type you wish to consider. Yes you would also have to hack it to add rather than replace existing values. Or for test purposes, just do the adding yourself before inputting the data. My hunch is that