Re: Mahout GSoC 2010 proposal: Association Mining

2010-04-09 Thread Lukáš Vlček
reading. Could you put up an interesting example to help us understand this method. Maybe starting from a transaction of shopping cart item ? A great demo is big plus for a GSOC project. Robin On Mon, Mar 29, 2010 at 1:46 AM, Lukáš Vlček lukas.vl...@gmail.com wrote: Hello, I would

Re: Mahout GSoC 2010 proposal: Association Mining

2010-04-09 Thread Lukáš Vlček
:26 AM, Lukáš Vlček lukas.vl...@gmail.com wrote: One example would be analysis of click stream, where you can learn that those people visiting some negative comment on product blog never enter order form. Not saying this is best example but in general this is the essence of it. You

Re: A request for prospective GSOC students

2010-04-03 Thread Lukáš Vlček
Hi, My proposal had the following subject: Mahout GSoC 2010 proposal: Association Mining It was missing time schedule and further implementation details. I can work on those missing parts but I was rather expecting some general discussion about this topic first before I invest time in time

Mahout GSoC 2010 proposal: Association Mining

2010-03-28 Thread Lukáš Vlček
Hello, I would like to apply for Mahout GSoC 2010. My proposal is to implement Association Mining algorithm utilizing existing PFPGrowth implementation ( http://cwiki.apache.org/MAHOUT/parallelfrequentpatternmining.html). As for the Assoiciation Mining I would like to implement a very general

Re: Freq. Pattern Mining page?

2010-02-12 Thread Lukáš Vlček
Robin, how much is your implementation based on http://infolab.stanford.edu/~echang/recsys08-69.pdf ? I mean if it is close enough then it might be a good idea to include a link to the paper. If it differs more then it would be very good to elaborate what is different in your approach. Regards,

Re: Freq. Pattern Mining page?

2010-02-12 Thread Lukáš Vlček
. and a sorting step to achieve compression of trees via TransactionTree class Can I take an extension on that? I will include the link to the paper for now Robin On Fri, Feb 12, 2010 at 8:54 PM, Lukáš Vlček lukas.vl...@gmail.com wrote: Robin, how much is your implementation based

Re: Mahout book

2009-09-22 Thread Lukáš Vlček
Hello Sean, as a Mahout fan I can help with charts, diagrams or schema pictures if needed. Let's make this book looking real good. Is it true that Manning is forcing authors to use MS Word? Still it should be possible to use PS, EPS or maybe PDF for vector graphics, correct? Anyway, I would love

Re: What's the plan for Mahout?

2009-09-07 Thread Lukáš Vlček
Hi, just a note: Wouldn't it be better to talk about MapReduce as opposed to Hadoop? This means that for each algorithm implemented in Mahout it should be clearly stated wheter it is MapReduce based implementation or not (or using other ways to make it scalable). I can imagine it could be useful

Re: What's the plan for Mahout?

2009-09-07 Thread Lukáš Vlček
May be there is no direct equivalent but there are many ways how one can build MapReduce architecture into existing system without Hadoop. And there is something all these systems have in common at high level. I can see many existing systems are adding MapReduce paradigm into their stack (e.g.:

Re: What's the plan for Mahout?

2009-09-07 Thread Lukáš Vlček
(In comparison, take a look at something as simple as logging. Through people inventing abstractions, and abstractions on abstractions, it's actually turned into something difficult to manage. Using SL4FJ, putting in the right bindings .jar so it routes through Log4J -- and don't forget

Re: Powered By Logo

2009-09-04 Thread Lukáš Vlček
me know if you have any comments. Regards, Lukas On Thu, Sep 3, 2009 at 3:56 PM, Lukáš Vlček lukas.vl...@gmail.com wrote: sure... :-) On Thu, Sep 3, 2009 at 3:55 PM, Grant Ingersoll gsing...@apache.orgwrote: Would it be possible to just get a scalable version so people can do

Re: Yourkit License for all of you

2009-09-03 Thread Lukáš Vlček
Hi, is there IntelliJIDEA support? Web page does not mention IDEA ide: http://www.yourkit.com/changes/index.jsp#ide Regards, Lukas On Wed, Sep 2, 2009 at 10:33 AM, Robin Anil robin.a...@gmail.com wrote: Dear Mahout Devs,Yourkit sales rep gave me my opensource license. If

Frequent Itemset Mining using MapReduce (interesting paper)

2009-04-01 Thread Lukáš Vlček
Hi, For anybody who might be interested in frequent itemset mining using MapReduce: http://www.haoyuanli.com/publication/recsys08-69.pdf Their application of search query recommendation and related search is interesing. Regards, Lukas -- http://blog.lukas-vlcek.com/

Re: aprior algorithm in MR

2008-08-20 Thread Lukáš Vlček
Hi, Actually, I have a plan to implement something like FP-Growth for Mahout (but due to lack of time the progress is slow so far). As for the tree traversal it is considered to be one of the most difficult tasks within MR paradigm (see original Google lecture videos on MR programming). However,

Re: Maven POMs

2008-08-05 Thread Lukáš Vlček
I was always asking myself what is so special about projects like Lucene, Hadoop (and possibly Mahout?) that maven can not be used as a project management tool for them. I haven't heard an answer until these days (... these projects are complex is not an answer for me). Lukas On Tue, Aug 5, 2008

Re: Maven POMs

2008-08-05 Thread Lukáš Vlček
. Why change it? On Aug 5, 2008, at 9:27 AM, Lukáš Vlček wrote: I was always asking myself what is so special about projects like Lucene, Hadoop (and possibly Mahout?) that maven can not be used as a project management tool for them. I haven't heard an answer until these days