Re: Bucketing (was Re: Wikia search goes live today)

2008-01-09 Thread Grant Ingersoll
Would be a nice contrib module, though... -Grant On Jan 9, 2008, at 5:30 AM, Andrzej Bialecki wrote: Otis Gospodnetic wrote: Sounds useful. I suppose this means one would have a custom function for within-bucket reordering? e.g. for a web search you might reorder based on the URL length if you

Re: Bucketing (was Re: Wikia search goes live today)

2008-01-09 Thread Andrzej Bialecki
Otis Gospodnetic wrote: Sounds useful. I suppose this means one would have a custom function for within-bucket reordering? e.g. for a web search you might reorder based on the URL length if you think shorter URLs are an indicator of Yes, that's precisely the idea. It combines the advantages of
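
A rough illustration of the within-bucket reordering idea discussed above. The excerpt does not say how buckets are formed, so this sketch assumes fixed-width score ranges; `Hit`, `bucket_and_reorder`, `bucket_width` and the URL-length key are hypothetical names chosen for the example, not Lucene or Nutch API.

```python
from typing import Callable, Iterable, List, NamedTuple


class Hit(NamedTuple):
    """A search hit; hypothetical structure for illustration only."""
    url: str
    score: float


def bucket_and_reorder(
    hits: Iterable[Hit],
    bucket_width: float,
    within_key: Callable[[Hit], float],
) -> List[Hit]:
    """Group hits into score buckets, then reorder inside each bucket.

    Assumption: a bucket is a fixed-width score range. Buckets keep their
    relevance order (highest first); only the order inside a bucket is
    decided by the secondary key.
    """
    buckets = {}
    for hit in hits:
        bucket_id = int(hit.score // bucket_width)
        buckets.setdefault(bucket_id, []).append(hit)

    ordered: List[Hit] = []
    for bucket_id in sorted(buckets, reverse=True):
        ordered.extend(sorted(buckets[bucket_id], key=within_key))
    return ordered


if __name__ == "__main__":
    hits = [
        Hit("http://example.com/a/very/long/path", 0.93),
        Hit("http://example.com/", 0.91),
        Hit("http://example.org/x", 0.40),
    ]
    # Shorter URLs first within a bucket, as in the web search example.
    for hit in bucket_and_reorder(hits, 0.5, lambda h: len(h.url)):
        print(hit.url, hit.score)
```

With a bucket width of 0.5, the two hits scoring 0.93 and 0.91 land in the same bucket and are ordered by URL length, while the 0.40 hit stays behind them regardless of its URL.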

Bucketing (was Re: Wikia search goes live today)

2008-01-08 Thread Otis Gospodnetic
uary 8, 2008 5:24:01 PM Subject: Re: Wikia search goes live today Ryan McKinley wrote: > Andrzej Bialecki wrote: >> Lukas Vlcek wrote: >>> So starring will be accommodated only during the indexing phase. Does it >>> mean it >>> will be a pretty static value, not a dyn

Re: Wikia search goes live today

2008-01-08 Thread Dennis Kubes
Sorry about not responding to this before now, been a little busy :). For those of you who don't know me, I am a committer on the Nutch project. I have been working with Wikia since early July and more actively since the beginning of November. Before Wikia I helped start another search engin

Re: Wikia search goes live today

2008-01-08 Thread Andrzej Bialecki
Ryan McKinley wrote: Andrzej Bialecki wrote: Lukas Vlcek wrote: So starring will be accommodated only during the indexing phase. Does it mean it will be a pretty static value, not a dynamically changing variable... correct? In other words, if I add my stars to some document it won't affect the scorin

Re: Wikia search goes live today

2008-01-08 Thread Lukas Vlcek
I should note that this technique is probably not easily applicable to the current Lucene scoring mechanism without additional development. On 1/8/08, Lukas Vlcek <[EMAIL PROTECTED]> wrote: > > After checking the Lucene API of ParallelReader it seems that the star > score could be stored in a different

Re: Wikia search goes live today

2008-01-08 Thread Lukas Vlcek
After checking the Lucene API of ParallelReader it seems that the star score could be stored in a different index which shares the same identifier for the documents. Such an index could be small (partitioned into many small indices?) so the updates can be fast. Is that what you meant, Andrzej? ;-) Anyway,
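
The data layout suggested above can be sketched as two stores keyed by the same document identifier: a large, rarely rebuilt text index and a small star store that is cheap to update. This is only a toy model under stated assumptions, not the ParallelReader API; `rescore`, `star_weight` and the additive formula are hypothetical and chosen purely for illustration.

```python
from typing import Dict, List, Tuple


def rescore(
    text_scores: Dict[str, float],
    star_counts: Dict[str, int],
    star_weight: float = 0.1,
) -> List[Tuple[str, float]]:
    """Combine text relevance with star counts looked up by shared doc id.

    Assumption: the combined score is text score plus a weighted star count.
    Documents without stars simply keep their text score.
    """
    combined = {
        doc_id: score + star_weight * star_counts.get(doc_id, 0)
        for doc_id, score in text_scores.items()
    }
    return sorted(combined.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Stand-in for the main index: scores for one query, rebuilt rarely.
    text_scores = {"doc1": 1.0, "doc2": 0.9}
    # Stand-in for the small side index: updated whenever a user adds a star.
    star_counts: Dict[str, int] = {}

    print(rescore(text_scores, star_counts))
    star_counts["doc2"] = 5  # a fast update that never touches the main index
    print(rescore(text_scores, star_counts))
```

Because only the small star store changes, a new star can influence ranking on the next query without waiting for a full indexing cycle, which is the property being asked about in the thread.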

Re: Wikia search goes live today

2008-01-08 Thread Ryan McKinley
Andrzej Bialecki wrote: Lukas Vlcek wrote: So starring will be accommodated only during the indexing phase. Does it mean it will be a pretty static value, not a dynamically changing variable... correct? In other words, if I add my stars to some document it won't affect the scoring immediately but afte

Re: Wikia search goes live today

2008-01-08 Thread Andrzej Bialecki
Lukas Vlcek wrote: So starring will be accommodated only during the indexing phase. Does it mean it will be a pretty static value, not a dynamically changing variable... correct? In other words, if I add my stars to some document it won't affect the scoring immediately but after an indexing cycle. Correct?

Re: Wikia search goes live today

2008-01-08 Thread Lukas Vlcek
> was referring to in my comment so I'm looking forward to more > >>> contributions from Dennis and his coworkers! :) > >>> > >>> Otis > >>> -- > >>> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > >>>

Re: Wikia search goes live today

2008-01-08 Thread Michael Stoppelman
> >>> after I posted my comment and clarified that he is that Wikia > >>> developer I > >>> was referring to in my comment.... so I'm looking forward to more > >>> contributions from Dennis and his coworkers! :) > >>> > >>> Otis

Re: Wikia search goes live today

2008-01-08 Thread Dennis Kubes
m: Grant Ingersoll <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, January 7, 2008 11:21:33 AM Subject: Re: Wikia search goes live today One other thing to note, you can definitely see Lucene in action (or Nutch, that is) by clicking on the score returned for a given document (t

Re: Wikia search goes live today

2008-01-08 Thread Mike Klaas
tributions from Dennis and his coworkers! :) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Grant Ingersoll <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, January 7, 2008 11:21:33 AM Subject: Re: Wikia search goes live tod

Re: Wikia search goes live today

2008-01-08 Thread Grant Ingersoll
On Jan 8, 2008, at 2:55 AM, Lukas Vlcek wrote: BTW: 1) If they have made any improvements/changes to Nutch (or Lucene/Hadoop) code and they keep it closed, then how can they claim they are using open-source algorithms? They are "using" it, they just aren't sharing it. Many companies out

Re: Wikia search goes live today

2008-01-07 Thread Lukas Vlcek
om Dennis and his coworkers! :) > > > > Otis > > -- > > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > - Original Message ---- > > From: Grant Ingersoll <[EMAIL PROTECTED]> > > To: java-user@lucene.apache.org > > Sent: Monday, January 7

Re: Wikia search goes live today

2008-01-07 Thread Lukas Vlcek
ematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > - Original Message > From: Grant Ingersoll <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Monday, January 7, 2008 11:21:33 AM > Subject: Re: Wikia search goes live today > > One other thin

Re: Wikia search goes live today

2008-01-07 Thread Otis Gospodnetic
ent: Monday, January 7, 2008 11:21:33 AM Subject: Re: Wikia search goes live today One other thing to note, you can definitely see Lucene in action (or Nutch, that is) by clicking on the score returned for a given document (try searching for Lucene) and you see, in all its glory, the

Re: Wikia search goes live today

2008-01-07 Thread Grant Ingersoll
Hi, I noticed that Wikia search goes live today (see http://www.devxnews.com/article.php/3719906). Does anybody know where I could find more technical information about their solution? Are they going to contribute their enhancements back to Lucene/Nutch/Hadoop code? My understanding is that

Re: Wikia search goes live today

2008-01-07 Thread Grant Ingersoll
On Jan 7, 2008, at 7:48 AM, Lukas Vlcek wrote: Hi, I noticed that Wikia search goes live today (see http://www.devxnews.com/article.php/3719906). Does anybody know where I could find more technical information about their solution? Are they going to contribute their enhancements back to

Wikia search goes live today

2008-01-07 Thread Lukas Vlcek
Hi, I noticed that Wikia search goes live today (see http://www.devxnews.com/article.php/3719906). Does anybody know where I could find more technical information about their solution? Are they going to contribute their enhancements back to Lucene/Nutch/Hadoop code? My understanding is that as