My customer want to get the 10000th-10010th added docs So I have to sort by timestamp, to get top10010 docs' timestamp ……
2008/3/20, Walter Underwood <[EMAIL PROTECTED]>: > > Why do you want the 10,000th most relevant result? > That seems very, very odd. Most people need the most > relevant result. Maybe the ten most relevant results. > > "I'm searching for the movie 'Ratatouille', but please > give me the 10,001st result instead of that movie." > > If you explain your desire, we may have a better approach. > > wunder > == > Search Guy, Netflix > > On 3/19/08 10:43 PM, "李银松" <[EMAIL PROTECTED]> wrote: > > > I am not getting 10000 records > I am getting records from 10000-10010 > So I > > need the top10010 records' *sort field* to merge and get final > results,just > > like the distributed search > the data to transport is about 500k(10000 docs' > > scores) > and the QTime is about 100ms > but the total time I used is about 10+ > > seconds > I want to know it really cost so much time or something other is wrong > > . > > > > 2008/3/20, Walter Underwood <[EMAIL PROTECTED]>: > > > > Getting 10,000 > > records will be slow. > > > > What are you doing with 10,000 records? > > > > > > wunder > > > > On 3/19/08 10:07 PM, "李银松" <[EMAIL PROTECTED]> wrote: > > > > > I > > want to get the top 10000-10010 record from two different servers,So > > > > Ihave > > > to get top10010 scores from each server and merge them to get > > the > > results. > > I > > > found the cost time was mostly used in XMLResponseParser > > while parsing > > > the > > inputstream. > > I wander whether the costtime was used > > for net transport or > > > for Solr to > > prepare for transport? Or just > > something wrong with my server? > > > > > > > 在08-3-20,Yonik Seeley > > <[EMAIL PROTECTED]> 写道: > > > > > > 2008/3/19 李银松 > > > <[EMAIL PROTECTED]>: > > > > > > 1、When I set fl=score ,solr returns just as > > > fl=*,score ,not just scores > > > > > > Is it a bug or just do it on purpose? > > > > > > On > > > purpose... a score > > alone with no other context doesn't seem useful. > > > > > > > 2 > > > 、I'm using > > solrj to get about 10000 docs' score in LAN. It costs me > > > about > > > > > > > > > 10+ seconds first time(QTime is less than 100ms) , but 1-2 seconds > > > > > > > > second > > > > time with the same querystring. It seems a bit too long for > > the > > > first > > > > time(total size of the doc to transport is about 500k). > > Is there > > > > > > anything i > > > > can do with it? > > > > > > What are you trying > > to do with that many > > > scores? > > > Search engines are optimized more for > > retrieving the top n matches > > > > > > (where n is ~10 - 100) > > > > > > -Yonik > > > > > > > > > > > > > >