Some people in the Elasticsearch community are using random scoring [1]
to sample a document subset from the search results. Maybe something
similar could be implemented for Solr ?
There are probably more efficient sampling solution than this one, but
this solution is likely more
any plan to implement this feature in future?
>
> Post filter should work. Like collapsing query parser.
>
> Thanks,
> Yongtao
> -Original Message-
> From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
> Sent: Tuesday, September 27, 2016 9:25 PM
> To: solr-us
Thanks,
Yongtao
-Original Message-
From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
Sent: Tuesday, September 27, 2016 9:25 PM
To: solr-user
Subject: Re: how to sampling search result
I am not sure I understand what the business case is. However, you might be
able to do something wi
inal Message-
> From: Alexandre Rafalovitch [mailto:arafa...@gmail.com]
> Sent: Tuesday, September 27, 2016 9:25 PM
> To: solr-user
> Subject: Re: how to sampling search result
>
> I am not sure I understand what the business case is. However, you might be
> able to d
range 90 - 100.
> So random field will not help.
>
> Is it possible we can sampling based on search result?
>
> Thanks,
> Yongtao
> -Original Message-
> From: Mikhail Khludnev [mailto:m...@apache.org]
> Sent: Tuesday, September 27, 2016 11:16 AM
> To: solr-user
> Sub
.
> So random field will not help.
>
> Is it possible we can sampling based on search result?
>
> Thanks,
> Yongtao
> -Original Message-
> From: Mikhail Khludnev [mailto:m...@apache.org]
> Sent: Tuesday, September 27, 2016 11:16 AM
> To: solr-user
> Subject: Re: how to sampling
on search result?
Thanks,
Yongtao
-Original Message-
From: Mikhail Khludnev [mailto:m...@apache.org]
Sent: Tuesday, September 27, 2016 11:16 AM
To: solr-user
Subject: Re: how to sampling search result
Perhaps, you can apply a filter on random field.
On Tue, Sep 27, 2016 at 5:57 PM, googoo <
Perhaps, you can apply a filter on random field.
On Tue, Sep 27, 2016 at 5:57 PM, googoo wrote:
> Hi,
>
> Is it possible I can sampling based on "search result"?
> Like run query first, and search result return 1 million documents.
> With random sampling, 50% (500K) documents