Have a look at:

*
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/analysis-delimited-payload-tokenfilter.html
*
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/modules-advanced-scripting.html




On 23 August 2014 15:04, Scott Decker <sc...@publishthis.com> wrote:

> Hey all,
>   a question on possible search paths/structure.  If we have a text
> document, and we have run our magic over it and come away with Topics and
> Entities (Like, Barack Obama and Apple Inc.) and we have a relevancy score
> for each one, what would be the best way to store and query against them?
>
> we currently are trying a parent/child relationship, where the children
> are the terms with their relevancy score and the scoring of the parent text
> document gets done from the relevancy scores of the children. That works.
> Just worried about speed of parent/child against millions of documents.
>
> Another way we could think of was, build our own scorer/analyzer.  If we
> are reading in tokens like BarackObama.93345|AppleInc.0034
> where it has the topic and the relevancy score to the document in it, i
> can build an analyzer to read those sorts of tokens, but is there any way
> to build a scorer that can use that token match data to score?
>
> and third, is there any other way to normalize this data into one document
> so we can score on it. That seems like it would be the fastest way to
> query, but my #2 option here is the only way I can think of doing it.
> Anyone else tagging their documents with relevancy scores to topics, on the
> document and then letting people search for those topics and pulling back
> the relevant docs based on the per document relevancy scores?
>
> Thanks,
> Scott
>
>  --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to elasticsearch+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/9434db79-363f-4470-bf91-b960908c2de6%40googlegroups.com
> <https://groups.google.com/d/msgid/elasticsearch/9434db79-363f-4470-bf91-b960908c2de6%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAPt3XKQmp%3D4Tjft6REtvWZ-2%3DmhqPqnnk-OXtK3oPqw4iNFJmw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to