Re: elasticsearch phraze term frequency .tf() containing multiple words

2014-10-29 Thread barry
You can also look at developing a custom analyzer so that your phrase is not broken up at white space when indexed. http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/analysis.html Selecting the correct combination of char filters and tokenizers will retain phrases. For ex

Re: elasticsearch phraze term frequency .tf() containing multiple words

2014-10-29 Thread barry
You can also look at developing a custom analyzer so that your phrase is not broken up at white space. http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/analysis.html Selecting the correct combination of char filters and tokenizers will retain phrases. On Tuesday, October

Re: elasticsearch phraze term frequency .tf() containing multiple words

2014-10-28 Thread vineeth mohan
Hello Valergi , This wont work , normally becuase the string would be tokenized into green and energy. If you use shingle token filter and set it as 2 , it might work. Or in this case , you can see the position value of both the token using the script and if its next to each other , you can take

elasticsearch phraze term frequency .tf() containing multiple words

2014-10-28 Thread valerij . vasilcenko
I want to access frequency of a phraze combined from multiple words e.g. "green energy" I can access tf of "green" and "energy", example: "function_score": { "filter" : { "terms" : { "content" : ["energy","green"]} }, "script_score": { "script": "_index['content']['en