elasticsearch english words analyzer

2014-09-27 Thread Peiyong Lin
Hi there, I found that elasticsearch can't split english words if they don't have whitespace or dash between them, for example, "reversestring" should be split into "reverse" and "string". I think the it's because I didn't configure well. I have tried several analyzers listed on the elasticsearc

Any idea to remove the duplicates from the search results?

2014-09-26 Thread Peiyong Lin
Hi all, I have some documents that look like: { "name": "", "image_url: "", "simHash": "" } And there are many duplicate documents with the same `simHash` value, any idea to remove the duplicates instead of removing them in indexing time? I have seen solutions here: http://stacko

got QueryPhaseExecutionException when using custom query parser

2014-08-26 Thread Peiyong Lin
Hi all, I wrote my own custom query parser, and extended elasticsearch as a plugin, the code is in the following link. query parser http://pastebin.mozilla.org/6172836 customized query http://pastebin.mozilla.org/6172837 plugin http://pastebin.mozilla.org/6172844 I used the default settings of

how to use my customer lucene query?

2014-08-22 Thread Peiyong Lin
I have extend a new query, and I can use it with `Query q = new ExtendedBooleanQuery()` in Lucene. I want to use it in elasticsearch. How can I integrate it to elasticsearch? I googled but found no guide to integrate. Or is there any plugin point like analyzer to add my own custom made Lucen

How to use Customized Query of Lucene in Elasticsearch

2014-08-21 Thread Peiyong Lin
Hi all, I have written a customized Query and Scorer for Lucene, but now I want to use this Query in Elasticsearch. I know that I can use it as a plugin, but what should I do to register it as a plugin? I have searched for documentations or examples but I found nothing mention how to use a cus