Re: CPU Intensive Scoring Alternatives

2017-02-21 Thread Fuad Efendi
“text_general” fields, with the default Solr operator “OR” and the default “mm” (which should be “1” for “OR”). From: Walter Underwood Reply: solr-user@lucene.apache.org Date: February 21, 2017 at 5:24:23 PM To: solr-user@lucene.apache.org Subject: Re: CPU Intensive Scoring Alternatives 300

Re: CPU Intensive Scoring Alternatives

2017-02-21 Thread Walter Underwood
>> solr-user@lucene.apache.org, Ahmet Arslan <iori...@yahoo.com> >> Date: February 21, 2017 at 3:02:11 AM >> To: solr-user@lucene.apache.org

Re: CPU Intensive Scoring Alternatives

2017-02-21 Thread Doug Turnbull
wrote: > Thank you Ahmet, I will try it; sounds reasonable > > From: Ahmet Arslan > Reply: solr-user@lucene.apache.org, Ahmet Arslan > Date: February 21, 2017 at 3:02:11 AM > To: solr-user@lucene.apache.org > Subject: Re: CPU Intensive Scoring Alternati

Re: CPU Intensive Scoring Alternatives

2017-02-21 Thread Fuad Efendi
Thank you Ahmet, I will try it; sounds reasonable. From: Ahmet Arslan Reply: solr-user@lucene.apache.org, Ahmet Arslan Date: February 21, 2017 at 3:02:11 AM To: solr-user@lucene.apache.org Subject: Re: CPU Intensive Scoring Alternatives Hi, New default similarity is BM25. Maybe

Re: CPU Intensive Scoring Alternatives

2017-02-21 Thread Ahmet Arslan
Hi, New default similarity is BM25. Maybe explicitly set the similarity to TF-IDF and see how it goes? Ahmet On Tuesday, February 21, 2017 4:28 AM, Fuad Efendi wrote: Hello, Default TF-IDF performs poorly with the 200 million indexed documents. The query "Michael Jackson" may run 300ms, and "Mich

CPU Intensive Scoring Alternatives

2017-02-20 Thread Fuad Efendi
Hello, Default TF-IDF performs poorly with the 200 million indexed documents. The query "Michael Jackson" may run 300ms, and "Michael The Jackson" over 3 seconds. eDisMax. Because of the default operator "OR" and the stopword "The", we have 50-70 million documents as a query result, and scoring is CPU intensiv
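
A rough illustration of the effect described in the original post, not Solr code: with an "OR" query, every document containing any query term becomes a candidate that has to be scored, so one very common term such as "The" can dominate the work. The toy postings lists, the document ids, and the `candidates` helper are all hypothetical, and the `mm` argument here only loosely mimics the minimum-should-match idea mentioned in the thread.

```python
from collections import Counter

# Hypothetical toy postings lists: term -> set of document ids.
postings = {
    "michael": {1, 2, 3},
    "jackson": {2, 3, 4},
    "the": set(range(1, 101)),  # a stopword that appears in every document
}


def candidates(terms, mm=1):
    """Return ids of documents that match at least `mm` of the query terms."""
    hits = Counter()
    for term in terms:
        for doc_id in postings.get(term, ()):
            hits[doc_id] += 1
    return {doc_id for doc_id, n in hits.items() if n >= mm}


# "OR" semantics (mm=1): any single matching term makes a document a candidate.
two_terms = candidates(["michael", "jackson"])
with_stopword = candidates(["michael", "the", "jackson"])

# Requiring two matching terms shrinks the candidate set again.
stricter = candidates(["michael", "the", "jackson"], mm=2)

print(len(two_terms), len(with_stopword), len(stricter))  # prints: 4 100 4
```

In this toy index the stopword raises the number of documents to score from 4 to 100, and requiring at least two matching terms brings it back to 4. Whether a higher "mm" or stopword removal helps on a real 200 million document index would need to be measured.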