RE: Spellchecking - looking for general advice

2014-05-03 Thread Susheel Kumar
What value have you set for maxEdits? It should catch the restaurant example with the edit distance set to 2. Thanks, Susheel -Original Message- From: Maciej Dziardziel [mailto:fied...@gmail.com] Sent: Friday, May 02, 2014 7:05 PM To: solr-user@lucene.apache.org Subject: Spellchecking -

Re: Spellchecking - looking for general advice

2014-05-03 Thread Maciej Dziardziel
Hi, I've set it to 2, but a Python implementation of Levenshtein says it's 3 for restraunt -> restaurant. On Sat, May 3, 2014 at 2:44 PM, Susheel Kumar susheel.ku...@thedigitalgroup.net wrote: What value have you set for maxEdits? It should catch the restaurant example with the edit distance set to 2.
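
For reference, the distance of 3 can be checked with a minimal sketch of plain Levenshtein distance in Python. This is not the implementation used in the thread (which is not shown) and it is not Solr's own matching code; the function name levenshtein is only illustrative.

```python
def levenshtein(a, b):
    """Plain Levenshtein distance: insertions, deletions, substitutions."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


print(levenshtein("restraunt", "restaurant"))  # 3
```

The two words share the prefix "rest" and the suffix "nt", so the whole cost comes from turning "rau" into "aura", which takes three single-character edits. A limit of 2 edits would therefore not match this pair under plain Levenshtein distance.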

Re: PostingHighlighter complains about no offsets

2014-05-03 Thread Michael Sokolov
For posterity, in case anybody follows this thread, I tracked the problem down to WordDelimiterFilter; apparently it creates an offset of -1 in some cases, which PostingsHighlighter rejects. -Mike On 5/2/2014 10:20 AM, Michael Sokolov wrote: I checked using the analysis admin page, and I

Re: Re: PostingHighlighter complains about no offsets

2014-05-03 Thread Markus Jelsma
Hello Michael, are you not on Lucene 4.8 yet? https://issues.apache.org/jira/plugins/servlet/mobile#issue/LUCENE-5111 Michael Sokolov msoko...@safaribooksonline.com wrote: For posterity, in case anybody follows this thread, I tracked the problem down to WordDelimiterFilter; apparently it creates

Re: Re: PostingHighlighter complains about no offsets

2014-05-03 Thread Ahmet Arslan
Hi, so this is all about posIncAttribute? I had opened https://issues.apache.org/jira/browse/SOLR-3193 about ReversedWildcardFilterFactory causing highlighter exceptions. I wonder whether ReversedWildcardFilter has a similar bug. Ahmet On Saturday, May 3, 2014 9:39 PM, Markus Jelsma

Re: PostingHighlighter complains about no offsets

2014-05-03 Thread Michael Sokolov
No, not yet, but that could be one more reason to upgrade. The performance boost from PH is quite nice. In my test, it's about 7x faster than the default highlighter, almost 2x faster than the fast vector highlighter, and only about a 50% penalty compared to no highlighting at all, so this could

RE: Spellchecking - looking for general advice

2014-05-03 Thread Susheel Kumar
Got it. Are you also considering stemming and phonetic matching here? For example, phonetic matching may catch some of the restaurant variations, stemming may reduce recruiter and recruited to their base word, and spell check would then act as the catch-all for whatever remains. -Original Message- From: Maciej Dziardziel
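
As a rough illustration of the phonetic idea, the sketch below is a simplified Soundex encoder written from scratch in Python. Soundex is an assumption chosen for brevity: the thread does not say which phonetic encoder is meant, and this is not Solr's phonetic filter. The name soundex and the lookup table _CODES are illustrative only.

```python
# Consonant groups of classic American Soundex (vowels, h, w, y have no code).
_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        _CODES[ch] = digit


def soundex(word):
    """Return a 4-character Soundex code (simplified sketch)."""
    word = "".join(ch for ch in word.lower() if ch.isalpha())
    if not word:
        return ""
    result = word[0].upper()
    prev = _CODES.get(word[0], "")
    for ch in word[1:]:
        code = _CODES.get(ch, "")
        # Emit a digit only when it differs from the previous code.
        if code and code != prev:
            result += code
        # h and w do not separate letters that share a code; vowels do.
        if ch not in "hw":
            prev = code
    return (result + "000")[:4]


print(soundex("restaurant"), soundex("restraunt"))  # R236 R236
```

Both spellings collapse to the same code here, which is the kind of match a phonetic field could provide even when the edit distance exceeds the configured limit. Phonetic codes are coarse, so unrelated words can share a code as well.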

Every minute update on solrclound

2014-05-03 Thread eakarsu
I am using Solr 4.3.1 with the attached solrconfig.xml file. softCommit is not enabled in the config file, but the master node keeps receiving an update document every minute. I could not figure out where this update is coming from. Solr cloud master: [04/May/2014:01:50:54 +] GET

Re: Every minute update on solrclound

2014-05-03 Thread Shawn Heisey
On 5/3/2014 8:01 PM, eakarsu wrote: I am using Solr 4.3.1 with the attached solrconfig.xml file. softCommit is not enabled in the config file, but the master node keeps receiving an update document every minute. I could not figure out where this update is coming from. Solr cloud master: