Re: Wildcard searches with leading and ending wildcard

2012-06-29 Thread Jack Krupansky
I think a double-ended wildcard essentially defeats the whole point of the reverse wildcard filter, which is to improve performance by avoiding a leading wildcard. So, if your data is such that a leading wildcard is okay, just use normal wildcards to begin with. -- Jack Krupansky -Origin
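
The point about double-ended wildcards can be illustrated with a toy Python model. This is only a sketch of the idea, not Solr's actual logic: the function names are made up for illustration, and the real filter's details are ignored. Reversing a pattern helps only when it turns a leading wildcard into a trailing one.

```python
def has_leading_wildcard(pattern: str) -> bool:
    """True if the pattern starts with a wildcard character."""
    return pattern.startswith(("*", "?"))


def plan(pattern: str) -> tuple[str, str]:
    """Pick which form of the pattern to run (hypothetical helper).

    Returns ("reversed", reversed_pattern) when reversal removes the
    leading wildcard, otherwise ("normal", pattern).
    """
    reversed_pattern = pattern[::-1]
    if has_leading_wildcard(pattern) and not has_leading_wildcard(reversed_pattern):
        return ("reversed", reversed_pattern)
    return ("normal", pattern)


# Leading wildcard only: reversal turns it into a cheap prefix query.
print(plan("*4567"))
# Wildcards at both ends: the reversed pattern still starts with "*",
# so reversal gains nothing.
print(plan("*4567*"))
```

In this model a pattern such as `*4567*` is run as a normal wildcard either way, which matches the advice above.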

Re: NGram and full word

2012-06-29 Thread Lan
The search for the full word arkadicolson exceeds 8 characters, so that's why it's not working. The fix is to add another field that will tokenize into full words. The query would look like this: some_field_ngram:arkadicolson AND some_field_whole_word:arkadicolson -- View this message in context:
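
A rough Python sketch of why the length limit matters. It assumes the query term is compared verbatim against the indexed grams, and it uses a minimum gram size of 3, which is a guess since the thread only states the maximum of 8. The function and variable names are illustrative only.

```python
def ngrams(text: str, min_n: int, max_n: int) -> set[str]:
    """All substrings of text with length between min_n and max_n."""
    return {
        text[i:i + n]
        for n in range(min_n, max_n + 1)
        for i in range(len(text) - n + 1)
    }


MIN_GRAM = 3  # assumption, not given in the thread
MAX_GRAM = 8  # maxGramSize from the thread

ngram_field = ngrams("arkadicolson", MIN_GRAM, MAX_GRAM)
whole_word_field = {"arkadicolson"}  # the extra full-word field


def field_hits(term: str) -> dict[str, bool]:
    """Report which of the two fields contains the term as a token."""
    return {
        "some_field_ngram": term in ngram_field,
        "some_field_whole_word": term in whole_word_field,
    }


for term in ("arkadic", "arkadicols", "arkadicolson"):
    print(term, field_hits(term))
```

Under these assumptions only terms of up to 8 characters can hit the n-gram field, and the full word is found only in the whole-word field.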

RE: NGram and full word

2012-06-29 Thread Klostermeyer, Michael
With the help of this list, I solved a similar issue by altering my query as follows: Before (did not return full word matches): q=searchTerm* After (returned full-word matches and wildcard searches as you would expect): q=searchTerm OR searchTerm* You can also boost the exact match by doing th

Re: Solr - query

2012-06-29 Thread Michael Della Bitta
I think quotes are legal in URL encoding, so you might get away with just putting a + between 1234 and BAY or failing that, %20. Usually it's easier if you use a Solr client-side library to make these types of calls so URL encoding isn't your problem, but I'm not sure if that's a route that's avai
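
As a sketch of letting a library handle the encoding, Python's standard `urllib.parse.urlencode` can build the query string for the quoted phrase. The port 8983 and the helper name `select_url` are assumptions for illustration; the original message does not give the port.

```python
from urllib.parse import urlencode


def select_url(base: str, phrase: str, start: int = 0, rows: int = 10) -> str:
    """Build a /select URL that searches for phrase as a quoted phrase."""
    params = {"q": f'"{phrase}"', "start": start, "rows": rows}
    # urlencode percent-encodes the quotes and turns the space into "+".
    return f"{base}/select?{urlencode(params)}"


# Host and port are placeholders, not taken from the thread.
print(select_url("http://localhost:8983/solr/addrinst", "1234 BAY"))
```

The quotes come out as `%22` and the space as `+`, so nothing in the URL needs to be escaped by hand.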

Re: Replication Issue

2012-06-29 Thread Michael Della Bitta
Ugh, after a mess of additional flailing around, it appears I just discovered that the Replicate Now form on the Replication Admin page does not work in the text-based browser 'links'. :( Running "/replication?command=fetchindex" with curl did the trick. Now everything is synced up. Thanks for you
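
For anyone scripting the same call without curl, a minimal Python sketch using only the standard library might look like the following. The base URL is a placeholder, the helper names are invented here, and the sketch only issues the same `command=fetchindex` request mentioned above; it does not check whether replication actually completed.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def fetchindex_url(slave_base: str) -> str:
    """URL of the replication handler with command=fetchindex."""
    return f"{slave_base}/replication?{urlencode({'command': 'fetchindex'})}"


def trigger_fetch(slave_base: str, timeout: float = 10.0) -> int:
    """Send the request to the slave and return the HTTP status code."""
    with urlopen(fetchindex_url(slave_base), timeout=timeout) as resp:
        return resp.status


# Placeholder host; calling trigger_fetch requires a reachable slave.
print(fetchindex_url("http://slave.example.com:8983/solr"))
```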

Re: Searching against stored wild cards

2012-06-29 Thread Upayavira
Skip the asterisk and analyse your search terms as an ngram, maybe an edge-ngram, and then it'll match. You'd be querying for: A AB AB- AB-C AB-CD AB-CD- etc... Any of those terms would match your terms. Upayavira On Fri, Jun 29, 2012, at 06:35 PM, Kissue Kissue wrote: > Hi, > > I Want to know
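
The edge-ngram idea can be sketched in a few lines of Python. This is a simplified model rather than Solr's analysis chain: the stored values are assumed to be indexed as single tokens with the trailing asterisk removed, and the function names are illustrative.

```python
def edge_ngrams(term: str, min_n: int = 1) -> list[str]:
    """All prefixes of term, shortest first."""
    return [term[:n] for n in range(min_n, len(term) + 1)]


def matching_patterns(query: str, stored: list[str]) -> list[str]:
    """Stored wildcard patterns whose prefix part is a prefix of query."""
    grams = set(edge_ngrams(query))
    # Strip the trailing "*" before comparing, as suggested above.
    return [p for p in stored if p.rstrip("*") in grams]


print(edge_ngrams("AB-CD-EF-GH")[:6])
print(matching_patterns("AB-CD-EF-GH", ["AB-CD-EF-*", "XY-*"]))
```

Here the query AB-CD-EF-GH expands to prefixes that include AB-CD-EF-, so the stored pattern AB-CD-EF-* is returned while XY-* is not.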

Re: Replication Issue

2012-06-29 Thread Erick Erickson
Clocks on the separate machines are irrelevant, so don't worry about that bit. The index version _starts out_ as a timestamp as I understand it, but from there on when you change the index and commit it should just bump up NOT get a new timestamp. 1> it's strange that the version on the master wh

Searching against stored wild cards

2012-06-29 Thread Kissue Kissue
Hi, I want to know if it is in any way possible for me to do this in Solr: 1. Store this field in Solr index - AB-CD-EF-* 2. Do a search for AB-CD-EF-GH and return back AB-CD-EF-* Thanks.

Re: Is it compulsory to define a tokenizer when defining field types in solr

2012-06-29 Thread Kissue Kissue
Thanks Erick for the clarification. Cheers! On Fri, Jun 29, 2012 at 2:08 PM, Erick Erickson wrote: > Yes, it's mandatory to define at least one tokenizer (and only one > tokenizer). If > you need the whole input treated as one token, you can use > KeywordTokenizerFactory. > > Best > Erick > > On

Solr - query

2012-06-29 Thread gopes
Hi, I am searching a string using wildcard and I would like to change my query from http://localhost:/solr/addrinst/select?q=1234+BAY&start=0&rows=10 to http://localhost:/solr/addrinst/select?q="1234 BAY"&start=0&rows=10 my request handler is edismax all 10 id name St

Re: Replication Issue

2012-06-29 Thread Michael Della Bitta
Never mind, I realized that my master index was not tickling the index version number when a commit or optimize happened. I gave in and nuked and paved it, and now it seems fine. Is there any known reason why this would happen, so I can avoid this in the future? Thanks, Michael Della Bitta -

Why won't dismax create multiple DisjunctionMaxQueries when autoGeneratePhraseQueries is false?

2012-06-29 Thread Joel Rosen
Hi, I am trying to configure Solr for Chinese search and I've been having trouble getting the dismax query parser to behave correctly. In schema.xml, I'm using SmartChineseAnalyzer on my fulltext field with autoGeneratePhraseQueries="false". I've verified that it is correctly tokenizing Chinese w

Using custom user-defined caches to store user app data while indexing

2012-06-29 Thread Iana Atanassova
Hi, I'm trying to implement a custom UpdateRequestProcessorFactory class that works with the XSLT Request handler for indexing. My UpdateRequestProcessorFactory has to examine some of the document fields and compare them against some regular expressions that are stored in an external MySQL databas

Re: Trying to avoid filtering on score, as I'm told that's bad

2012-06-29 Thread mcb
Thanks, this worked using: &qq={!func}sub(sum(geodist(pt1,30.271567,-97.741886),geodist(pt2,36.054889,-95.716187),product(1.609344, Dist)), 1000) asc &sort=$qq &fq={!frange u=100}$qq -- View this message in context: http://lucene.472066.n3.nabble.com/Trying-to-avoid-filtering-on-score-as-I-m-

Re: How do we use HTMLStripCharFilterFactory

2012-06-29 Thread derohit
thnks @Kiran...will do things u have suggested and hope it works...thnks again.. Rgds Rohit -- View this message in context: http://lucene.472066.n3.nabble.com/How-do-we-use-HTMLStripCharFilterFactory-tp3991955p3992104.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Strange "spikes" in query response times...any ideas where else to look?

2012-06-29 Thread solr
Otis, Thanks for the response. We'll check out that tool and see how it goes. Regarding JMeter...you are exactly correct in that I was assuming 1 thread = 1 query per second. I thought we had set up some sort of throttling mechanism to ensure that...and clearly I was mistaken. By the math
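
The thread/throughput confusion can be made concrete with a small calculation. The numbers below are hypothetical and not taken from the thread; the formula is the usual closed-loop approximation in which each thread sends its next request as soon as the previous response (plus any think time) has finished.

```python
def closed_loop_qps(threads: int, response_s: float, think_s: float = 0.0) -> float:
    """Approximate queries per second generated by looping load threads."""
    return threads / (response_s + think_s)


# Hypothetical figures: 10 threads, 250 ms average response time.
print(closed_loop_qps(10, 0.25))        # no throttling between requests
print(closed_loop_qps(10, 0.25, 0.75))  # padded to one request per second per thread
```

Without a timer or throttle, 10 threads at 250 ms per response generate about 40 queries per second in this model, rather than the 10 one would expect from "1 thread = 1 query per second".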

Replication Issue

2012-06-29 Thread Michael Della Bitta
Hi, I'm having trouble with replication on a brand new rollout of 3.6. Basically I've traced it to the slave always thinking the index it creates when it warms up is newer than what's on the master, no matter what I do... deleting the slave's index, committing or optimizing on the master, etc. I c

Wildcard searches with leading and ending wildcard

2012-06-29 Thread maurizio1976
Hi all, I've been searching for an answer to this everywhere but I can never find an answer that is perfect for my case, so I'll ask this myself. I'm on Solr 3.6. I use the *ReversedWildcardFilterFactory* in a field containing a telephone number. So only one word to be indexed, no phrase

Re: Is it compulsory to define a tokenizer when defining field types in solr

2012-06-29 Thread Erick Erickson
Yes, it's mandatory to define at least one tokenizer (and only one tokenizer). If you need the whole input treated as one token, you can use KeywordTokenizerFactory. Best Erick On Thu, Jun 28, 2012 at 11:10 AM, Kissue Kissue wrote: > Hi, > > When defining a fieldtype is it compulsory to include

Re: Query Logic Question

2012-06-29 Thread Erick Erickson
I think you're assuming that this is Boolean logic. It's not, see: http://www.lucidimagination.com/blog/2011/12/28/why-not-and-or-and-not/ Best Erick On Thu, Jun 28, 2012 at 9:27 AM, Rublex wrote: > Jack, > > Thank you the *:* solutions seems to work. > > -- > View this message in context: > ht

Re: what is precisionStep and positionIncrementGap:

2012-06-29 Thread Erick Erickson
For precisionStep, see: http://lucene.apache.org/core/3_6_0/api/all/org/apache/lucene/search/NumericRangeQuery.html?is-external=true positionIncrementGap is for multiValued text fields; it is the "space" put between the last token of one entry and the first of the next. e.g. some stuff more things

Re: Strange behaviour with default request handler

2012-06-29 Thread Ahmet Arslan
> And when i search for "soph", i only get "Sophie" in the > results and not "Sophia". Do you want your query q=soph to return both Sophie and Sophia? If that's the case then you can use wildcard queries. q=soph* Also you didn't provide field definition type="text". It seems that you have stemmi

leaks in solr

2012-06-29 Thread Bernd Fehling
Hi list, while monitoring my Solr 3.6.1 installation I noticed an increase of memory usage in the OldGen JVM heap on my slave. I decided to force a Full GC from jvisualvm and send optimize to the already optimized slave index. Normally this helps because I have monitored this issue in the past. Bu

NGram and full word

2012-06-29 Thread Arkadi Colson
Hi I have a question regarding the NGram filter and full word search. When I insert "arkadicolson" into Solr and search for "arkadic", solr will find a match. When searching for "arkadicols", Solr will not find a match because the maxGramSize is set to 8. However when searching for the full wo