Re: Query regarding solr plugin.

2011-04-25 Thread rajini maski
Erick , * * * Thanks.* It was actually a copy mistake. Anyways i did a redo of all the below mentioned steps. I had given class name as filter class=pointcross.orchSynonymFilterFactory synonyms=synonyms.txt ignoreCase=true expand=true/ I did it again now following few different steps following

Re: Suggester with multi terms

2011-04-25 Thread Em
blocky, Shingles should be your way. Regards, Em -- View this message in context: http://lucene.472066.n3.nabble.com/Suggester-with-multi-terms-tp2859547p2860419.html Sent from the Solr - User mailing list archive at Nabble.com.

Different Cluster Results on Different Servers, with same SOLR setup

2011-04-25 Thread Pawan Darira
Hi I have same Solr 1.4 setup on two different servers, One for production One for Staging. My production server gives proper cluster Staging server give wrong cluster. The problem is for date related cluster only I have checked all the configuration setup. everything seems fine. i am

Re: Query regarding solr plugin.

2011-04-25 Thread Erick Erickson
Looking at things more carefully, it may be one of your dependent classes that's not being found. A couple of things to try. 1 when you do a 'jar -tfv yourjar, you should see output like: 1183 Sun Jun 06 01:31:14 EDT 2010 org/apache/lucene/analysis/sinks/TokenTypeSinkTokenizer.class and your

Re: Different Cluster Results on Different Servers, with same SOLR setup

2011-04-25 Thread Erick Erickson
There's not much information to go on here. You haven't stated the problem so people unfamiliar with your setup can understand it. What is the error you're getting? Show us the configurations, please. You might want to review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Mon, Apr

Re: Unable to load EntityProcessor implementation for entity:16865747177753

2011-04-25 Thread vrpar...@gmail.com
Thanks firdous_kind86 i replace tikaentityprocessor with xpathentityprocessor and works fine -- View this message in context: http://lucene.472066.n3.nabble.com/Unable-to-load-EntityProcessor-implementation-for-entity-16865747177753-tp2846513p2861229.html Sent from the Solr - User mailing list

how to concatenate two nodes of xml with xpathentityprocessor

2011-04-25 Thread vrpar...@gmail.com
hello , i am using Xpathentityprocessor to do index xml files below is my xml file Full Customer name=a id=1 .. other attributes CustomerA/Customer Customer name=b id=2 .. other attributes ThisB/Customer Customer name=c id=3 .. other attributes AnyC/Customer

Re: MoreLikeThis

2011-04-25 Thread Brian Lamb
It finds something under match but just nothing under response. I tried turning on debugQuery=on but I did not see anything that jumped out at me as a bug or anything. Is there some kind of threshold setting that I can tinker with to see if that is the problem? On Sun, Apr 24, 2011 at 2:37 AM,

RE: term position question from analyzer stack for WordDelimiterFilterFactory

2011-04-25 Thread Robert Petersen
Sorry, that was supposed to be just another way to say the same thing... OK look here is my current situation. Even with preserveOriginal and concatAll set, I am still getting an even odder result. I set up sku=218078624 with title= Beanbag AppleTV Friction Dash Mount for GPS and index it in

Re: term position question from analyzer stack for WordDelimiterFilterFactory

2011-04-25 Thread Yonik Seeley
On Mon, Apr 25, 2011 at 12:15 PM, Robert Petersen rober...@buy.com wrote: The search and index analyzer stack are the same. Ahhh, they should not be! Using both generate and catenate in WDF at query time is a no-no. Same reason you can't have multi-word synonyms at query time:

Good protwords.txt ?

2011-04-25 Thread Otis Gospodnetic
Hi, Are there any good / comprehensive examples of protwords.txt for English? Or good stemdict.txt examples that work with StemmerOverrideFilterFactory? Would be good to have a good example to include in Solr distribution... Thanks, Otis Sematext :: http://sematext.com/ :: Solr - Lucene -

RE: Solr - Multi Term highlighting issue

2011-04-25 Thread Ramanathapuram, Rajesh
Hi Robert, Thanks for your help. This looks much closer to my issue(may be not). Unfortunately, I can't switch to solr version 3.1 yet. I hope to revisit and update this post when I do. Thanks thanks regards, Rajesh Ramana Enterprise Applications, Turner Broadcasting System, Inc.

Re: Good protwords.txt ?

2011-04-25 Thread Robert Muir
On Mon, Apr 25, 2011 at 2:05 PM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: Hi, Are there any good / comprehensive examples of protwords.txt for English? Or good stemdict.txt examples that work with StemmerOverrideFilterFactory? Would be good to have a good example to include in Solr

Automatic synonyms for multiple variations of a word

2011-04-25 Thread Otis Gospodnetic
Hi, How do people handle cases where synonyms are used and there are multiple version of the original word that really need to point to the same set of synonyms? For example: Consider singular and plural of the word responsibility. One might have synonyms defined like this:

RE: term position question from analyzer stack for WordDelimiterFilterFactory

2011-04-25 Thread Robert Petersen
Aha! I knew something must be awry, but when I looked at the analysis page output, well it sure looked like it should match. :) OK here is the query side WDF that finally works, I just turned everything off. (yay) First I tried just completely removeing WDF from the query side analyzer stack

Lucene Rev Stump the Chump

2011-04-25 Thread Grant Ingersoll
Hey everyone, As you no doubt by now know, Lucene Revolution, the second annual Lucene/Solr conference sponsored by Lucid Imagination, is happening out in San Francisco at the end of May. There are a lot of really great talks and speakers from across the spectrum (check out

Re: Multi-word Solr Synonym issue

2011-04-25 Thread Chris Hostetter
: Subject: Multi-word Solr Synonym issue : In-Reply-To: banlktikq66d40+dprrdyihshsjhdmxs...@mail.gmail.com http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead

Negative OR in fq field not working as expected

2011-04-25 Thread Simon Wistow
I have a field 'type' that has several values. If it's type 'foo' then it also has a field 'restriction_id'. What I want is a filter query which says either it's not a 'foo' or if it is then it has the restriction '1' I expect two matches - one of type 'bar' and one of type 'foo' Neither

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Jonathan Rochkind
The solr 'lucene' query parser (that's being used there, in an fq) sometimes has trouble with pure negative clauses in an OR. Even though it can handle pure negative queries like -type:foo, it has trouble with pure negative in an OR like you are doing. At least in 1.4.1, don't know if it's

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Simon Wistow
On Mon, Apr 25, 2011 at 04:34:05PM -0400, Jonathan Rochkind said: This is what I do instead, to rewrite the query to mean the same thing but not give the lucene query parser trouble: fq=( (*:* AND -type:foo) OR restriction_id:1) *:* means everything, so (*:* AND -type:foo) means the same

Re: normalizing the score

2011-04-25 Thread Chris Hostetter
: All I found was: http://search.lucidimagination.com/search/document/9d06882d97db5c59/a_question_about_solr_score : : where Hoss suggests to normalize depending on the maxScore. to be clear, i do not (nor have i ever) suggested that someone normalize based on maxScore. my point there was

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Yonik Seeley
On Mon, Apr 25, 2011 at 4:49 PM, Simon Wistow si...@thegestalt.org wrote: On Mon, Apr 25, 2011 at 04:34:05PM -0400, Jonathan Rochkind said: This is what I do instead, to rewrite the query to mean the same thing but not give the lucene query parser trouble: fq=( (*:* AND -type:foo) OR

Re: normalizing the score

2011-04-25 Thread Paul Libbrecht
Thanks for the precision Hoss, that is helpful an explanation. I am still unsure how it is ever possible to display score-bars for which you need some normalization... but that's for another day. I feel indications of match quality is still somehow a science that has not blossomed yet. Sorting

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Jonathan Rochkind
Yeah, I do the (*:* AND -type:foo) OR something:else thing on my own pretty big index, and it's not slow at all. At least no slower than doing any other X OR Y where X and Y both include lots of results. Pre-warming the field cache for, in this case, the 'type' field may help. Same as it

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Simon Wistow
On Mon, Apr 25, 2011 at 05:02:12PM -0400, Yonik Seeley said: It really shouldn't be that slow... how many documents are in your index, and how many match -type:foo? Total number of docs is 161,000,000 type:foo 39,000,000 -type:foo 122,200,000 type:bar 90,000,000 We're aware it's large and

Re: Reloading synonyms.txt without downtime

2011-04-25 Thread Chris Hostetter
: Apparently, when one RELOADs a core, the synonyms file is not reloaded. Is this : : the expected behaviour? Is it the desired behaviour? this is not expected, nor is it desired (by me) nor can i reproduce the problem you are talking about. steps i attempted to reproduce: 1) started the

Problems with Spellchecker in 3.1

2011-04-25 Thread Bob Sandiford
Oops. Sorry. I'm hijacking my own thread to put a real Subject in place... Bob Sandiford | Lead Software Engineer | SirsiDynix P: 800.288.8020 X6943 | bob.sandif...@sirsidynix.com www.sirsidynix.com  -Original Message- From: Bob Sandiford Sent: Monday, April 25, 2011 5:34 PM To:

Scaling Search with Big Data/Hadoop and Solr now available at Lucene Revolution

2011-04-25 Thread Jay Hill
I've worked with a lot of different Solr implementations, and one area that is emerging more and more is using Solr in combination with other big data solutions. My company, Lucid Imagination, has added a two-day course to our upcoming Lucene Revolution conference, Scaling Search with Big Data and

solr sorting on multiple conditions, please help

2011-04-25 Thread James Lin
Hi Folks, I got a problem on solr sorting as below: sort=query({!v=area_id: 78153}) desc, score desc What I want to achieve is sort by if there is a match with area_id, then sort by the actual score problem is, area_id is a multiple value, the result I am getting does not sort by the actual

Re: Good protwords.txt ?

2011-04-25 Thread Otis Gospodnetic
Hi Robert, That's some old thread from 1969 - that's before my time! :) I'm not sure what 2+2lemma.txt is... aha, I see it on http://wordlist.sourceforge.net/12dicts-readme-r5.html -- a headword + N related words. I don't think this will help me tame the overly aggressive Porter stemmer,

Re: Automatic synonyms for multiple variations of a word

2011-04-25 Thread Otis Gospodnetic
Hi Otis Robert, - Original Message How do people handle cases where synonyms are used and there are multiple version of the original word that really need to point to the same set of synonyms? For example: Consider singular and plural of the word responsibility. One

Re: Automatic synonyms for multiple variations of a word

2011-04-25 Thread Lance Norskog
This has come up with stemming: you can stem your synonym list with the FieldAnalyzer Solr http call, then save the final chewed-up terms as a new synonym file. You then use that one in the analyzer stack below the stemmer filter. On Mon, Apr 25, 2011 at 9:15 PM, Otis Gospodnetic

Re: Automatic synonyms for multiple variations of a word

2011-04-25 Thread Otis Gospodnetic
Right, instead of this in synonyms file: responsibility, obligation, duty I could stem each of the above words/synonyms and have something like this in synonyms file: respons, oblig, duti But somehow this feels bad (well, so does sticking word variations in what's supposed to be a

Re: Query regarding solr plugin.

2011-04-25 Thread rajini maski
Thanks Erick. I have added my replies to the points you did mention. I am somewhere going wrong. I guess do I need to club both the jars or something ? If yes, how do i do that? I have no much idea about java and jar files. Please guide me here. A couple of things to try. 1 when you do a 'jar