Re: Query regarding solr plugin.

2011-04-25 Thread rajini maski
Thanks Erick. I have added my replies to the points you did mention. I am somewhere going wrong. I guess do I need to club both the jars or something ? If yes, how do i do that? I have no much idea about java and jar files. Please guide me here. A couple of things to try. 1> when you do a 'jar -t

Re: Automatic synonyms for multiple variations of a word

2011-04-25 Thread Otis Gospodnetic
Right, instead of this in synonyms file: responsibility, obligation, duty I could stem each of the above words/synonyms and have something like this in synonyms file: respons, oblig, duti But somehow this feels bad (well, so does sticking word variations in what's supposed to be a synon

Re: Automatic synonyms for multiple variations of a word

2011-04-25 Thread Lance Norskog
This has come up with stemming: you can stem your synonym list with the FieldAnalyzer Solr http call, then save the final chewed-up terms as a new synonym file. You then use that one in the analyzer stack below the stemmer filter. On Mon, Apr 25, 2011 at 9:15 PM, Otis Gospodnetic wrote: > Hi Otis

Re: Automatic synonyms for multiple variations of a word

2011-04-25 Thread Otis Gospodnetic
Hi Otis & Robert, - Original Message > > How do people handle cases where synonyms are used and there are multiple > version of the original word that really need to point to the same set of > synonyms? > > For example: > Consider singular and plural of the word "responsibility".

Re: Good protwords.txt ?

2011-04-25 Thread Otis Gospodnetic
Hi Robert, That's some old thread from 1969 - that's before my time! :) I'm not sure what 2+2lemma.txt is... aha, I see it on http://wordlist.sourceforge.net/12dicts-readme-r5.html -- a headword + N related words. I don't think this will help me tame the overly aggressive Porter stemmer, alt

solr sorting on multiple conditions, please help

2011-04-25 Thread James Lin
Hi Folks, I got a problem on solr sorting as below: sort=query({!v="area_id: 78153"}) desc, score desc What I want to achieve is sort by if there is a match with area_id, then sort by the actual score problem is, area_id is a multiple value, the result I am getting does not sort by the actual s

Scaling Search with Big Data/Hadoop and Solr now available at Lucene Revolution

2011-04-25 Thread Jay Hill
I've worked with a lot of different Solr implementations, and one area that is emerging more and more is using Solr in combination with other "big data" solutions. My company, Lucid Imagination, has added a two-day course to our upcoming Lucene Revolution conference, "Scaling Search with Big Data a

Problems with Spellchecker in 3.1

2011-04-25 Thread Bob Sandiford
Oops. Sorry. I'm hijacking my own thread to put a real Subject in place... Bob Sandiford | Lead Software Engineer | SirsiDynix P: 800.288.8020 X6943 | bob.sandif...@sirsidynix.com www.sirsidynix.com  > -Original Message- > From: Bob Sandiford > Sent: Monday, April 25, 2011 5:34 PM > To

Re: Reloading synonyms.txt without downtime

2011-04-25 Thread Chris Hostetter
: Apparently, when one RELOADs a core, the synonyms file is not reloaded. Is this : : the expected behaviour? Is it the desired behaviour? this is not expected, nor is it desired (by me) nor can i reproduce the problem you are talking about. steps i attempted to reproduce: 1) started the

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Simon Wistow
On Mon, Apr 25, 2011 at 05:02:12PM -0400, Yonik Seeley said: > It really shouldn't be that slow... how many documents are in your > index, and how many match -type:foo? Total number of docs is 161,000,000 type:foo 39,000,000 -type:foo 122,200,000 type:bar 90,000,000 We're aware it's large an

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Jonathan Rochkind
Yeah, I do the (*:* AND -type:foo) OR something:else thing on my own pretty big index, and it's not slow at all. At least no slower than doing any other "X OR Y" where X and Y both include lots of results. Pre-warming the field cache for, in this case, the 'type' field may help. Same as it

Re: normalizing the score

2011-04-25 Thread Paul Libbrecht
Thanks for the precision Hoss, that is helpful an explanation. I am still unsure how it is ever possible to display score-bars for which you need some normalization... but that's for another day. I feel indications of match quality is still somehow a science that has not blossomed yet. Sorting

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Yonik Seeley
On Mon, Apr 25, 2011 at 4:49 PM, Simon Wistow wrote: > On Mon, Apr 25, 2011 at 04:34:05PM -0400, Jonathan Rochkind said: >> This is what I do instead, to rewrite the query to mean the same thing but >> not give the lucene query parser trouble: >> >> fq=( (*:* AND -type:foo) OR restriction_id:1) >>

Re: normalizing the score

2011-04-25 Thread Chris Hostetter
: All I found was: http://search.lucidimagination.com/search/document/9d06882d97db5c59/a_question_about_solr_score : : where Hoss suggests to normalize depending on the maxScore. to be clear, i do not (nor have i ever) suggested that someone normalize based on maxScore. my point there was th

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Simon Wistow
On Mon, Apr 25, 2011 at 04:34:05PM -0400, Jonathan Rochkind said: > This is what I do instead, to rewrite the query to mean the same thing but > not give the lucene query parser trouble: > > fq=( (*:* AND -type:foo) OR restriction_id:1) > > "*:*" means "everything", so (*:* AND -type:foo) means

Re: Negative OR in fq field not working as expected

2011-04-25 Thread Jonathan Rochkind
The solr 'lucene' query parser (that's being used there, in an fq) sometimes has trouble with "pure negative" clauses in an OR. Even though it can handle "pure negative" queries like "-type:foo", it has trouble with pure negative in an OR like you are doing. At least in 1.4.1, don't know if it

Negative OR in fq field not working as expected

2011-04-25 Thread Simon Wistow
I have a field 'type' that has several values. If it's type 'foo' then it also has a field 'restriction_id'. What I want is a filter query which says "either it's not a 'foo' or if it is then it has the restriction '1'" I expect two matches - one of type 'bar' and one of type 'foo' Neither

Re: Multi-word Solr Synonym issue

2011-04-25 Thread Chris Hostetter
: Subject: Multi-word Solr Synonym issue : In-Reply-To: http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you change the subject

Lucene Rev Stump the Chump

2011-04-25 Thread Grant Ingersoll
Hey everyone, As you no doubt by now know, Lucene Revolution, the second annual Lucene/Solr conference sponsored by Lucid Imagination, is happening out in San Francisco at the end of May. There are a lot of really great talks and speakers from across the spectrum (check out lucenerevolution.o

RE: term position question from analyzer stack for WordDelimiterFilterFactory

2011-04-25 Thread Robert Petersen
Aha! I knew something must be awry, but when I looked at the analysis page output, well it sure looked like it should match. :) OK here is the query side WDF that finally works, I just turned everything off. (yay) First I tried just completely removeing WDF from the query side analyzer stack b

Automatic synonyms for multiple variations of a word

2011-04-25 Thread Otis Gospodnetic
Hi, How do people handle cases where synonyms are used and there are multiple version of the original word that really need to point to the same set of synonyms? For example: Consider singular and plural of the word "responsibility". One might have synonyms defined like this: responsibilit

Re: Good protwords.txt ?

2011-04-25 Thread Robert Muir
On Mon, Apr 25, 2011 at 2:05 PM, Otis Gospodnetic wrote: > Hi, > > Are there any good / comprehensive examples of protwords.txt for English? > Or good stemdict.txt examples that work with StemmerOverrideFilterFactory? > > Would be good to have a good example to include in Solr distribution... > I

RE: Solr - Multi Term highlighting issue

2011-04-25 Thread Ramanathapuram, Rajesh
Hi Robert, Thanks for your help. This looks much closer to my issue(may be not). Unfortunately, I can't switch to solr version 3.1 yet. I hope to revisit and update this post when I do. Thanks thanks & regards, Rajesh Ramana Enterprise Applications, Turner Broadcasting System, Inc. 404.878.

Good protwords.txt ?

2011-04-25 Thread Otis Gospodnetic
Hi, Are there any good / comprehensive examples of protwords.txt for English? Or good stemdict.txt examples that work with StemmerOverrideFilterFactory? Would be good to have a good example to include in Solr distribution... Thanks, Otis Sematext :: http://sematext.com/ :: Solr - Lucene - N

Re: term position question from analyzer stack for WordDelimiterFilterFactory

2011-04-25 Thread Yonik Seeley
On Mon, Apr 25, 2011 at 12:15 PM, Robert Petersen wrote: > The search and index analyzer stack are the same. Ahhh, they should not be! Using both generate and catenate in WDF at query time is a no-no. Same reason you can't have multi-word synonyms at query time: http://wiki.apache.org/solr/Analyz

RE: term position question from analyzer stack for WordDelimiterFilterFactory

2011-04-25 Thread Robert Petersen
Sorry, that was supposed to be just another way to say the same thing... OK look here is my current situation. Even with preserveOriginal and concatAll set, I am still getting an even odder result. I set up sku=218078624 with title=" Beanbag AppleTV Friction Dash Mount for GPS " and index it in d

Re: MoreLikeThis

2011-04-25 Thread Brian Lamb
It finds something under "match" but just nothing under "response". I tried turning on debugQuery=on but I did not see anything that jumped out at me as a bug or anything. Is there some kind of threshold setting that I can tinker with to see if that is the problem? On Sun, Apr 24, 2011 at 2:37 AM,

how to concatenate two nodes of xml with xpathentityprocessor

2011-04-25 Thread vrpar...@gmail.com
hello , i am using Xpathentityprocessor to do index xml files below is my xml file CustomerA ThisB AnyC now i want to concatenate in index so that when i search it gives below result CData with id attribute--- like CustomerAThisB or something like that is it possible by RegexTran

Re: Unable to load EntityProcessor implementation for entity:16865747177753

2011-04-25 Thread vrpar...@gmail.com
Thanks firdous_kind86 i replace tikaentityprocessor with xpathentityprocessor and works fine -- View this message in context: http://lucene.472066.n3.nabble.com/Unable-to-load-EntityProcessor-implementation-for-entity-16865747177753-tp2846513p2861229.html Sent from the Solr - User mailing list a

Re: Different Cluster Results on Different Servers, with same SOLR setup

2011-04-25 Thread Erick Erickson
There's not much information to go on here. You haven't stated the problem so people unfamiliar with your setup can understand it. What is the error you're getting? Show us the configurations, please. You might want to review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Mon, Apr

Re: Query regarding solr plugin.

2011-04-25 Thread Erick Erickson
Looking at things more carefully, it may be one of your dependent classes that's not being found. A couple of things to try. 1> when you do a 'jar -tfv ", you should see output like: 1183 Sun Jun 06 01:31:14 EDT 2010 org/apache/lucene/analysis/sinks/TokenTypeSinkTokenizer.class and your stateme

Different Cluster Results on Different Servers, with same SOLR setup

2011-04-25 Thread Pawan Darira
Hi I have same Solr 1.4 setup on two different servers, One for production & One for Staging. My production server gives proper cluster & Staging server give wrong cluster. The problem is for "date" related cluster only I have checked all the configuration & setup. everything seems fine. i am cr

Re: Suggester with multi terms

2011-04-25 Thread Em
blocky, Shingles should be your way. Regards, Em -- View this message in context: http://lucene.472066.n3.nabble.com/Suggester-with-multi-terms-tp2859547p2860419.html Sent from the Solr - User mailing list archive at Nabble.com.