Thanks Erick. I have added my replies to the points you did mention. I am
somewhere going wrong. I guess do I need to club both the jars or something
? If yes, how do i do that? I have no much idea about java and jar files.
Please guide me here.
A couple of things to try.
1> when you do a 'jar -t
Right, instead of this in synonyms file:
responsibility, obligation, duty
I could stem each of the above words/synonyms and have something like this in
synonyms file:
respons, oblig, duti
But somehow this feels bad (well, so does sticking word variations in what's
supposed to be a synon
This has come up with stemming: you can stem your synonym list with
the FieldAnalyzer Solr http call, then save the final chewed-up terms
as a new synonym file. You then use that one in the analyzer stack
below the stemmer filter.
On Mon, Apr 25, 2011 at 9:15 PM, Otis Gospodnetic
wrote:
> Hi Otis
Hi Otis & Robert,
- Original Message
>
> How do people handle cases where synonyms are used and there are multiple
> version of the original word that really need to point to the same set of
> synonyms?
>
> For example:
> Consider singular and plural of the word "responsibility".
Hi Robert,
That's some old thread from 1969 - that's before my time! :)
I'm not sure what 2+2lemma.txt is... aha, I see it on
http://wordlist.sourceforge.net/12dicts-readme-r5.html -- a headword + N
related
words. I don't think this will help me tame the overly aggressive Porter
stemmer, alt
Hi Folks,
I got a problem on solr sorting as below:
sort=query({!v="area_id: 78153"}) desc, score desc
What I want to achieve is sort by if there is a match with area_id, then
sort by the actual score
problem is, area_id is a multiple value, the result I am getting does not
sort by the actual s
I've worked with a lot of different Solr implementations, and one area that
is emerging more and more is using Solr in combination with other "big data"
solutions. My company, Lucid Imagination, has added a two-day course to our
upcoming Lucene Revolution conference, "Scaling Search with Big Data a
Oops. Sorry. I'm hijacking my own thread to put a real Subject in place...
Bob Sandiford | Lead Software Engineer | SirsiDynix
P: 800.288.8020 X6943 | bob.sandif...@sirsidynix.com
www.sirsidynix.com
> -Original Message-
> From: Bob Sandiford
> Sent: Monday, April 25, 2011 5:34 PM
> To
: Apparently, when one RELOADs a core, the synonyms file is not reloaded. Is
this
:
: the expected behaviour? Is it the desired behaviour?
this is not expected, nor is it desired (by me) nor can i reproduce the
problem you are talking about.
steps i attempted to reproduce:
1) started the
On Mon, Apr 25, 2011 at 05:02:12PM -0400, Yonik Seeley said:
> It really shouldn't be that slow... how many documents are in your
> index, and how many match -type:foo?
Total number of docs is 161,000,000
type:foo 39,000,000
-type:foo 122,200,000
type:bar 90,000,000
We're aware it's large an
Yeah, I do the (*:* AND -type:foo) OR something:else
thing on my own pretty big index, and it's not slow at all. At least no
slower than doing any other "X OR Y" where X and Y both include lots of
results.
Pre-warming the field cache for, in this case, the 'type' field may
help. Same as it
Thanks for the precision Hoss,
that is helpful an explanation.
I am still unsure how it is ever possible to display score-bars for which you
need some normalization... but that's for another day.
I feel indications of match quality is still somehow a science that has not
blossomed yet.
Sorting
On Mon, Apr 25, 2011 at 4:49 PM, Simon Wistow wrote:
> On Mon, Apr 25, 2011 at 04:34:05PM -0400, Jonathan Rochkind said:
>> This is what I do instead, to rewrite the query to mean the same thing but
>> not give the lucene query parser trouble:
>>
>> fq=( (*:* AND -type:foo) OR restriction_id:1)
>>
: All I found was:
http://search.lucidimagination.com/search/document/9d06882d97db5c59/a_question_about_solr_score
:
: where Hoss suggests to normalize depending on the maxScore.
to be clear, i do not (nor have i ever) suggested that someone normalize
based on maxScore.
my point there was th
On Mon, Apr 25, 2011 at 04:34:05PM -0400, Jonathan Rochkind said:
> This is what I do instead, to rewrite the query to mean the same thing but
> not give the lucene query parser trouble:
>
> fq=( (*:* AND -type:foo) OR restriction_id:1)
>
> "*:*" means "everything", so (*:* AND -type:foo) means
The solr 'lucene' query parser (that's being used there, in an fq)
sometimes has trouble with "pure negative" clauses in an OR.
Even though it can handle "pure negative" queries like "-type:foo", it
has trouble with pure negative in an OR like you are doing. At least in
1.4.1, don't know if it
I have a field 'type' that has several values. If it's type 'foo' then
it also has a field 'restriction_id'.
What I want is a filter query which says "either it's not a 'foo' or if
it is then it has the restriction '1'"
I expect two matches - one of type 'bar' and one of type 'foo'
Neither
: Subject: Multi-word Solr Synonym issue
: In-Reply-To:
http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists
When starting a new discussion on a mailing list, please do not reply to
an existing message, instead start a fresh email. Even if you change the
subject
Hey everyone,
As you no doubt by now know, Lucene Revolution, the second annual Lucene/Solr
conference sponsored by Lucid Imagination, is happening out in San Francisco at
the end of May. There are a lot of really great talks and speakers from across
the spectrum (check out lucenerevolution.o
Aha! I knew something must be awry, but when I looked at the analysis
page output, well it sure looked like it should match. :)
OK here is the query side WDF that finally works, I just turned
everything off. (yay) First I tried just completely removeing WDF from
the query side analyzer stack b
Hi,
How do people handle cases where synonyms are used and there are multiple
version of the original word that really need to point to the same set of
synonyms?
For example:
Consider singular and plural of the word "responsibility". One might have
synonyms defined like this:
responsibilit
On Mon, Apr 25, 2011 at 2:05 PM, Otis Gospodnetic
wrote:
> Hi,
>
> Are there any good / comprehensive examples of protwords.txt for English?
> Or good stemdict.txt examples that work with StemmerOverrideFilterFactory?
>
> Would be good to have a good example to include in Solr distribution...
>
I
Hi Robert,
Thanks for your help.
This looks much closer to my issue(may be not). Unfortunately, I can't
switch to solr version 3.1 yet.
I hope to revisit and update this post when I do.
Thanks
thanks & regards,
Rajesh Ramana
Enterprise Applications, Turner Broadcasting System, Inc.
404.878.
Hi,
Are there any good / comprehensive examples of protwords.txt for English?
Or good stemdict.txt examples that work with StemmerOverrideFilterFactory?
Would be good to have a good example to include in Solr distribution...
Thanks,
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - N
On Mon, Apr 25, 2011 at 12:15 PM, Robert Petersen wrote:
> The search and index analyzer stack are the same.
Ahhh, they should not be!
Using both generate and catenate in WDF at query time is a no-no.
Same reason you can't have multi-word synonyms at query time:
http://wiki.apache.org/solr/Analyz
Sorry, that was supposed to be just another way to say the same thing...
OK look here is my current situation. Even with preserveOriginal and
concatAll set, I am still getting an even odder result.
I set up sku=218078624 with title=" Beanbag AppleTV Friction Dash Mount
for GPS " and index it in d
It finds something under "match" but just nothing under "response". I tried
turning on debugQuery=on but I did not see anything that jumped out at me as
a bug or anything. Is there some kind of threshold setting that I can tinker
with to see if that is the problem?
On Sun, Apr 24, 2011 at 2:37 AM,
hello ,
i am using Xpathentityprocessor to do index xml files
below is my xml file
CustomerA
ThisB
AnyC
now i want to concatenate in index so that when i search it gives below
result
CData with id attribute--- like CustomerAThisB or something like that
is it possible by RegexTran
Thanks firdous_kind86
i replace tikaentityprocessor with xpathentityprocessor and works fine
--
View this message in context:
http://lucene.472066.n3.nabble.com/Unable-to-load-EntityProcessor-implementation-for-entity-16865747177753-tp2846513p2861229.html
Sent from the Solr - User mailing list a
There's not much information to go on here. You haven't stated the
problem so people unfamiliar with your setup can understand it. What
is the error you're getting? Show us the configurations, please.
You might want to review:
http://wiki.apache.org/solr/UsingMailingLists
Best
Erick
On Mon, Apr
Looking at things more carefully, it may be one of your dependent classes
that's not being found.
A couple of things to try.
1> when you do a 'jar -tfv ", you should see
output like:
1183 Sun Jun 06 01:31:14 EDT 2010
org/apache/lucene/analysis/sinks/TokenTypeSinkTokenizer.class
and your stateme
Hi
I have same Solr 1.4 setup on two different servers, One for production &
One for Staging. My production server gives proper cluster & Staging server
give wrong cluster. The problem is for "date" related cluster only
I have checked all the configuration & setup. everything seems fine. i am
cr
blocky,
Shingles should be your way.
Regards,
Em
--
View this message in context:
http://lucene.472066.n3.nabble.com/Suggester-with-multi-terms-tp2859547p2860419.html
Sent from the Solr - User mailing list archive at Nabble.com.
33 matches
Mail list logo