Re: How do I add multiple values for same field with DIH script?

2020-01-16 Thread O. Klein
Yes, field is multivalued.

I managed to add an array to the content_text field, and comma-separated
values (e.g. "foo,bar"), but not a list like you normally see with a
multivalued field.



--
Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html


How do I add multiple values for same field with DIH script?

2020-01-16 Thread O. Klein
row.put('content_text', "hello");
row.put('content_text', "this is a test");
return row;

will only return "this is a test"
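
The overwrite is ordinary map behavior: a second put on the same key replaces the first value. A minimal sketch of those semantics in Python (an analogy only; the DIH script itself is JavaScript, and `row` here is a plain dict standing in for the DIH row map, which is an assumption made for illustration). Collecting the values into one list and storing that list once keeps both. Whether DIH then indexes each element as a separate value is not shown here.

```python
def put_twice(row):
    # Two assignments to the same key: the second one wins.
    row["content_text"] = "hello"
    row["content_text"] = "this is a test"
    return row


def put_list(row):
    # One assignment holding every value for the field.
    row["content_text"] = ["hello", "this is a test"]
    return row


overwritten = put_twice({})
collected = put_list({})
print(overwritten["content_text"])
print(collected["content_text"])
```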






Configsets and DIH.properties

2018-12-15 Thread O. Klein
Following up on 
http://lucene.472066.n3.nabble.com/Can-I-use-configsets-with-custom-stopwords-per-collection-td4416454.html

  
I am still looking for a good way to share schema.xml using configSets but
have individual DIH.properties per collection so delta queries work
properly.

Is this possible?






Re: Can I use configsets with custom stopwords per collection?

2018-12-05 Thread O. Klein
Ok. So with these suggestions, I found
https://lucene.apache.org/solr/guide/6_6/configuring-solrconfig-xml.html#Configuringsolrconfig.xml-ImplicitCoreProperties
So to test this I tried to use it in DIH as this has a similar issue with
configsets as every collection needs its own DIH.properties.



However, this does not work. Substituting ${solr.core.name} with the core name does
work.

Am I missing something?





Re: Can I use configsets with custom stopwords per collection?

2018-12-04 Thread O. Klein
Yeah, I'm not copying files. I want all collections to use 1 schema.

So I wonder, do managed stopwords work with configsets and store stopwords
per collection?

Also, what would be the substitution variable for collection name? Is there
a list somewhere?

Thanks!





Re: Can I use configsets with custom stopwords per collection?

2018-12-03 Thread O. Klein
Yeah, but if I define them in the schema of the configset, the custom file with
stopwords is in a directory relative to the collection and not in the configset.

So is there a way to define a path to stopwords with the collection as a
variable?





Can I use configsets with custom stopwords per collection?

2018-12-03 Thread O. Klein
I'm using Solr standalone and I want to use shared stopwords and custom
stopwords per collection. Is this possible?





Re: Autocomplete returning shingles

2018-05-04 Thread O. Klein
Yes, splitting in more documents would probably work.

Don't think I can do this easily with Solr.

Looking into using facets now.





Is it normal for BlendedInfixLookupFactory to not show terms?

2018-05-02 Thread O. Klein
BlendedInfixLookupFactory is not returning terms, but returns the field
value. If I change to FuzzyLookupFactory it works fine. Am I doing something
wrong?

   
  
default
BlendedInfixLookupFactory
position_linear
DocumentDictionaryFactory
weight
text_suggest
language
textSuggest
true
  






Autocomplete returning shingles

2018-05-02 Thread O. Klein
I need to use autocomplete with edismax (ngrams,edgegrams) to return shingled
suggestions. Field value "new york city" needs to return on query "ne" ->
"new","new york","new york city". With suggester this is easy. But I'm forced
to use edismax because I need to apply multiple filter queries.

What is best approach to deal with this?

Any suggestions are appreciated.
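
The desired output can be pinned down with a small sketch. It is only a client-side illustration in Python of which strings should come back, not a Solr analysis chain; the function names are made up for this example, and only shingles that start at the first word are produced (so "york city" is deliberately not returned).

```python
def prefix_shingles(value):
    # "new york city" -> ["new", "new york", "new york city"]
    words = value.split()
    return [" ".join(words[:i]) for i in range(1, len(words) + 1)]


def suggest(value, typed):
    # Keep the leading shingles that begin with what the user typed.
    typed = typed.lower()
    return [s for s in prefix_shingles(value.lower()) if s.startswith(typed)]


suggestions = suggest("new york city", "ne")
print(suggestions)
```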





Re: Search opening hours

2016-11-25 Thread O. Klein
Thank you for your reply David.

Yes, I ended up using a DateRangeField. Down side is that it needs frequent
updates. Luckily not an issue for my use case.

BTW how could I abuse DateRangeField for non-date data?




david.w.smi...@gmail.com wrote
> I just saw this conversation now.  I didn't read every word but I have to
> ask immediately: does DateRangeField address your needs?
> https://cwiki.apache.org/confluence/display/solr/Working+with+Dates  It
> was
> introduced in 5.0.
> 
> On Wed, Nov 16, 2016 at 4:59 AM O. Klein klein@ wrote:
> 
>> Above implementation was too slow, so wondering if Solr 6 with all its
>> new
>> features provides a better solution to tackle operating hours. Especially
>> dealing with different timezones.
>>
>> Any thoughts?
>>
>>
>>
>>
>>
>> --
>> View this message in context:
>> http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4306073.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
> -- 
> Lucene/Solr Search Committer, Consultant, Developer, Author, Speaker
> LinkedIn: http://linkedin.com/in/davidwsmiley | Book:
> http://www.solrenterprisesearchserver.com





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4307463.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Search opening hours

2016-11-16 Thread O. Klein
Above implementation was too slow, so wondering if Solr 6 with all its new
features provides a better solution to tackle operating hours. Especially
dealing with different timezones.

Any thoughts?





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4306073.html


Re: Search opening hours

2015-09-08 Thread O. Klein
BTW any idea how index speed is influenced?

I used worldBounds with a y range of -1 to 1, but figured this could also be 0.

After changing to 0 indexing became a lot slower though (no exceptions in
log).



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4227531.html


Re: Search opening hours

2015-09-08 Thread O. Klein
Doesn't sound odd to me. I just expected index time to be faster with a
smaller "world".

I used minutes as the scale first, but that slows it down even more. So I
changed to a 15-minute interval to keep it reasonable.

Maybe there is a setting that can speed this up, like the precisionStep in a
TrieField?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4227606.html


Re: Search opening hours

2015-09-07 Thread O. Klein
OK. Just wanted to see if it was possible for documentation.

I am using stateless script, so can fix it in client.

Thank you for the help.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4227405.html


Re: Search opening hours

2015-09-06 Thread O. Klein
Saw that, but not a lot of info about it.

From my understanding, the way it is supposed to work is that a value bigger
than the boundary gets normalized.

I just get an exception "bad x not in boundary rect"

Any pointers?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4227384.html


Re: Search opening hours

2015-09-06 Thread O. Klein
OK. I got most of it working.

I created a worldBounds="0 -1 762 1"

15 minute intervals for a week.

And use "linestring(1 0, 2 0)" to index data for Monday 00:15 to 00:30

How do I get to index Sunday 24:00 to Monday 01:00 ?

I have a feeling the linestring just goes back and doesn't wrap around the
plane.
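
If the line does not wrap, one workaround is to split any range that crosses the week boundary into two pieces before indexing. The sketch below is a client-side illustration in Python under stated assumptions: a week is taken as 7 * 24 * 4 = 672 quarter-hour slots with Monday 00:00 as slot 0 (the worldBounds above uses 762 as its upper x value, which is left as posted), and the helper names are hypothetical.

```python
SLOTS_PER_DAY = 24 * 4               # 15-minute slots per day
SLOTS_PER_WEEK = 7 * SLOTS_PER_DAY   # 672 slots; Monday 00:00 is slot 0


def slot(day, hour, minute):
    """Slot index for a weekday (0 = Monday .. 6 = Sunday) and a clock time."""
    return day * SLOTS_PER_DAY + hour * 4 + minute // 15


def split_at_week_boundary(start, end):
    """Return one or two (start, end) pieces that never cross the week end."""
    start %= SLOTS_PER_WEEK
    end %= SLOTS_PER_WEEK
    if start <= end:
        return [(start, end)]
    pieces = [(start, SLOTS_PER_WEEK)]
    if end > 0:
        # Skip an empty second piece when the range ends exactly at week end.
        pieces.append((0, end))
    return pieces


def to_linestring(piece):
    # Same shape as the value in the post: linestring(x1 0, x2 0)
    return "linestring({} 0, {} 0)".format(*piece)


monday_quarter = to_linestring((slot(0, 0, 15), slot(0, 0, 30)))
sunday_night = [
    to_linestring(p)
    for p in split_at_week_boundary(slot(6, 22, 0), slot(0, 1, 0))
]
print(monday_quarter)
print(sunday_night)
```

With this mapping, Sunday 24:00 is the same slot as Monday 00:00, so "Sunday 24:00 to Monday 01:00" becomes a single piece from 0 to 4, while a range that starts earlier on Sunday evening becomes two values for the same document.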





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4227374.html


Re: Search opening hours

2015-09-05 Thread O. Klein
Darren,

So I created worldBounds="0 0 10080 0"

Didn't get polygons to work (needed at least 4 points that closed the
polygon), but I figured I only needed a line and check if a point was on
there. So I am using linestring now

Was wondering if this would work for Monday problem. Will the line be drawn
through the boundary and start at front again?






--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4227330.html


Re: Search opening hours

2015-08-26 Thread O. Klein
Darren,

This sounds like the solution I'm looking for. Especially nice fix for the
Sunday-Monday problem.

Never worked with spatial search before, so any pointers are welcome. 

Will start working on this solution.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4225443.html


Re: Search opening hours

2015-08-26 Thread O. Klein
Thank you for responding.

Yonik's solution is what I had in mind. Was hoping for something more
elegant, as he said, but it will work.

The thing I haven't figured out is how to deal with closing times early
morning next day.

So it's 22:00 now and opening hours are 20:00 to 03:00

Can this be done with either or both approaches?
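
The logic of the overnight case can be stated as a small check. This is a plain Python sketch of the comparison only, not a Solr query; times are minutes since midnight, the function name is made up for this example, and day-of-week handling is left out (the hours after midnight belong to the next calendar day).

```python
def is_open(now, opens, closes):
    """True if `now` falls inside the opening window (minutes since midnight)."""
    if opens <= closes:
        # Same-day window, e.g. 09:00 to 17:00.
        return opens <= now < closes
    # Overnight window, e.g. 20:00 to 03:00: open late today or early tomorrow.
    return now >= opens or now < closes


at_22 = is_open(22 * 60, 20 * 60, 3 * 60)
at_02 = is_open(2 * 60, 20 * 60, 3 * 60)
at_noon = is_open(12 * 60, 20 * 60, 3 * 60)
print(at_22, at_02, at_noon)
```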





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4225339.html


Re: Search opening hours

2015-08-26 Thread O. Klein
Those options don't fix my problem with closing times the next morning, or is
there a way to do this?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250p4225354.html


Search opening hours

2015-08-25 Thread O. Klein
I'm trying to find the best way to search for stores that are open NOW.

I have day of week, open and closing times.

I've seen some examples, but not an exact fit.

What is the best way to tackle this?

Thank you for any suggestions you have to offer.








--
View this message in context: 
http://lucene.472066.n3.nabble.com/Search-opening-hours-tp4225250.html


Re: Basic auth

2015-07-22 Thread O. Klein
Steven White wrote
 Thanks for updating the wiki page.  However, my issue remains, I cannot
 get
 Basic auth working.  Has anyone got it working, on Windows?

Doesn't work for me on Linux either.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Basic-auth-tp4218053p4218519.html


Re: Basic auth

2015-07-22 Thread O. Klein
Yeah I can't get it to work on Jetty 9 either on Linux.

Just trying to password protect the admin pages.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Basic-auth-tp4218053p4218565.html


Re: [Poll]: User need for Solr security

2015-03-13 Thread O. Klein
I used Tomcat to secure admin pages. Haven't looked into Jetty if/how to do
this, but some basic security like you mentioned: SSL, authenticated login
to Admin UI, authorization for Admin APIs would be nice to have.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Poll-User-need-for-Solr-security-tp4192624p4192816.html


Re: Help needed to understand zookeeper in solrcloud

2015-03-03 Thread O. Klein
ZooKeeper requires a majority of servers to be available. For example, with
five machines ZooKeeper can handle the failure of two machines. That's why odd
numbers are recommended.
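
The arithmetic behind the odd-number advice, as a short Python sketch (the function names are just for illustration): a majority of n servers is n // 2 + 1, so the number of failures an ensemble survives is n minus that majority.

```python
def quorum(ensemble_size):
    # Smallest number of servers that forms a strict majority.
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size):
    # Servers that can fail while a majority is still available.
    return ensemble_size - quorum(ensemble_size)


table = {n: tolerated_failures(n) for n in range(3, 8)}
print(table)
```

Going from five to six servers raises the majority from three to four but still tolerates only two failures, so the extra even-numbered machine adds no fault tolerance.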



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Help-needed-to-understand-zookeeper-in-solrcloud-tp4190631p4190633.html


Re: alternativeTermCount and WordBreakSolrSpellChecker combination not working

2015-02-13 Thread O. Klein
I am using the default on single node, which is frequency.

On the Wiki it says: In case of a distributed request to the
SpellCheckComponent, the shards are requested for at least five suggestions
even if the spellcheck.count parameter value is less than five. Once the
suggestions are collected, they are ranked by the configured distance
measure (Levenshtein Distance by default) and then by aggregate frequency.
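
A rough Python sketch of the ranking that wiki passage describes: merge the per-shard suggestions, sum their frequencies, then order by edit distance first and aggregate frequency second. This illustrates the description only and is not Solr's code; the shard data and function names are hypothetical.

```python
def levenshtein(a, b):
    # Classic dynamic-programming edit distance, one row at a time.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def rank(query, shard_results):
    # Sum frequencies across shards, then sort by (distance, -frequency).
    totals = {}
    for shard in shard_results:
        for word, freq in shard.items():
            totals[word] = totals.get(word, 0) + freq
    return sorted(totals, key=lambda w: (levenshtein(query, w), -totals[w]))


# Hypothetical per-shard suggestion frequencies.
shards = [{"hollywood": 10, "holyrood": 3}, {"hollywood": 5, "homewood": 7}]
ranked = rank("holywood", shards)
print(ranked)
```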

So for distributed this is different. Maybe James knows how to get the
behavior you are looking for.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/alternativeTermCount-and-WordBreakSolrSpellChecker-combination-not-working-tp4185352p4186214.html


Re: Stopwords in shingles suggester

2015-02-13 Thread O. Klein
I found the issue in Jira https://issues.apache.org/jira/browse/SOLR-6468


O. Klein wrote
 With more and more people starting to use the Suggester it seems that
 enablePositionIncrements for StopFilterFactory is still needed.
 
 Not sure why it is being removed from Solr5, but is there a way to keep
 the functionality beyond lucene 4.3 ? Or can this feature be reinstated?





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Stopwords-in-shingles-suggester-tp4166057p4186219.html


Re: Stopwords in shingles suggester

2015-02-12 Thread O. Klein
With more and more people starting to use the Suggester it seems that
enablePositionIncrements for StopFilterFactory is still needed.

Not sure why it is being removed from Solr5, but is there a way to keep the
functionality beyond lucene 4.3 ? Or can this feature be reinstated?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Stopwords-in-shingles-suggester-tp4166057p4185994.html


alternativeTermCount and WordBreakSolrSpellChecker combination not working

2015-02-10 Thread O. Klein
Because of a lot of misspellings in content I am using alternativeTermCount
and maxResultsForSuggest to get suggestions even if terms are in index.
However when adding wordbreak dictionary the collation that was given before
is now empty.

Is there a way to make this work?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/alternativeTermCount-and-WordBreakSolrSpellChecker-combination-not-working-tp4185352.html


RE: alternativeTermCount and WordBreakSolrSpellChecker combination not working

2015-02-10 Thread O. Klein
James,

That is very useful information. I tested it and can confirm that disabling
spellcheck in warmer solves core reload problem. 

Now with my use case I'm not trying to spellcheck and correct a whitespace.
If holy wood was queried with a mm of 100% it would have fewer hits than
hollywood and this would then be the best correction.

Is there a way to do this?

 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/alternativeTermCount-and-WordBreakSolrSpellChecker-combination-not-working-tp4185352p4185423.html


RE: alternativeTermCount and WordBreakSolrSpellChecker combination not working

2015-02-10 Thread O. Klein
I did some testing and the order of dictionaries doesn't seem to have an
effect. They are sorted by frequency. So if mm was applied holy wood would
have a lower frequency and solve this problem.

  "suggestions":[
    "holywood",{
      "numFound":4,
      "startOffset":0,
      "endOffset":8,
      "origFreq":4,
      "suggestion":[{
          "word":"holy wood",
          "freq":71828},
        {
          "word":"hollywood",
          "freq":2669},
        {
          "word":"holyrood",
          "freq":14},
        {
          "word":"homewood",
          "freq":737}]},
    "correctlySpelled",false,
    "collation","(holy wood)",
    "collation","hollywood"]}}



--
View this message in context: 
http://lucene.472066.n3.nabble.com/alternativeTermCount-and-WordBreakSolrSpellChecker-combination-not-working-tp4185352p4185461.html


RE: alternativeTermCount and WordBreakSolrSpellChecker combination not working

2015-02-10 Thread O. Klein
Yeah that should work. Is this something you will change in the code?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/alternativeTermCount-and-WordBreakSolrSpellChecker-combination-not-working-tp4185352p4185489.html


RE: alternativeTermCount and WordBreakSolrSpellChecker combination not working

2015-02-10 Thread O. Klein
Thank you for that answer James.

Increasing spellcheck.count did the trick.

Funny result: for the query holywood the suggestion is holy wood instead of
hollywood, even though I have a mm of 100%.

Any way to fix that?

BTW when using maxCollationTries Solr hangs on core reload. Apparently an
old bug, but hard to find as logs show nothing.

Below the results for holywood: 

  "suggestions":[
    "holywood",{
      "numFound":4,
      "startOffset":0,
      "endOffset":8,
      "origFreq":4,
      "suggestion":[{
          "word":"holy wood",
          "freq":70559},
        {
          "word":"hollywood",
          "freq":2649},
        {
          "word":"holyrood",
          "freq":14},
        {
          "word":"homewood",
          "freq":737}]},
    "correctlySpelled",false,
    "collation","(holy wood)"]}}



--
View this message in context: 
http://lucene.472066.n3.nabble.com/alternativeTermCount-and-WordBreakSolrSpellChecker-combination-not-working-tp4185352p4185368.html


Re: Problem with getting node active

2015-01-12 Thread O. Klein
UpdateLog had been commented out. Problem solved.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Problem-with-getting-node-active-tp4178942p4179013.html


Problem with getting node active

2015-01-12 Thread O. Klein
I have 4 cores, of which 2 recover just fine and 2 others never get really
active. Not when deleting index or changing clusterstate.json.

So I created a new collection (1 shard, 2 replicas on Solr 4.5 with 3
zookeeper ensemble) and added 1 document to it. It never gets active. Not
even on leader.

There are no exceptions in log.

How do I get zookeeper to see node as active again?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Problem-with-getting-node-active-tp4178942.html


Stopwords in shingles suggester

2014-10-27 Thread O. Klein
Is there a way in Solr to filter out stopwords in shingles like ES does?

http://www.elasticsearch.org/blog/searching-with-shingles/



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Stopwords-in-shingles-suggester-tp4166057.html


Re: Stopwords in shingles suggester

2014-10-27 Thread O. Klein
Thank you all for your input.

The stopword is being replaced by the fillerToken as shown in the article. 

Changing positionIncrementGap makes no difference and as of Solr 4.4, the
enablePositionIncrements argument is no longer supported in the
StopFilterFactory.

So how do I get this working in Solr 4.5 and up?





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Stopwords-in-shingles-suggester-tp4166057p4166157.html


Re: Stopwords in shingles suggester

2014-10-27 Thread O. Klein
I changed luceneMatchVersion to 4.3 and got the behavior I was looking for.





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Stopwords-in-shingles-suggester-tp4166057p4166192.html


Re: I need a replacement for the QueryElevation Component

2014-07-11 Thread O. Klein
Actually it seems to me that the current behavior is unwanted.

For use cases of exact match (or lowercased), I can use string fieldtype or
keywords tokenizer.

When I want documents sponsored with only partial match I should be able to
use a whitespace tokenizer.

Current behavior prevents that as it concatenates the terms.

Or am I missing something?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/I-need-a-replacement-for-the-QueryElevation-Component-tp4146077p4146701.html


Re: I need a replacement for the QueryElevation Component

2014-07-08 Thread O. Klein
You can sponsor more than 1 document per keyword.

<query text="AAA">
  <doc id="A" />
  <doc id="B" />
</query>

And you might want to try <str name="queryFieldType">string</str> instead
of another FieldType. I found that text fields remove whitespace and
concatenate the tokens.

Not sure if this is intended or not.







--
View this message in context: 
http://lucene.472066.n3.nabble.com/I-need-a-replacement-for-the-QueryElevation-Component-tp4146077p4146090.html


Date number to text

2014-06-06 Thread O. Klein
Is there a way to tokenize 12-6-2014 as 12 june 2014 and the other way around
as well?

Making it possible for people to search on text and numbers for dates?
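
To make the two forms concrete, here is a minimal Python sketch of the conversion in both directions. It assumes day-month-year order and English month names, and it runs outside Solr (for example in the client before indexing or querying); it is not a Solr tokenizer or filter, and the function names are hypothetical.

```python
from datetime import datetime

MONTHS = ["january", "february", "march", "april", "may", "june", "july",
          "august", "september", "october", "november", "december"]


def numeric_to_text(value):
    # "12-6-2014" -> "12 june 2014" (day first, then month, then year)
    d = datetime.strptime(value, "%d-%m-%Y")
    return "{} {} {}".format(d.day, MONTHS[d.month - 1], d.year)


def text_to_numeric(value):
    # "12 june 2014" -> "12-6-2014"
    day, month, year = value.lower().split()
    return "{}-{}-{}".format(int(day), MONTHS.index(month) + 1, int(year))


as_text = numeric_to_text("12-6-2014")
as_number = text_to_numeric("12 june 2014")
print(as_text)
print(as_number)
```

Indexing both forms into the same field would let either style of query match.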



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Date-number-to-text-tp4140323.html


Live reload

2013-07-16 Thread O. Klein
I used the reload command to apply changes in synonyms.txt for example, but
with the new mechanism https://wiki.apache.org/solr/CoreAdmin#LiveReload
this will not work anymore.

Is there another way to reload config files instead of restarting Solr?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Live-reload-tp4078318.html


Re: Live reload

2013-07-16 Thread O. Klein
My bad. I did some more testing as well and could not replicate the behavior.

Reloading synonyms works fine with a core reload.



Chris Hostetter-3 wrote
 : I used the reload command to apply changes in synonyms.txt for example, but
 : with the new mechanism <https://wiki.apache.org/solr/CoreAdmin#LiveReload>
 : this will not work anymore.
 
 the Live reload doesn't affect schema.xml settings and analyzers (like 
 changing stopwords or synonyms) ... when you reload, you should see your 
 new synonyms.txt file loaded.
 
 if you don't think you are seeing that behavior, then you need to provide 
 a lot more details about what version you are using, what steps you are 
 trying, and what behavior you *are* seeing so that we can understand what 
 problem you might be having...  
 
 https://wiki.apache.org/solr/UsingMailingLists
 
 i just did a simple sanity test on the 4x branch where i ran some stuff 
 through the analyzer UI screen, then changed the synonyms file and did a 
 reload and saw the changes i expected when i re-loaded the analysis page.
 
 
 
 -Hoss





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Live-reload-tp4078318p4078400.html


Re: Crawl Anywhere -

2013-02-11 Thread O. Klein
Yes you can run CA on different machines.

In Manage you have to set target and engine for this to work.

I've never done this, so you have to contact the developer for more details.



SivaKarthik wrote
 Hi All,
  in our project, we need to download millions of pages,
  so is there any support for crawling in a distributed environment
 using crawl-anywhere apps?
   Or what could be the alternatives?
 
  Thanks in advance.





--
View this message in context: 
http://lucene.472066.n3.nabble.com/ANNOUNCE-Web-Crawler-tp2607831p4039674.html


Re: [ANNOUNCE] Web Crawler

2013-01-27 Thread O. Klein
This is actually showing it works.

crawlerws is used by Crawl Anywhere UI and will pass it the correct
arguments when needed.




SivaKarthik wrote
 Hi,
  I'm trying to configure crawl-anywhere 3.0.3 on my local system.
  I'm following the steps from the page
 http://www.crawl-anywhere.com/installation-v300/
  but crawlerws is failing and showing the error message below in the
 browser:
   http://localhost:8080/crawlerws/
 <error>
   <errno>1</errno>
   <errmsg>Missing action</errmsg>
 </error>
 Not sure what I'm doing wrong. Could you please help me resolve the
 problem? Thank you.





--
View this message in context: 
http://lucene.472066.n3.nabble.com/ANNOUNCE-Web-Crawler-tp2607831p4036520.html


Solr hangs after core reload

2012-11-29 Thread O. Klein
Every time I try to do something with the cores from the admin UI, Solr hangs
with no exceptions.

Anyone else experiencing this?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-hangs-after-core-reload-tp4023206.html


Re: Multi word synonyms

2012-11-29 Thread O. Klein
Found an article about the issue of multi-word synonyms:
http://nolanlawson.com/2012/10/31/better-synonym-handling-in-solr/

Not sure it's the solution I'm looking for, but it may be for someone else.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-word-synonyms-tp3716292p4023220.html


Using additional dictionary with DirectSolrSpellChecker

2012-10-10 Thread O. Klein
Is there some way to supplement the DirectSolrSpellChecker with a dictionary?

(In some cases terms are not used because of threshold, but should be
offered as spellcheck suggestion)



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Using-additional-dictionary-with-DirectSolrSpellChecker-tp4012873.html


Re: Using additional dictionary with DirectSolrSpellChecker

2012-10-10 Thread O. Klein
I don't want to tweak the threshold. For majority of cases it works fine.

It's for cases where term has low frequency but is spelled correctly.

If you lower the threshold you would also get incorrectly spelled terms as
suggestions.


Robert Muir wrote
 These thresholds are adjustable: read the javadocs and tweak them.
 
 On Wed, Oct 10, 2012 at 5:59 AM, O. Klein <klein@> wrote:
 Is there some way to supplement the DirectSolrSpellChecker with a
 dictionary?

 (In some cases terms are not used because of threshold, but should be
 offered as spellcheck suggestion)



 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/Using-additional-dictionary-with-DirectSolrSpellChecker-tp4012873.html
 Sent from the Solr - User mailing list archive at Nabble.com.





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Using-additional-dictionary-with-DirectSolrSpellChecker-tp4012873p4012908.html


Re: Multi-words synonyms matching

2012-06-05 Thread O. Klein
The reason multi-word synonyms work better if you use LUCENE_33 is that
Solr then uses the SlowSynonymFilter instead of SynonymFilterFactory
(FSTSynonymFilterFactory).

But I don't know if the difference between them is a bug or not. Maybe
someone has more insight?




Bernd Fehling-2 wrote
 
 Are you sure with LUCENE_33 (Use of BitVector)?
 
 
 Am 31.05.2012 17:20, schrieb O. Klein:
 I have been struggling with this as well and found that using LUCENE_33
 gives
 the best results.
 
 But as it will be deprecated this is not a lasting solution. Maybe
 somebody knows one?

 


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-words-synonyms-matching-tp3898950p3987728.html


Re: Multi-words synonyms matching

2012-06-01 Thread O. Klein
Looking for some more background information I stumbled upon
https://issues.apache.org/jira/browse/LUCENE-3668. If you read the last post
it confirms my issue. So maybe this is a bug?



Bernd Fehling-2 wrote
 
 Are you sure with LUCENE_33 (Use of BitVector)?
 
 
 Am 31.05.2012 17:20, schrieb O. Klein:
 I have been struggling with this as well and found that using LUCENE_33
 gives
 the best results.
 
 But as it will be deprecated this is not a lasting solution. Maybe
 somebody knows one?

 


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-words-synonyms-matching-tp3898950p3987241.html


Re: Multi-words synonyms matching

2012-05-31 Thread O. Klein
I have been struggling with this as well and found that using LUCENE_33 gives
the best results.

But as it will be deprecated this is not a lasting solution. Maybe somebody
knows one?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-words-synonyms-matching-tp3898950p3987048.html


Re: Solr edismax clarification

2012-02-17 Thread O. Klein

Indika Tantrigoda wrote
 
 Hi All,
 
 I am using edismax SearchHandler in my search and I have some issues in
 the
 search results. As I understand if the defaultOperator is set to OR the
 search query will be passed as  - The OR quick OR brown OR fox
 implicitly.
 
 

Did you also remove mm? If not, defaultOperator is ignored and the mm
setting is followed.
http://wiki.apache.org/solr/DisMaxQParserPlugin#mm_.28Minimum_.27Should.27_Match.29

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Solr-edismax-clarification-tp3751013p3753260.html


PatternReplaceFilterFactory group

2012-02-16 Thread O. Klein
PatternReplaceFilterFactory has no option to select the group to replace.

Is there a reason for this, or could this be a nice feature? 

--
View this message in context: 
http://lucene.472066.n3.nabble.com/PatternReplaceFilterFactory-group-tp3750201p3750201.html


RE: PatternReplaceFilterFactory group

2012-02-16 Thread O. Klein

steve_rowe wrote
 
 Hi O.,
 
 PatternReplaceFilter(Factory) uses Matcher.replaceAll() or replaceFirst(),
 both of which take in a string that can include any or all groups using
 the syntax $n, where n is the group number.  See the
 Matcher.appendReplacement() javadocs for an explanation of the
 functionality and syntax:
 <http://docs.oracle.com/javase/6/docs/api/java/util/regex/Matcher.html#appendReplacement%28java.lang.StringBuffer,%20java.lang.String%29>
 
 Steve
 
 -Original Message-
 From: O. Klein [mailto:klein@]
 Sent: Thursday, February 16, 2012 8:34 AM
 To: solr-user@.apache
 Subject: PatternReplaceFilterFactory group
 
 PatternReplaceFilterFactory has no option to select the group to replace.
 
 Is there a reason for this, or could this be a nice feature?
 
 --
 View this message in context:
 http://lucene.472066.n3.nabble.com/PatternReplaceFilterFactory-group-
 tp3750201p3750201.html
 Sent from the Solr - User mailing list archive at Nabble.com.
 

Thanks. I should get it working then.
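
For reference, the idea of picking a group in the replacement string looks like this in a small Python sketch. Note the syntax difference: Java's Matcher (which the filter uses, per Steve's reply) writes group references as $1, $2, while Python's `re.sub` writes them as \1 or \g<1>. The pattern and input below are made-up examples.

```python
import re

# Two capture groups: a word, a hyphen, then digits.
PATTERN = r"(\w+)-(\d+)"

# Keep only the second group (the Java replacement string would be "$2").
only_number = re.sub(PATTERN, r"\2", "item-42")

# Swap the groups (the Java replacement string would be "$2-$1").
swapped = re.sub(PATTERN, r"\2-\1", "item-42")

print(only_number)
print(swapped)
```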

--
View this message in context: 
http://lucene.472066.n3.nabble.com/PatternReplaceFilterFactory-group-tp3750201p3750650.html


Re: Highlighting stopwords

2012-02-15 Thread O. Klein

Koji Sekiguchi wrote
 
 (12/02/14 22:25), O. Klein wrote:
 I have not been able to find any logic in the behavior of hl.q and how it
 analyses the query. Could you explain how it is supposed to work?
 
 Nothing special on hl.q. If you use hl.q, the value of it will be used for
 highlighting rather than the value of q. There's no tricks, I think.
 
 koji
 -- 
 Apache Solr Query Log Visualizer
 http://soleami.com/
 

Field definitions:
content_text (no stopwords, only synonyms in index)
content_hl (stopwords, synonyms in index and query, and only field in hl.fl)

Searching is done with edismax on content_text

1. If I use a query like hl.q=spell Check, terms with uppercase are not
highlighted, but synonyms are (all fields have LowerCaseFilterFactory).

2. hl.q=content_hl:(spell Check) also highlights terms with uppercase, but
synonyms are not highlighted.

3. hl.q=content_hl:(spell Check) content_text:(spell Check) highlights terms
with uppercase and synonyms, but sometimes gives no highlights at all.

So if variant 1 also highlighted terms with uppercase, I would get the
behavior I need. I can work around this on the client side, but maybe it's a
bug?
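
For reference, the hl.q variants listed above can be written out as request
parameters. This is only a sketch: the q value, the qf setting and
hl.useFastVectorHighlighter are assumptions based on the setup described in
this thread (edismax on content_text, FVH, content_hl as the only hl.fl
field), and the variant names are made up.

```python
from urllib.parse import urlencode

# Assumed base request: edismax search on content_text, highlighting on content_hl.
BASE = {
    "q": "spell Check",
    "defType": "edismax",
    "qf": "content_text",
    "hl": "true",
    "hl.fl": "content_hl",
    "hl.useFastVectorHighlighter": "true",
}

# The hl.q variants from the message, keyed by a made-up descriptive name.
VARIANTS = {
    "plain": "spell Check",
    "hl_field": "content_hl:(spell Check)",
    "both_fields": "content_hl:(spell Check) content_text:(spell Check)",
}

def highlight_query_string(variant: str) -> str:
    """Return the URL-encoded query string for one hl.q variant."""
    params = {**BASE, "hl.q": VARIANTS[variant]}
    return urlencode(params)
```

Comparing the highlighting output for the same q across these variants is
probably the quickest way to pin down which analysis chain hl.q ends up
using.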

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3746817.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-02-14 Thread O. Klein

O. Klein wrote
 
 Hmm, now the synonyms aren't highlighted anymore.
 
 OK back to basic (im using trunk and FVH).
 
 What is the way to go about if I want to search on a field without
 stopwords, but still want to highlight the stopwords? (and still highlight
 synonyms and stemmed words)?
 

I made a new field, content_hl, to prevent problems coming from copyField.

When using hl.q=content_hl:(spell Check) I now get highlighting including
stopwords.

But when using hl.q=content_hl:(SC), where SC is a synonym, I get no
highlighting.

Can you verify if synonyms work when using hl.q?




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3743317.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-02-14 Thread O. Klein

O. Klein wrote
 
 
 O. Klein wrote
 
 Hmm, now the synonyms aren't highlighted anymore.
 
 OK back to basic (im using trunk and FVH).
 
 What is the way to go about if I want to search on a field without
 stopwords, but still want to highlight the stopwords? (and still
 highlight synonyms and stemmed words)?
 
 
 I made new field content_hl to prevent problems coming from copyField.
 
 When using hl.q=content_hl:(spell Check) I now get highlighting including
 stopwords.
 
 but when using hl.q=content_hl:(SC) where SC is synonym I get no
 highlighting.
 
 Can you verify if synonyms work when using hl.q?
 

OK I got it working by using hl.q=content_hl:(spell Check)
content_text:(spell Check) but it makes no sense to me.

The only difference between the two fields is the use of stopwords.

What's also weird is that a query like hl.q=content_spell:(SC) also
highlights synonyms, even though this field has no synonyms.

I have not been able to find any logic in the behavior of hl.q and how it
analyses the query. Could you explain how it is supposed to work?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3743616.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-02-14 Thread O. Klein

Koji Sekiguchi wrote
 
 Uh, what you tried was changing the field between q and hl.q, which is not
 a use case I expected when I proposed hl.q.
 
 Do you think that hl.text meets your needs?
 
 https://issues.apache.org/jira/browse/SOLR-1926?focusedCommentId=12871234&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-12871234
 
 koji
 -- 
 Apache Solr Query Log Visualizer
 http://soleami.com/
 

Well, if I understand it correctly, yes.

That is, if it means that queries are analyzed like the field they are
highlighting. That would give the highlighter a lot more flexibility.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3744054.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-02-13 Thread O. Klein

Koji Sekiguchi wrote
 
 (12/02/11 21:19), O. Klein wrote:

 Koji Sekiguchi wrote

 (12/01/24 9:31), O. Klein wrote:
 Let's say I search for "spellcheck solr" on a website that only contains
 info about Solr, so "solr" was added to the stopwords.txt. The query that
 will be parsed then (dismax) will not contain the term "solr".

 So fragments won't contain highlights of the term "solr". So when a
 fragment with the highlighted term "spellcheck" is generated, it would be
 less confusing for people who don't know how search engines work to also
 highlight the term "solr".

 So my first test was to have a field with StopFilterFactory and search on
 that field, while using another field without StopFilterFactory to
 highlight on. This didn't do the trick.

 Are you saying that using hl.q parameter on highlight field while using q
 on the search field that has StopFilter and hl.q doesn't work for you?

 koji
 --
 http://www.rondhuit.com/en/


 At first glance using hl.q did the trick. I just have problems when I am
 using terms with uppercase. Even though I use
 <filter class="solr.LowerCaseFilterFactory"/> on the highlighted field in
 both query and index, I do get search results, but just no highlights
 (lowercasing the terms fixes the problem).

 Can someone confirm whether this is a bug?
 
 I don't see your situation. Giving us concrete examples (especially
 request parameters
 including q and hl.q) would help a lot!
 
 koji
 -- 
 http://www.rondhuit.com/en/
 

I got it fixed now, I think.

I thought that if you used it like hl.q=spell Checker it would by default
use the query analysis of the field that was being highlighted. But in my
case it needs to be hl.q=content_hl:(spell Checker) for it to work. The
default behavior I got made no sense whatsoever.

Could you be so kind as to explain a bit more how hl.q is supposed to work,
with some examples?

Thanx.




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3740114.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-02-13 Thread O. Klein
Hmm, now the synonyms aren't highlighted anymore.

OK, back to basics (I'm using trunk and FVH).

How should I go about it if I want to search on a field without stopwords,
but still want to highlight the stopwords (and still highlight synonyms and
stemmed words)?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3740280.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-02-11 Thread O. Klein

Koji Sekiguchi wrote
 
 (12/01/24 9:31), O. Klein wrote:
 Let's say I search for "spellcheck solr" on a website that only contains
 info about Solr, so "solr" was added to the stopwords.txt. The query that
 will be parsed then (dismax) will not contain the term "solr".

 So fragments won't contain highlights of the term "solr". So when a
 fragment with the highlighted term "spellcheck" is generated, it would be
 less confusing for people who don't know how search engines work to also
 highlight the term "solr".

 So my first test was to have a field with StopFilterFactory and search on
 that field, while using another field without StopFilterFactory to
 highlight on. This didn't do the trick.
 
 Are you saying that using hl.q parameter on highlight field while using q
 on the search field that has StopFilter and hl.q doesn't work for you?
 
 koji
 -- 
 http://www.rondhuit.com/en/
 

At first glance using hl.q did the trick. I just have problems when I am
using terms with uppercase. Even though I use
<filter class="solr.LowerCaseFilterFactory"/> on the highlighted field in
both query and index, I do get search results, but just no highlights
(lowercasing the terms fixes the problem).

Can someone confirm whether this is a bug?

Thank you. 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3734892.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Multi word synonyms

2012-02-07 Thread O. Klein
Isn't that what autoGeneratePhraseQueries=true is for?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-word-synonyms-tp3716292p3723886.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Multi word synonyms

2012-02-07 Thread O. Klein
Well, if you want both multi word and single words I guess you will have to
create another field :) Or make queries like you suggested.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-word-synonyms-tp3716292p3724009.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: indexing data on solr

2012-02-05 Thread O. Klein
Read http://wiki.apache.org/solr/DataImportHandler for a better method. The
FileListEntityProcessor is what you are looking for.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/indexing-data-on-solr-tp3717111p3717208.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Multi word synonyms

2012-02-05 Thread O. Klein
Your query analyser will tokenize "simple sirup" into "simple" and "sirup"
and won't match on "simple syrup" in the synonyms.txt.

So you have to change the query analyzer to use KeywordTokenizerFactory as
well.

It might be an idea to make a field for synonyms only with this tokenizer and
another field to search on and use dismax. Never tried this though.
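
A toy model of the problem, not Solr's real analysis chain: if the query
text is split on whitespace before the synonym lookup, a multi-word entry
can never match, whereas a keyword-style tokenizer passes the whole string
through as one token. The synonym table below is an assumption about what
synonyms.txt contains.

```python
# Assumed synonym mapping: the misspelled phrase maps to the canonical one.
SYNONYMS = {"simple sirup": "simple syrup"}

def whitespace_tokenize(text):
    """Split on whitespace, one token per word."""
    return text.lower().split()

def keyword_tokenize(text):
    """Emit the whole input as a single token (KeywordTokenizer-like)."""
    return [text.lower()]

def apply_synonyms(tokens):
    """Replace each token that has an exact entry in the synonym table."""
    return [SYNONYMS.get(token, token) for token in tokens]
```

With whitespace_tokenize the lookup only ever sees "simple" and "sirup", so
nothing is replaced; with keyword_tokenize the whole phrase is looked up and
mapped.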

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Multi-word-synonyms-tp3716292p3717215.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Which patch 236 to choose for collapse - Solr 3.5

2012-02-02 Thread O. Klein
Doesn't group.ngroups=true give you the desired result?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3709711.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Which patch 236 to choose for collapse - Solr 3.5

2012-01-30 Thread O. Klein
You need to remove the component declared with
class="org.apache.solr.handler.component.CollapseComponent" from
solrconfig.xml, as that class no longer exists.

With group.format=simple you get output looking more like the output you are
used to.
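
Put together, a grouped request that returns a flat, collapse-like document
list plus the number of groups might look like the sketch below. The helper
name and the field passed in are hypothetical; group, group.field,
group.format and group.ngroups are the grouping parameters described on the
FieldCollapsing wiki page.

```python
from urllib.parse import urlencode

def grouping_query_string(query: str, group_field: str) -> str:
    """Build the query string for a grouped search with a flat result list."""
    return urlencode({
        "q": query,
        "group": "true",
        "group.field": group_field,   # field to group on, supplied by the caller
        "group.format": "simple",     # flat document list instead of nested groups
        "group.ngroups": "true",      # include the number of groups in the response
    })
```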



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3699595.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Which patch 236 to choose for collapse - Solr 3.5

2012-01-29 Thread O. Klein
You don't need to patch anymore. Field collapsing (Grouping) is standard in
Solr 3.3+

http://wiki.apache.org/solr/FieldCollapsing

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3697693.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Which patch 236 to choose for collapse - Solr 3.5

2012-01-29 Thread O. Klein
Copy paste the exception if you want help.

While grouping and collapsing are not exactly the same, the same behavior
can be achieved with grouping.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Which-patch-236-to-choose-for-collapse-Solr-3-5-tp3697685p3697773.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Suggester for Numbers

2012-01-29 Thread O. Klein
What suggester are you using? What is the config?

Or check
http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/
and
http://www.lucidimagination.com/blog/2009/09/08/auto-suggest-from-popular-queries-using-edgengrams/
for some nice examples.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Re-Suggester-for-Numbers-tp3698125p3698154.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Using SOLR Autocomplete for addresses (i.e. multiple terms)

2012-01-29 Thread O. Klein

Jan Høydahl / Cominvent wrote
 
 Hi,
 
 As you see, you've got an answer at StackOverflow already with a proposed
 solution to implement your own QueryConverter.
 
 Another way is to create a Solr core solely for Suggest, and tune it
 exactly the way you like. Then you can have it suggest from the whole
 input as well as individual tokens and weigh these as you choose, as well
 as implement phonetic normalization and other useful tricks.
 
 --
 Jan Høydahl, search solution architect
 Cominvent AS - www.cominvent.com
 Solr Training - www.solrtraining.com
 
 On 3. jan. 2012, at 00:52, Dave wrote:
 
 
Jan,

Awesome autosuggester. Very flexible and something I have been looking for,
for a long time.  Thanx for sharing this gem!

Tip: to enable phonetic searching, change phonetic field in qf to
textphon. Or add phonetic as a copyfield.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Using-SOLR-Autocomplete-for-addresses-i-e-multiple-terms-tp3627856p3698254.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: phrase auto-complete with suggester component

2012-01-25 Thread O. Klein

Tommy Chheng-2 wrote
 
 Thanks, I'll try out the custom class file. Any possibilities this
 class can be merged into solr? It seems like an expected behavior.
 
 
 On Tue, Jan 24, 2012 at 11:29 AM, O. Klein <klein@> wrote:
 You might wanna read
 http://lucene.472066.n3.nabble.com/suggester-issues-td3262718.html#a3264740
 which contains the solution to your problem.

 
 -- 
 Tommy Chheng
 

I agree. Suggester could use some attention. Looking at Wiki there were some
features planned, but not much has happened lately.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/phrase-auto-complete-with-suggester-component-tp3685572p3687495.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: phrase auto-complete with suggester component

2012-01-25 Thread O. Klein

O. Klein wrote
 
 I agree. Suggester could use some attention. Looking at Wiki there were
 some features planned, but not much has happened lately.
 

Or check out this post
http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/
looking very promising as an alternative.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/phrase-auto-complete-with-suggester-component-tp3685572p3689240.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-01-24 Thread O. Klein
Ah, I never used the hl.q

That did the trick. Thanx!

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3684245.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Advanced stopword handling edismax

2012-01-24 Thread O. Klein

O. Klein wrote
 
 As I understand it with edismax in trunk, whenever you have a query that
 only contains stopwords then all the terms are required.
 
 But when I try this I only get an empty parsedQuery like: (+() () () () ()
 () () () () () ()
 FunctionQuery((1.0/(3.16E-11*float(ms(const(132710400),date(date_dt)))+1.0))^50.0))/no_coord
 
 Am I misunderstanding this feature? Or is something going wrong?
 

Can someone at least confirm that when using edismax and a query like "to be
or not to be" (for English stopword list) the parsed query is empty?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Advanced-stopword-handling-edismax-tp3677878p3684599.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: phrase auto-complete with suggester component

2012-01-24 Thread O. Klein
You might wanna read
http://lucene.472066.n3.nabble.com/suggester-issues-td3262718.html#a3264740
which contains the solution to your problem.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/phrase-auto-complete-with-suggester-component-tp3685572p3685730.html
Sent from the Solr - User mailing list archive at Nabble.com.


Highlighting stopwords

2012-01-23 Thread O. Klein
I'm using trunk and FVH and even though I filter stopwords when searching, I
would like to highlight stopwords in fragments. Using a different field
without the stopwords filter did not have the desired effect.

Is there a way to do this?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3681901.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlighting stopwords

2012-01-23 Thread O. Klein

Koji Sekiguchi wrote
 
 (12/01/23 23:14), O. Klein wrote:
 I'm using trunk and FVH and even though I filter stopwords when searching,
 I would like to highlight stopwords in fragments. Using a different field
 without the stopwords filter did not have the desired effect.
 
 Please provide more info. In particular, what your query looks like, the
 expected results, actual results and the filter definition in schema.xml.
 
 Thank you,
 
 koji
 -- 
 http://www.rondhuit.com/en/
 

Let's say I search for "spellcheck solr" on a website that only contains
info about Solr, so "solr" was added to the stopwords.txt. The query that
will be parsed then (dismax) will not contain the term "solr".

So fragments won't contain highlights of the term "solr". So when a fragment
with the highlighted term "spellcheck" is generated, it would be less
confusing for people who don't know how search engines work to also
highlight the term "solr".

So my first test was to have a field with StopFilterFactory and search on
that field, while using another field without StopFilterFactory to highlight
on. This didn't do the trick.

So thinking about it some more, I think some terms don't need to be
highlighted (e.g. "the"), some need to be highlighted but don't need to
affect the scoring of a document (e.g. "solr"), and others need to be
highlighted and affect scoring (e.g. "spellcheck").

What are your thoughts on this?



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlighting-stopwords-tp3681901p3683452.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Different mm for spellcheckquery

2012-01-21 Thread O. Klein

Dyer, James wrote
 
 I thought of a way you could do this with one query, if using edismax.  If
 you use spellcheck.q and insert AND between each keyword you'll make
 all the terms required regardless of the mm parameter.  I quickly tried
 this out and it seems to work if you use AND but not if you prefix all
 the terms with plus-signs.
 
 example:
 
 q=mispelt werds&defType=edismax&mm=0&spellcheck.q=mispelt AND werds ...
 {other spellcheck parameters}
 
 James Dyer
 E-Commerce Systems
 Ingram Content Group
 (615) 213-4311
 
 

Thanx for your answer. This could work for me.
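
James's suggestion can be sketched as a small helper that derives
spellcheck.q from the user's query by joining the keywords with AND.
Splitting on whitespace is a simplification (it ignores phrases and
operators already present in the query), and spellcheck=true is an assumed
extra parameter.

```python
from urllib.parse import urlencode

def spellcheck_params(user_query: str) -> dict:
    """Search with mm=0, but make every term required in spellcheck.q."""
    terms = user_query.split()
    return {
        "q": user_query,
        "defType": "edismax",
        "mm": "0",
        "spellcheck": "true",
        "spellcheck.q": " AND ".join(terms),
    }

def spellcheck_query_string(user_query: str) -> str:
    """URL-encode the parameters for use in a request."""
    return urlencode(spellcheck_params(user_query))
```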


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Different-mm-for-spellcheckquery-tp3669200p3677857.html
Sent from the Solr - User mailing list archive at Nabble.com.


Advanced stopword handling edismax

2012-01-21 Thread O. Klein
As I understand it with edismax in trunk, whenever you have a query that only
contains stopwords then all the terms are required.

But when I try this I only get an empty parsedQuery like: (+() () () () ()
() () () () () ()
FunctionQuery((1.0/(3.16E-11*float(ms(const(132710400),date(date_dt)))+1.0))^50.0))/no_coord

Am I misunderstanding this feature? Or is something going wrong?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Advanced-stopword-handling-edismax-tp3677878p3677878.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Improving Solr Spell Checker Results

2012-01-18 Thread O. Klein

Dyer, James wrote
 
 David,
 
 The spellchecker normally won't give suggestions for any term in your
 index.  So even if "wever" is misspelled in context, if it exists in the
 index the spell checker will not try correcting it.  There are 3
 workarounds:
 1. Use the patch included with SOLR-2585 (this is for Trunk/4.x only). 
 See https://issues.apache.org/jira/browse/SOLR-2585
 

When using trunk and DirectSolrSpellChecker I do get suggestions for terms
that are in the index. Lowering the thresholdTokenFrequency to 0.001 in my
case is giving me very good suggestions even if documents with the
misspelled word in them were found.

This combined with maxCollationTries (with all terms required) is giving
some sort of context sensitive suggestions.

Is this correct or is there something I'm missing?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Improving-Solr-Spell-Checker-Results-tp3658411p3669186.html
Sent from the Solr - User mailing list archive at Nabble.com.


Different mm for spellcheckquery

2012-01-18 Thread O. Klein
What is the best way to search with a mm of 0%, but use a mm of 100% on the
spellcheck query so maxCollationTries gives the best results?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Different-mm-for-spellcheckquery-tp3669200p3669200.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlight with multi word synonyms

2012-01-09 Thread O. Klein
Koji, maybe you missed my confirmation due to the hijacking of the thread.

I am using Solr 4.0 and after reindexing with LUCENE_33 I got the behaviour
for highlighting I want. So yeah, I can confirm this is a bug. 

Looking forward to a fix :)


Koji Sekiguchi wrote
 
 (11/12/24 21:20), O. Klein wrote:
 Copy pasted following text from lucene mailing list as it describes my
 problem:

 I'm trying to use multi-word synonyms. For example in my synonyms file I
 have "nhl, national hockey league". If I do this index only, a search for
 "nhl" returns a correct match, but highlights the first word only,
 "national". Ideally, it would highlight "national hockey league" or not
 highlight at all. If I do the synonyms at both index and query time, it
 finds the match and does the correct highlighting, but I understand it is
 not ideal to do synonyms at index and query time.
 I am expanding synonyms and using edismax. Thoughts?

 I really only want to use synonym expansion on index only, but this
 behaviour makes that impossible.

 Please advise.
 
 I think this is a bug.
 
 I can reproduce the same problem with Solr 4.0. But if I set
 <luceneMatchVersion>LUCENE_33</luceneMatchVersion> in solrconfig.xml and
 re-index, the problem is gone.
 
 What version of Solr are you using? Can you try LUCENE_33 and report the
 result?
 
 koji
 -- 
 http://www.rondhuit.com/en/
 


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlight-with-multi-word-synonyms-tp3610466p3644439.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlight with multi word synonyms

2012-01-09 Thread O. Klein
Thanx!

Not looking at Lucene project I totally missed that.

Keep up the good work.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlight-with-multi-word-synonyms-tp3610466p3644729.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Highlight with multi word synonyms

2011-12-25 Thread O. Klein
I am using Solr 4.0 and after reindexing with LUCENE_33 I got the behaviour
for highlighting I want. So yeah, I can confirm this is a bug.



Koji Sekiguchi wrote
 
 (11/12/24 21:20), O. Klein wrote:
 Copy pasted following text from lucene mailing list as it describes my
 problem:

 I'm trying to use multi-word synonyms. For example in my synonyms file I
 have "nhl, national hockey league". If I do this index only, a search for
 "nhl" returns a correct match, but highlights the first word only,
 "national". Ideally, it would highlight "national hockey league" or not
 highlight at all. If I do the synonyms at both index and query time, it
 finds the match and does the correct highlighting, but I understand it is
 not ideal to do synonyms at index and query time.
 I am expanding synonyms and using edismax. Thoughts?

 I really only want to use synonym expansion on index only, but this
 behaviour makes that impossible.

 Please advise.
 
 I think this is a bug.
 
 I can reproduce the same problem with Solr 4.0. But if I set
 <luceneMatchVersion>LUCENE_33</luceneMatchVersion> in solrconfig.xml and
 re-index, the problem is gone.
 
 What version of Solr are you using? Can you try LUCENE_33 and report the
 result?
 
 koji
 -- 
 http://www.rondhuit.com/en/
 


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlight-with-multi-word-synonyms-tp3610466p3612118.html
Sent from the Solr - User mailing list archive at Nabble.com.


Highlight with multi word synonyms

2011-12-24 Thread O. Klein
Copy pasted following text from lucene mailing list as it describes my
problem:

I'm trying to use multi-word synonyms. For example in my synonyms file I
have "nhl, national hockey league". If I do this index only, a search for
"nhl" returns a correct match, but highlights the first word only,
"national". Ideally, it would highlight "national hockey league" or not
highlight at all. If I do the synonyms at both index and query time, it
finds the match and does the correct highlighting, but I understand it is
not ideal to do synonyms at index and query time.
I am expanding synonyms and using edismax. Thoughts?

I really only want to use synonym expansion on index only, but this
behaviour makes that impossible.

Please advise.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Highlight-with-multi-word-synonyms-tp3610466p3610466.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Return message from XmlUpdateRequestHandler

2011-12-11 Thread O. Klein
I was wondering if there was some way to know whether a delete action was
successful or not.

I want to delete a document from the index and get a response like "1
document deleted" or similar.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Return-message-from-XmlUpdateRequestHandler-tp3575400p3577687.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: DIH full import and clean

2011-12-10 Thread O. Klein
I get the behaviour I want when I delete all docs with an XML update and then
do a full-import with clean=false, which does run both root entities. I
never got preImportDeleteQuery to work with clean=false, or got DIH to run
both root entities with clean=true.




O. Klein wrote
 
 Can someone explain to me why, when I run full import with clean on it
 only runs the last entity and with clean off I get the behaviour I want
 (runs both entities)?
 
 I thought clean was only to clear the index before running.
 




--
View this message in context: 
http://lucene.472066.n3.nabble.com/DIH-full-import-and-clean-tp3574065p3575270.html
Sent from the Solr - User mailing list archive at Nabble.com.


Return message from XmlUpdateRequestHandler

2011-12-10 Thread O. Klein
Is there a way to get feedback from XML update messages like:

http://localhost:8983/solr/update?commit=true&stream.body=%3Cdelete%3E%3Cquery%3Eoffice:Bridgewater%3C/query%3E%3C/delete%3E
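
The same request with the parameters kept apart and URL-encoded, as a
sketch; the host and port are the ones from the URL above, and the helper
name is made up.

```python
from urllib.parse import urlencode
from xml.sax.saxutils import escape

SOLR_UPDATE_URL = "http://localhost:8983/solr/update"

def delete_by_query_url(query: str) -> str:
    """Build an update URL that deletes all documents matching `query`."""
    # Escape &, < and > so the query cannot break the XML delete message.
    body = "<delete><query>{}</query></delete>".format(escape(query))
    return SOLR_UPDATE_URL + "?" + urlencode({"commit": "true", "stream.body": body})
```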

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Return-message-from-XmlUpdateRequestHandler-tp3575400p3575400.html
Sent from the Solr - User mailing list archive at Nabble.com.


DIH full import and clean

2011-12-09 Thread O. Klein
Can someone explain to me why, when I run full import with clean on it only
runs the last entity and with clean off I get the behaviour I want (runs
both entities)?

I thought clean was only to clear the index before running.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/DIH-full-import-and-clean-tp3574065p3574065.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Language Detection in SOLR 3.5

2011-12-08 Thread O. Klein
Did you also install apache-solr-langid-4.0.jar ?

And the jars in contrib/langid/lib ?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Language-Detection-in-SOLR-3-5-tp3570177p3570202.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: when using group=true facet numbers are incorrect

2011-12-01 Thread O. Klein
https://issues.apache.org/jira/browse/SOLR-2898 has been created for this.

Thanx Martijn!

--
View this message in context: 
http://lucene.472066.n3.nabble.com/when-using-group-true-facet-numbers-are-incorrect-tp3488605p3551741.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: when using group=true facet numbers are incorrect

2011-11-30 Thread O. Klein

Yonik Seeley-2-2 wrote
 
 On Mon, Nov 7, 2011 at 8:55 PM, Chris Hostetter
 <hossman_lucene@> wrote:

 : I understand that's a valid thing for faceting to do, I was just wondering
 : if there's any way to get it to do the faceting on the groups returned.
 : Otherwise I guess I'll need to convince the UI people to just show the
 : facets without the numbers.

 what you are asking about is generally referred to as "post-group faceting"
 and can be activated using group.truncate...
 
 We don't have true post group faceting currently (i.e. where the
 units for facet counts would be numbers of groups, not numbers of
 documents).
 group.truncate just truncates the list of documents in each group, and
 faceting still returns numbers of documents, not numbers of groups.
 This is why I advocated the name group.truncate instead of
 group.after, and have avoided any mention of post grouping on the
 wiki page.
 
 -Yonik
 http://www.lucidimagination.com
 

How would you go about getting the number of groups?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/when-using-group-true-facet-numbers-are-incorrect-tp3488605p3549411.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Preventing DIH to stop on error

2011-11-13 Thread O. Klein
I think it is best the person who commits this to trunk makes a new issue.

I am not a programmer, so I don't want to mess things up.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Preventing-DIH-to-stop-on-error-tp3496436p3504938.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Preventing DIH to stop on error

2011-11-11 Thread O. Klein

O. Klein wrote:
 
 org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir'
 value: url_example is not a directory Processing Document # 1
   at
 org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.java:123)
 
 I added onError=skip and onError=continue to all entities, but no
 luck.
 
 Do I need to apply patch https://issues.apache.org/jira/browse/SOLR-1597 ?
 
 Or is there another way?
 

The patch doesn't work anymore with current trunk, but should this feature
not be in trunk anyways?

Having an option to abort or skip an import is a lot better than just
aborting the import.

Who do I have to bribe to get this into trunk? :)

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Preventing-DIH-to-stop-on-error-tp3496436p3499549.html
Sent from the Solr - User mailing list archive at Nabble.com.


Preventing DIH to stop on error

2011-11-10 Thread O. Klein
org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir'
value: url_example is not a directory Processing Document # 1
at
org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.java:123)

I added onError=skip and onError=continue to all entities, but no luck.

Do I need to apply patch https://issues.apache.org/jira/browse/SOLR-1597 ?

Or is there another way?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Preventing-DIH-to-stop-on-error-tp3496436p3496436.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Combine XML data with DIH

2011-10-16 Thread O. Klein

O. Klein wrote:
 
 
 O. Klein wrote:
 
 I have a folder with XML files
 
 1.xml contains:
 <id>http://www.site.com/1.html</id>
 <content>blacontent</content>
 <title>blatitle</title>
 
 2.xml contains:
 <id>http://www.site.com/1.html</id>
 <title>blatitle2</title>
 
 I want to create this document in Solr:
 
 <id>http://www.site.com/1.html</id>
 <content>blacontent</content>
 <title>blatitle2</title>
 
 
 
 I changed my problem in the quotes as it's a little different and
 hopefully easier to solve.
 
 Can this be done with DIH? And how?
 

Hmm, I tried to index all docs and JOIN them on id. This didn't work as it
only shows the fields in the linked document.

Is there some way to show all the fields of the combined documents?
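
The merge asked for in the quoted message can be illustrated outside DIH.
This is a plain Python sketch of the desired outcome, not a DIH
configuration: records sharing an id are combined, and fields from later
files overwrite earlier ones. The dictionaries passed in stand in for the
parsed XML files.

```python
def merge_by_id(records):
    """Combine records with the same 'id'; later records override earlier fields."""
    merged = {}
    for record in records:
        # Create the combined document on first sight, then layer fields on top.
        merged.setdefault(record["id"], {}).update(record)
    return list(merged.values())
```

Feeding it the contents of 1.xml followed by 2.xml yields one document with
the content from the first file and the title from the second.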


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Combine-XML-data-with-DIH-tp3209413p3425844.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Combine XML data with DIH

2011-10-15 Thread O. Klein

O. Klein wrote:
 
 I have a folder with XML files
 
 1.xml contains:
 <id>http://www.site.com/1.html</id>
 <content>blacontent</content>
 
 2.xml contains:
 <id>http://www.site.com/1.html</id>
 <title>blatitle</title>
 
 I want to create this document in Solr:
 
 <id>http://www.site.com/1.html</id>
 <content>blacontent</content>
 <title>blatitle</title>
 
 

I changed my problem in the quotes as it's a little different and hopefully
easier to solve.

Can this be done with DIH? And how?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Combine-XML-data-with-DIH-tp3209413p3423888.html
Sent from the Solr - User mailing list archive at Nabble.com.

