Re: default text type and stop words

2007-11-07 Thread Will Martin
Thanks Mike. I realized the issue was not defined. I appreciate the guidance about process...very much. Billy Mike Klaas wrote: On 2-Nov-07, at 11:02 PM, [EMAIL PROTECTED] wrote: In a message dated 11/2/07 6:54:25 PM, [EMAIL PROTECTED] writes: Even if the actual problem is at the Lucen

Re: default text type and stop words

2007-11-06 Thread Grant Ingersoll
Another alternative that is to selectively use stopwords as in phrases or other places where they have meaning. In the past, stopword removal was mostly done to save disk space and some computation, but disk is cheap and computation, well, they can help you have better results if done righ

Re: default text type and stop words

2007-11-05 Thread Walter Underwood
I also said, "Stopword removal is a reasonable default because it works fairly well for a general text corpus." Ultraseek keeps stopwords but most engines don't. I think it is fine as a default. I also think you have to understand stopwords at some point. wunder On 11/5/07 9:59 PM, "Chris Hostett

Re: default text type and stop words

2007-11-05 Thread Chris Hostetter
: This isn't a problem in Lucene or Solr. It is a result of the analyzers : you have chosen to use. If you choose to remove stopwords, you will not : be able to match stopwords. I believe paul's point was that this use of stopwords is in the "text" fieldtype in the example schema.xml ... which m

Re: default text type and stop words

2007-11-05 Thread Walter Underwood
nted out an issue in the defaults. > > > I also didn't say not to deal with a bug that hypothetically could be in > a tightly coupled dependency. > > Paul > > -Original Message- > From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] > Sent: Friday, Novemb

Re: default text type and stop words

2007-11-05 Thread Yonik Seeley
On 11/5/07, Sundling, Paul <[EMAIL PROTECTED]> wrote: > I don't know if the problem is in Lucene, I didn't investigate further. Yes, this is standard lucene behavior, working-as-designed. Stop words are removed from the query as if they never existed at all (this makes some sense because they were

RE: default text type and stop words

2007-11-05 Thread Sundling, Paul
y. I merely pointed out an issue in the defaults. I also didn't say not to deal with a bug that hypothetically could be in a tightly coupled dependency. Paul -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Friday, November 02, 2007 11:02 PM To: solr-dev@lucene.

Re: default text type and stop words

2007-11-05 Thread Mike Klaas
On 2-Nov-07, at 11:02 PM, [EMAIL PROTECTED] wrote: In a message dated 11/2/07 6:54:25 PM, [EMAIL PROTECTED] writes: Even if the actual problem is at the Lucene level, perhaps it would be worth considering changes to the default to get around it. newbie here. is this common practice? fi

Re: default text type and stop words

2007-11-02 Thread MartinWmMo
In a message dated 11/2/07 6:54:25 PM, [EMAIL PROTECTED] writes: > Even if > the actual problem is at the Lucene level, perhaps it would be worth > considering changes to the default to get around it. > newbie here. is this common practice? find a bug in a tightly coupled dependency and not d

Re: default text type and stop words

2007-11-02 Thread Walter Underwood
Stopwords are fairly common in movie titles. There are even titles made entirely of stopwords. The first one I noticed was "Being There". I posted more of them here: http://wunderwood.org/most_casual_observer/2007/05/invisible_titles.html wunder == Search Guy Netflix On 11/2/07 3:53 PM, "Sundlin