.
Hope it helps...
Rajesh Munavalli
-Original Message-
From: Yonik Seeley [mailto:[EMAIL PROTECTED]
Sent: Mon 8/15/2005 7:47 PM
To: java-user@lucene.apache.org
Subject: Re: intra-word delimiters
That was the plan, but step (4) really seems problematic.
- term expansion this way can
On Aug 15, 2005, at 8:53 PM, Marvin Humphrey wrote:
Create a phrase query that when it encounters ab => { tokenlength
=> 2 } knows to look for something at position 3.
Fencepost error! That should have been "position 2".
Not that correcting the error makes the algo any more practical. ;)
On Aug 15, 2005, at 7:47 PM, Yonik Seeley wrote:
That was the plan, but step (4) really seems problematic.
- term expansion this way can lead to a lot of false matches
- phrase queries with many bordering words break
- settingt term positions such that phrase queries work on all combos
of subw
That was the plan, but step (4) really seems problematic.
- term expansion this way can lead to a lot of false matches
- phrase queries with many bordering words break
- settingt term positions such that phrase queries work on all combos
of subwords is non-trivial.
It seems like a better approach
On Aug 15, 2005, at 3:16 PM, Yonik Seeley wrote:
Another example:
Source Text contains "Canon Powershot SD500 7MP Digital Elph"
And I want to be able to match the following user queries:
Power Shot SD 500
CanonPowerShotSD500
SD 500 7 MP digitalelph
Canon-Powershot-SD 500
Any ideas?
How abou
Does anyone have solutions for handling intraword delimiters (case
changes, non-alphanumeric chars, and alpha-numeric transitions)?
If the source text is Wi-Fi, we want to be able to match the following
user queries:
wi fi
wifi
wi-fi
wi+fi
WiFi
One way is to index "wi", "fi", and "wifi".
However