Re: Breaking up words with a certain pattern and search by parts

2002-04-07 Thread Neil Pitman
Hi Sheldon, It was my understanding that you should parse the input text yourself (since you understand the deeper semantics). When you see "MEM12345" you can add {"MEM12345", "MEM" and "12345"} into the words to index. This is similar to converting the words to lowercase or stripping accent

Re: Breaking up words with a certain pattern and search by parts

2002-04-05 Thread Peter Carlson
Hi Sheldon, You will need to write a custom query parser. However, the current queryParser.jj is the right place to start. You can easily modify this to have a pattern of _LETTER only and _DIGIT only. Currently it allows numbers and letters to be combined as a term. If you have more questions,

Breaking up words with a certain pattern and search by parts

2002-04-05 Thread Sheldon Shi
In my project I would like to search for product code such as MEM12345 either by "MEM" or by "12345". I can't do that right now in Lucene 1.2. Prefix query doesn't do prefix search followed by numbers, and there is no "end with" type of search. How do I modify the HTMLParser to index MEM12345 as