: Interesting if you are not going to use an analyser... what then ? I'm : thinking of using javacc, because I oversimplified somewhat the 3 field : string structure, so I need a kind of small grammar for that.
Well, the specifics of "what else" is in your files is going to be the biggest factor in deciding how to find the bits of info you need. Let me try to put in perspective for you how your question sounded to me, as someone unfamiliar with your specific problem. the question sounded equivilent to if someone had asked; "I have a bunch of XML files, some of these XML files contain syntax that loks like this... <property name="${keyword}" min="${x}" max="${y}" /> where ${x} and ${y} are small numbers, and ${keyword} is from a fixed list of words. My idea is to simply build a TokenFilter that will look for those... do I have it right ?" ...and i would say: "Not really. Use an XML parser to parse your XML and extract your structured data, then add them to your Lucene Document." You're files may not be XML, but the basic premise is the same; use whatever code makes the most sense to parse whatever file format you are dealing with given what you know aboutthe files (not just the parts you want, but the other parts as well) Where an Analyzer might make sense is if you want to do processing on those bits of data after you find them ... stemming your keywords, or mapping them to synonyms, etc... -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]