Re: Field methods and usage

karl wettin Wed, 31 Jan 2007 16:50:25 -0800

On 31 Jan 2007, at 12:25, Christoph Pächter wrote:

I was wondering, if there is anywhere a table (similar to Table 1.2 An overview of different field types, their characteristics, and their usage in Lucene in
Action), listing the possible methods and their usage.


Implementations will differ, for example:


Store   |TermVector   |Index   |reasonable |Usage
YES     |NO           |NO      |1          |URLs
        |             |        |           |telephone number

You never have to store anything in the index; perhaps that information is already persisted somewhere else?

Whether you use a term vector depends very little on what kind of information you store in the field; it depends on what kind of analysis you plan to run on the documents. Highlighting? More like this? Neural networks?

Some are more than happy with one large token. Other people might want to tokenize the exact same information.

A URL could be split into [protocol://host:port/path], a phone number into country, area, and district parts.
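
As a rough illustration of the URL case, here is a minimal sketch that splits a URL into protocol, host, port and path using only java.net.URI from the JDK. The class name UrlParts and the method split are made up for this example and are not part of Lucene; in a real index each part might become its own token or its own field, depending on what you want to search on.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not a Lucene class: breaks a URL into the
// parts one might want to index separately.
class UrlParts {

    // Returns the protocol, host, port and path of the URL, in that
    // order, skipping any part that is absent.
    static List<String> split(String url) {
        URI uri = URI.create(url);
        List<String> parts = new ArrayList<>();
        if (uri.getScheme() != null) {
            parts.add(uri.getScheme());
        }
        if (uri.getHost() != null) {
            parts.add(uri.getHost());
        }
        if (uri.getPort() != -1) { // -1 means no explicit port was given
            parts.add(String.valueOf(uri.getPort()));
        }
        if (uri.getPath() != null && !uri.getPath().isEmpty()) {
            parts.add(uri.getPath());
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(split("http://example.org:8080/docs/index.html"));
    }
}
```

Someone who only ever looks up whole URLs would skip this step and keep the URL as one large token.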

It is really up to each and every implementer to decide what settings are best for them.

Also, a Lucene index is not made up of static rows and columns the way a relational database is. The spoon does not exist. You can bend it any way you want. Documents in a corpus can have fields that share names but not settings. Perhaps you only want to index phone numbers in a specific area code.


--
karl
