On Mon, Jan 08, 2007, Nicolas Lalevée wrote about "Re: Payloads":
> I have looked closer to how lucene index, and I realized that for the facet
> feature, the kind of payload handling by Michael's patch are not designed for
> that. In this patch, the payloads are in the posting, ie in the tis, frq, prx
> files. Payload at the document level, that would be accessed in a scorer,
> should be better in the TermVector files, which are ordered by docs and not
> by term.

Well, it's sort of the same thing... Michael's patch allows putting payloads
at each position in a posting list; if you create a posting list which has
just one position per doc, you have basically created a per-doc payload,
ordered by doc (like all posting lists).

And creating this posting list is easy: just pick an arbitrary field name F
and an arbitrary word W, and index the term (F,W) once in each document, with
the payload you want for that document (basically, the list of categories
that the document belongs to).

I'm not saying this is the best way to do it, and certainly not the cleanest,
but it's just one of the things that payloads enable you to do.

-- 
Nadav Har'El                        | Wednesday, Jan 10 2007, 20 Tevet 5767
IBM Haifa Research Lab              |-----------------------------------------
                                    | Lumber Cartel member #2224.
http://nadav.harel.org.il           | http://lumbercartel.freeyellow.com/
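
To make the (F,W) trick concrete, here is a rough sketch in plain Java. It
does not use Lucene or the API from Michael's patch; the class
PerDocPayloadSketch, its methods, and the comma-separated payload encoding
are all made up for illustration. It only models the idea: one artificial
term, one position per document, and the category list stored as that
position's payload, so the posting list can be walked in doc order.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy in-memory model of a posting list with payloads. NOT Lucene's API:
// every name here is hypothetical and exists only for this illustration.
final class PerDocPayloadSketch {
    static final String FIELD = "F"; // arbitrary field name
    static final String WORD = "W";  // arbitrary word
    static final String TERM = FIELD + ":" + WORD;

    // One posting: a doc id and the payload attached to its single position.
    static final class Posting {
        final int docId;
        final byte[] payload;

        Posting(int docId, byte[] payload) {
            this.docId = docId;
            this.payload = payload;
        }
    }

    // term -> posting list; postings are appended in increasing doc id order.
    private final Map<String, List<Posting>> postings = new HashMap<>();
    private int nextDocId = 0;

    // Index the artificial term (F,W) once for this document, with the
    // category list as payload. Assumes category names contain no commas.
    int addDocument(List<String> categories) {
        int docId = nextDocId++;
        byte[] payload = String.join(",", categories).getBytes(StandardCharsets.UTF_8);
        postings.computeIfAbsent(TERM, k -> new ArrayList<>()).add(new Posting(docId, payload));
        return docId;
    }

    // Walk the (F,W) posting list in doc order, the way a scorer would,
    // and decode the payload of the requested document.
    List<String> categoriesOf(int docId) {
        List<Posting> list = postings.getOrDefault(TERM, Collections.<Posting>emptyList());
        for (Posting p : list) {
            if (p.docId == docId) {
                if (p.payload.length == 0) {
                    return Collections.emptyList();
                }
                return Arrays.asList(new String(p.payload, StandardCharsets.UTF_8).split(","));
            }
            if (p.docId > docId) {
                break; // list is ordered by doc, so the doc is not present
            }
        }
        return Collections.emptyList();
    }

    public static void main(String[] args) {
        PerDocPayloadSketch index = new PerDocPayloadSketch();
        int first = index.addDocument(Arrays.asList("books", "fiction"));
        int second = index.addDocument(Arrays.asList("music"));
        System.out.println(first + " -> " + index.categoriesOf(first));
        System.out.println(second + " -> " + index.categoriesOf(second));
    }
}
```

In a real index the payload bytes would be written at index time and read
back while iterating the positions of (F,W); the linear scan above just
stands in for advancing along that posting list in doc order.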