Erik,

It may be worth looking at the code here: http://issues.apache.org/jira/browse/LUCENE-328
The BitSets in your example are likely to be very sparse (I imagine you know only too well how long it takes to write a book, and therefore how many books there are likely to be per author! :)). With such a sparse set per author, BitSets could use a lot of memory, since each one costs a bit per document in the index however few bits are set. In this example I imagine a SortedVIntList per author would be a much more compact format.

The code in the link contains a standard interface for a sorted list of ints, with BitSet, int array and VInt-encoded implementations. The AndDocNrSkipper and OrDocNrSkipper classes can be used to perform set intersections on any combination of these int sets.

Cheers,
Mark
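
P.S. To make the memory argument concrete, here is a rough sketch of the idea. It is not the code from LUCENE-328: the class name SparseDocSetSketch and its methods are made up for illustration. It assumes doc numbers are non-negative and strictly increasing, stores the gaps between them as VInts (7 bits per byte, high bit meaning "more bytes follow"), and intersects two sorted int arrays with a simple merge.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Hypothetical illustration only; not the SortedVIntList or
// AndDocNrSkipper classes from the LUCENE-328 patch.
public class SparseDocSetSketch {

    // Encode strictly increasing, non-negative doc numbers as VInt-coded gaps.
    public static byte[] encode(int[] sortedDocs) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int previous = 0;
        for (int doc : sortedDocs) {
            int gap = doc - previous;
            // Emit 7 bits at a time, low-order first; high bit marks continuation.
            while ((gap & ~0x7F) != 0) {
                out.write((gap & 0x7F) | 0x80);
                gap >>>= 7;
            }
            out.write(gap);
            previous = doc;
        }
        return out.toByteArray();
    }

    // Decode the byte stream produced by encode() back into doc numbers.
    public static int[] decode(byte[] bytes) {
        // Every encoded value ends with exactly one byte whose high bit is clear.
        int count = 0;
        for (byte b : bytes) {
            if ((b & 0x80) == 0) count++;
        }
        int[] docs = new int[count];
        int pos = 0;
        int previous = 0;
        for (int i = 0; i < count; i++) {
            int gap = 0;
            int shift = 0;
            byte b;
            do {
                b = bytes[pos++];
                gap |= (b & 0x7F) << shift;
                shift += 7;
            } while ((b & 0x80) != 0);
            previous += gap;
            docs[i] = previous;
        }
        return docs;
    }

    // Intersect two sorted int arrays by walking both in step.
    public static int[] intersect(int[] a, int[] b) {
        int[] result = new int[Math.min(a.length, b.length)];
        int i = 0, j = 0, n = 0;
        while (i < a.length && j < b.length) {
            if (a[i] < b[j]) {
                i++;
            } else if (a[i] > b[j]) {
                j++;
            } else {
                result[n++] = a[i];
                i++;
                j++;
            }
        }
        return Arrays.copyOf(result, n);
    }

    public static void main(String[] args) {
        int[] authorDocs = {3, 170, 100000};
        byte[] packed = encode(authorDocs);
        System.out.println("encoded bytes: " + packed.length);
        System.out.println(Arrays.toString(decode(packed)));
        System.out.println(Arrays.toString(
                intersect(authorDocs, new int[] {5, 170, 99999, 100000})));
    }
}
```

For the three doc numbers in main() the encoded form is 6 bytes, whereas a BitSet for an index of a million documents needs about 125,000 bytes per author regardless of how few bits are set.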