Apologies for my flurry of questions.  Here is today's:

Is it normal behavior for the database made by xindice to take up slightly
more than double the amount of disk space as the files that you loaded into
it did?  Or am I really missing something?  I thought that by using
compression on the documents, this database should be smaller (or at least
comparable) than the files by themselves.

I have a directory with 19565 files in it, at an average of 2K each, for a
total of ~38 MB.  I loaded these into one collection, and now that
collection has a size of 80 MB.  

Here is how I created the collection (taken from the examples - I can't seem
to find any documentation on what other options there are for the
configuration of a collection - if someone could point me to some I'd
appreciate it):
 String collectionConfig =
            "<collection compressed=\"true\" name=\"" + collectionName +
"\">" +
            "   <filer class=\"org.apache.xindice.core.filer.BTreeFiler\"
gzip=\"true\"/>" +
            "</collection>";

Is there something else that I can do to get the size of the database down?
Am I missing something really obvious, like a call to optimize or compress
the data somewhere?

This just won't scale for me with its current behavior, so I hope I'm doing
something wrong.  If I'm not, does anyone have any suggestions for an XML
database that scales better?  

Thanks, 

Dan

Reply via email to