Miernik wrote:
> Can someone who uses htdig with wwwoffle can tell me for example how
> much disk space the htdig index etc files take for a corresponding
> wwwoffle cache size?
>
> My wwwoffle cache is 1.85 GB and I have only 0.35 MB of free disk
> space, and I wonder if I need to get more free disk space before
> starting the indexing, or will it be fine? I never ran htdig before, so
> I don't have an estimate how much it might be.

My cache is 5.6 GB and the htdig database is 460 MB, so the database is about 8% of the cache size.

But I don't know the ratio of text to non-text pages in my cache, and the htdig database only covers text pages. If the cache holds mostly pictures, videos or other binary data, the database will be smaller than it would be for a cache of mostly text pages.

Furthermore, it seems to me that the database only grows and never shrinks on updates, even if the cache size is reduced. I also don't know how much disk space is used temporarily while creating or updating the database.

Finally, you need a fast machine to index a 1.85 GB cache. IIRC it took nearly 1 day for my cache when I moved it to the new computer a few months ago.

Nils
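
For a rough feel of what the 8% figure would mean for a 1.85 GB cache, here is a minimal sketch of the arithmetic. It assumes the database scales linearly with cache size at the same ratio as my cache (5.6 GB cache, 460 MB database), which may well not hold if the mix of text and binary pages differs. The function name is only for illustration, and the result does not include any temporary space used while indexing.

```python
def estimate_index_mb(cache_gb, ref_cache_gb=5.6, ref_index_mb=460.0):
    """Scale a reference index size linearly to another cache size.

    Assumption: index size is proportional to cache size. The defaults
    are the figures from this message; the real ratio depends on how
    much of the cache is text.
    """
    return ref_index_mb * cache_gb / ref_cache_gb


if __name__ == "__main__":
    estimate = estimate_index_mb(1.85)
    print(f"Estimated htdig database size: about {estimate:.0f} MB")
```

Under that assumption the estimate comes out at roughly 150 MB, so treat it as a ballpark figure and leave extra room for temporary files.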
