Yes, but this pruning could be more efficient. On a background
thread, get current segment from segments file, call the system wide
sync ( e.g. System.exec("fsync"), then you can purge the transaction
logs for all segments up to that one. Since it is a background
operation, you are not blocking the writing of new segments and tx logs.
On Feb 6, 2008, at 4:42 PM, Michael McCandless wrote:
robert engels wrote:
Do we have any way of determining if a segment is definitely OK/
VALID ?
The only way I know is the CheckIndex tool, and it's rather slow (and
it's not clear that it always catches all corruption).
If so, a much more efficient transactional system could be developed.
Serialize the updates to a log file. Sync the log. Update the
lucene index WITHOUT any sync. Log file writing/sync is VERY
efficient since it is sequential, and a single file.
Upon open of the index, detect if index was not shutdown cleanly.
If so, determine the last valid segment, delete the bad segments,
and then perform the updates (from the log file) since the last
valid segment was written.
The detection could be a VERY slow operation, but this is ok,
since it should be rare, and then you will only pay this price on
the rare occasion, not on every update.
Wouldn't you still need to sync periodically, so you can prune the
transaction log? Else your transaction log is growing as fast as the
index? (You've doubled disk usage).
Mike
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]