Thorsten:

> As the file content is very trivial like "2348246864;PCINIT2" and can be
>  sorted, I think of something like stepping through the files line by line
>  and comparing the line content

REBOL.org uses a compare utility to let script owners see changes between 
different versions of a script they've contributed to the Library. It is not 
currently publicly available.

I wrote it, and I was careful to avoid recursion, as REBOL has a small 
stack for that. That limited the possible approaches.

As scripts are line-oriented things, I used the approach you suggest: sort 
both files, use a merge/compare to remove matching lines. On the rest, apply 
some heuristics to distinguish inserts and deletions from block moves.
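
To make the sort-then-merge step concrete, here is a minimal sketch in Python 
(not REBOL, and not the REBOL.org utility itself, which is not public). The 
function name and the sample records other than the one quoted above are 
made up for illustration. It uses a plain loop, so no recursion is involved. 
The heuristics for telling inserts and deletions from block moves are not 
shown, as they are not described here.

```python
def unmatched_lines(old_lines, new_lines):
    """Return (only_in_old, only_in_new) after cancelling matching lines.

    Hypothetical helper: sorts both inputs, then walks them in step,
    dropping one line from each side whenever the two are equal.
    """
    old_sorted = sorted(old_lines)
    new_sorted = sorted(new_lines)
    only_old, only_new = [], []
    i = j = 0
    # Iterative merge: advance whichever side holds the smaller line.
    while i < len(old_sorted) and j < len(new_sorted):
        a, b = old_sorted[i], new_sorted[j]
        if a == b:
            i += 1          # matched pair: discard both
            j += 1
        elif a < b:
            only_old.append(a)   # present only in the old version
            i += 1
        else:
            only_new.append(b)   # present only in the new version
            j += 1
    # Whatever is left on either side has no partner.
    only_old.extend(old_sorted[i:])
    only_new.extend(new_sorted[j:])
    return only_old, only_new


# Made-up sample data in the same shape as the quoted record.
old = ["2348246864;PCINIT2", "1000000001;PCINIT1", "3000000003;PCINIT3"]
new = ["3000000003;PCINIT3", "2348246864;PCINIT2", "4000000004;PCINIT4"]
removed, added = unmatched_lines(old, new)
```

Identical lines cancel one for one, so a line that appears twice in the old 
version and once in the new one is left over once on the old side.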

It works best where there are not a large number of identical lines, which is 
why, by default, it ignores blank lines when comparing. As a happy 
side effect, that is usually what you'd want when comparing source code 
versions.
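
As an illustration of the blank-line default, here is a small Python sketch 
(an assumption about how such a filter might look, not the utility's actual 
code). The hypothetical helper keeps each surviving line's original line 
number so that differences can still be reported against the real file.

```python
def significant_lines(lines):
    """Return (line_number, text) pairs for the non-blank lines only.

    Hypothetical helper: a line counts as blank if it is empty or
    contains nothing but whitespace. Line numbers start at 1.
    """
    return [
        (number, text)
        for number, text in enumerate(lines, start=1)
        if text.strip()
    ]


kept = significant_lines(["a: 1", "", "   ", "b: 2"])
```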

It wouldn't work on files of the size you have targeted, as it acts on blocks 
in memory. But as most of its processes involve running up and down blocks 
flagging things, there is no reason in principle why the same logic couldn't 
work on files, given that read/direct works.
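
A rough sketch of the same merge logic applied to files rather than 
in-memory blocks, again in Python rather than REBOL. It assumes both files 
are already sorted in plain string order (an assumption; large files would 
need sorting first) and reads one line at a time, so memory use does not 
grow with file size. The function name and the "-"/"+" tags are invented for 
the example.

```python
def stream_unmatched(old_path, new_path):
    """Yield ("-", line) for lines only in the old file and
    ("+", line) for lines only in the new file.

    Assumes both files are sorted in plain string order.
    """
    with open(old_path) as old_file, open(new_path) as new_file:
        a = old_file.readline()
        b = new_file.readline()
        # readline() returns "" only at end of file; a blank line is "\n".
        while a and b:
            x, y = a.rstrip("\n"), b.rstrip("\n")
            if x == y:
                # Same line in both files: skip it on both sides.
                a = old_file.readline()
                b = new_file.readline()
            elif x < y:
                yield ("-", x)
                a = old_file.readline()
            else:
                yield ("+", y)
                b = new_file.readline()
        # Drain whichever file still has lines left.
        while a:
            yield ("-", a.rstrip("\n"))
            a = old_file.readline()
        while b:
            yield ("+", b.rstrip("\n"))
            b = new_file.readline()
```

Because it is a generator, the differences can be written out as they are 
found instead of being collected in a list first.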

Is that enough hints for now?

Sunanda.