On Thu, Nov 17, 2011 at 1:02 PM, Philip Martin
<philip.mar...@wandisco.com> wrote:
> Attila Nagy <b...@fsn.hu> writes:
>
>> On 11/16/11 18:40, Philip Martin wrote:
>>> Attila Nagy<b...@fsn.hu>  writes:
>>>
>>>> I use pysvn for this and basically the code looks like this (in python):
>>>> def update_perms():
>>>>      for path in propchg:
>>>>          proplist = svn.propget('file:permissions', path)
>>>>          if not os.path.islink(path) and proplist.has_key(path):
>>>>              set_perms(path, proplist[path])
>>>> svn.update(walkroot)
>>>> update_perms()
>>>>
>>>> The svn update collects the changed entries (propchg) and update_perms
>>>> iterates on them and gets their file:permissions property and sets it
>>>> in the file system.
>>>>
>>>> And this is what takes ages (literally), compared to 1.6.
>>>> Any ideas about what could be done in this topic?
>>>
>>> It might be faster to run a recursive propget, which is a single
>>> transaction, and discard the output if it doesn't match one of the
>>> changed paths.
>>>
>> I will try this. Should this be true even for 10+ million files?
>
> It depends on the ratio of changed files to total files.  If there is
> only one changed file then the single propget will be faster.  If most
> of the files are changed then the recursive propget will be faster.

Yes, you'll need to test that a bit.

I do something similar here with a script that fetches all the log
entries for merged revisions (cherrypick merges, which I look up with
'svn mergeinfo --show-revs merged'): if the number of merged revisions
is low relative to their range, I perform a series of individual 'svn
log' requests. Otherwise, I do a single 'svn log -r<min>:<max>'
request, parse the output and discard the entries that are not
relevant. This made my script much faster in most cases.

This has nothing to do with recursive propget (or even with 1.7, I'm
using this in a 1.5 environment), but I'm just noting the similarity
of the problem here.

But for your propget problem, there may also be another option: 'svn
propget' can take multiple targets:

    usage: 1. propget PROPNAME [TARGET[@REV]...]

So if you collect the targets in a list, you can get the exact
information you want with a single propget.

That said, I'm not sure that 'svn propget file:permissions A B C' does
all its work in a single transaction, rather than using a transaction
per target (in which case this technique won't buy you much). Maybe
you can quickly verify this if it makes any performance impact  ...

-- 
Johan

Reply via email to