I am not sure whether this is of interest beyond Bloodhound. It is a very simple script (though quite possibly more complicated than it needs to be).

The attachment referred to is here: https://issues.apache.org/bloodhound/attachment/ticket/156/emptyrevs.py

Just tested it on 1229640 empty commits:

   python3 emptyrevs.py 1229640 | eatmydata svnadmin --bypass-prop-validation load repos/

which took about 2 hours. I suspect that the svnadmin load was the real bottleneck: creating the roughly 92M of data takes only a few seconds if you redirect the output to a file instead.
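
For anyone who cannot get at the attachment, here is a minimal sketch of the general idea: writing a Subversion dump stream that contains nothing but empty revisions. It is not the attached emptyrevs.py. The function names (empty_revision, empty_dump) are made up for illustration, the records assume dump format version 2 with an empty property block per revision, and I have not run this particular sketch through svnadmin load.

```python
import sys

# An empty property block in the dump format is just the terminator line.
PROPS = "PROPS-END\n"


def empty_revision(number):
    """Return a dump record for a revision with no properties and no changes.

    Hypothetical helper for illustration; not taken from emptyrevs.py.
    """
    return (
        "Revision-number: %d\n"
        "Prop-content-length: %d\n"
        "Content-length: %d\n"
        "\n"
        "%s"
        "\n" % (number, len(PROPS), len(PROPS), PROPS)
    )


def empty_dump(count):
    """Yield the stream header followed by `count` empty revision records."""
    yield "SVN-fs-dump-format-version: 2\n\n"
    for number in range(1, count + 1):
        yield empty_revision(number)


def write_dump(count, out=sys.stdout):
    """Write the whole stream to `out` (stdout by default)."""
    for chunk in empty_dump(count):
        out.write(chunk)
```

Calling write_dump(1229640) from a small wrapper and piping the result would mirror the command above. Each record in this sketch is about 80 bytes, which is at least in the same range as the 92M I saw for 1229640 revisions.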

Cheers,
    Gary


On 08/07/2012 06:47 PM, Apache Bloodhound wrote:
#156: Local copy of bloodhound part of Apache repo for browse functionality
------------------------+-----------------------
   Reporter:  gjm        |      Owner:  nobody
       Type:  task       |     Status:  new
   Priority:  critical   |  Milestone:  Release 2
  Component:  siteadmin  |    Version:
Resolution:             |   Keywords:
------------------------+-----------------------

Comment (by gjm):

  As part of my investigation around creating a mirror of the bloodhound
  portion, I have written a very simple script (only complicated when I
  decided to have a quick look at the argparse module and python2/3 issues),
  inspired by a suggestion from Philip Martin.

  I have attached that script [attachment:emptyrevs.py here]. It may be
  worth putting in the bloodhound repository in case we need it again of
  course.

