Hi folks,

Thanks for all the help and hits -- much appreciated.

I ended up writing a few scripts myself that cleaned up
both the pages and users. It should do for now.

Please let me know if you see any problems with the
wiki, I hope I wasn't over-eager when cleaning up spam :)))

Cheers,
Dimi.

On 01/15/2013 11:10 AM, Kyle Auble wrote:
On Tue, Jan 15, 2013 at 1:06 PM, Dimi Paun wrote:
Thanks everyone for your help!
I'll take down the Pages spreadsheet.
Now, what about the users? Those are files (not directories) so we don't
face
the same low limit (32k), but it would be nice if we could, somehow, cleanup
those files as well.
If I'm remembering right, a full install of Moinmoin (not just running the
service portably in the unpacked tree) puts a moin command into /usr/bin.
The documentation for it isn't great yet, but you can find it at
http://master.moinmo.in/HelpOnMoinCommand.

Unfortunately, it doesn't have a mechanism for cleaning out users beyond
obvious duplicate accounts. One possibility that I was looking at is that
v1.5 of Moinmoin updates ".trail" files for all logged-in users, even if
the page trail display has been disabled.

The idea was to scan the user directory for all .trail files with a mod-time
older than a certain time period (I picked 1 year). If a user has logged in
to do anything more recently than then, it should show up in the mod-time of
the .trail file.

I wanted to test my scripts a little more, but this was one thing my
sweep-once script at https://bitbucket.org/kauble/moin-admin was designed
to do. Besides blanking-out and putting "file.new" instead of "file.tmp" in
line 96 of split-logs.pl, the logic seemed sound on small test batches. I
wanted to try it on a full copy of the Wine Wiki just to be safe though.

On Tue, Jan 15, 2013 at 4:40 AM, André Hentschel wrote:
This should also speed up that old wiki and maybe helps upgrading it (hopefully 
that'll happen soon :D).
I haven't touched a line of code in a couple of months (had a holiday job that
really knocked the wind from my sails at times), but after getting settled into
my classes over the next few days, I plan on working on moving the wiki to
v1.9 of Moinmoin again.

The one thing that would probably help a lot is if there was a regularly
updated tarball of the wiki content either at WineHQ or Lattica's FTP again. I
haven't messed with cron itself much, but my archive.cron script should pack
up the files correctly. The main complication is that the user dir probably
should be shared on a need-to-know basis because it contains weakly-hashed
password info.

Kyle


--
Dimi Paun
Lattica, Inc.



Reply via email to