Laura Arjona Reina <larj...@debian.org> writes: > Package: www.debian.org > User: www.debian....@packages.debian.org > Usertag: scripts > Severity: normal > > Hi > > the scripts "urlcheck" generate this log in the /logos folder: > > Looking into http://www.debian.org/logos/openlogo.xcf.gz > Error reading page: http://www.debian.org/logos/openlogo.xcf.gz > Looking into http://www.debian.org/logos/officiallogo.xcf.gz > Error reading page: http://www.debian.org/logos/officiallogo.xcf.gz > Looking into http://www.debian.org/logos/officiallogo-nd.xcf.gz > Error reading page: http://www.debian.org/logos/officiallogo-nd.xcf.gz > > I guess this means it tries to parse the xcf.gz files and probably we > need to update the script to skip such files (compressed images). > > Anybody familiarised with Python, who can help? > > The code of the script is here: > > https://salsa.debian.org/webmaster-team/cron/-/tree/master/urlcheck > > (I guess the main script, urlcheck.py, is where maybe the fix should be > made). > > The script is called by 3 cron jobs: > > 17 3 * * * cd /srv/www.debian.org/cron/urlcheck && ./run.urlcheck > 36 12 * * * cd /srv/www.debian.org/cron/urlcheck && > ./make.bad_link.pages > 5 13 * * * cd /srv/www.debian.org/cron/urlcheck && ./cleanup.logs > > and the daily logs are here: > https://www-master.debian.org/build-logs/urlcheck/ > (check logos folder).
Hi i did attach simple patch file. It is not best way. But just it works.
--- run.urlcheck.orig 2020-09-05 10:59:55.275539752 +0900 +++ run.urlcheck 2020-09-05 11:02:39.847539762 +0900 @@ -6,7 +6,7 @@ --ignore News/weekly/oldurl --ignore /Lists-Archives --ignore /cgi-bin/fom \ --ignore debian.org/fom --ignore /releases/ --ignore /international/ --ignore /security/ \ --ignore /devel/ --ignore /News/ --ignore /doc/ --ignore /distrib/ \ - --ignore /ports/ --ignore /intl/ \ + --ignore /ports/ --ignore /intl/ --ignore /logos/ \ http://www.debian.org/ >& logs/web.$date & ./urlcheck.py --require www.debian.org/international http://www.debian.org/international/ \ >& logs/web.$date.intl &
Sincerely, Byung-Hee -- ^고맙습니다 _救濟蒼生_ 감사합니다_^))//