jayvdb created this task.
jayvdb added a subscriber: jayvdb.
jayvdb added a project: Pywikibot-weblinkchecker.py.
Herald added subscribers: pywikibot-bugs-list, StudiesWorld, Aklapper.

TASK DESCRIPTION
  weblinkchecker has an ignore list, which includes web archives and rfc2606 
domains, but also includes several other websites and the reasons are not clear 
or current.  At least one entry is invalid.
  
  ```
      re.compile(r'.*[\./@]berlinonline\.de(/.*)?'),
      # above entry to be manually fixed per request at 
[[de:Benutzer:BLueFiSH.as/BZ]]
  
      # bot somehow can't handle their redirects:
      re.compile(r'.*[\./@]gso\.gbv\.de(/.*)?'),
  
      # bot can't handle their redirects:
      re.compile(r'.*[\./@]bodo\.kommune\.no(/.*)?'),
      re.compile(r'.*[\./@]jpl\.nasa\.gov(/.*)?'),  # bot rejected on the site
      re.compile(r'.*[\./@]itis\.gov(/.*)?'),  # bot rejected on the site
  ```
  
  See {T57282}:
  
  ```
      re.compile(r'.*[\./@]cev\.lu(/.*)?'),  # bot rejected on the site
  ```
  
  ```
      # very slow response resulting in bot error:
      re.compile(r'.*[\./@]science\.ksc\.nasa\.gov(/.*)?'),
  ```
  
  4db982d:
  ```
      # HTTP redirect loop
      re.compile(r'.*[\./@]britannica\.com(/.*)?'), 
  
      # bot rejected on the site:
      re.compile(r'.*[\./@]quickfacts\.census\.gov(/.*)?'),
  ```

TASK DETAIL
  https://phabricator.wikimedia.org/T124015

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: jayvdb
Cc: Aklapper, StudiesWorld, jayvdb, pywikibot-bugs-list



_______________________________________________
pywikibot-bugs mailing list
pywikibot-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/pywikibot-bugs

Reply via email to