Hi,
we are trying to extract all URLs in wiki articles from our Mediawiki
installation. We have tried Grep, Perl and Sed on mysql dumps, but it
is very difficult to get the URLs only, without some
garbage/text/comments before or after them.

Does anyone know of a better way to achieve this?

Thanks,
Andi

_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to