Re: For those ready to take the challenge

via Digitalmars-d-learn Sat, 10 Jan 2015 04:25:31 -0800

On Friday, 9 January 2015 at 17:18:43 UTC, Adam D. Ruppe wrote:

Huh, looking at the answers on the website, they're mostlyusing regular expressions. Weaksauce. And wrong - they don'tfind ALL the links, they find the absolute HTTP urls!

Yeah... Surprising, since languages like python includes a HTMLparser in the standard library.

Besides, if you want all resource links you have to do a lotbetter, since the following attributes can contain resourceaddresses: href, src, data, cite, xlink:href…

You also need to do entity expansion since the links can containhtml entities like "&".


Depressing.

Re: For those ready to take the challenge

Reply via email to