RFC1738 - Uniform Resource Locators (URL)
http://www.faqs.org/rfcs/rfc1738.html

RFC1808 - Relative Uniform Resource Locators
http://www.faqs.org/rfcs/rfc1808.html

> Hi, I'm using httpcli to save a webpage html doc and I extract all of
> it's image locations to a text file by saving the '<IMG SRC=' tags.
> Afterward I want to download all of the images, but how can I determine
> the TRUE location of the images? For example, say the image tag is:
> '<IMG SRC='test.com/photo.jpg'' - for all I know, "test.com" could just
> be a directory on the server or it could be the website. Another
> example, say the image tag is: '<IMG SRC='/photo.jpg'' - so the image is
> in the root directory of the website, but who knows what the root
> directory is? It may simply be 'test.com', or if the html doc is located
> in a subdirectory, it may be something like 'test.com/users/me'.
>
> So, what is the appropriate way to determine the actual true location of
> these images from the 'IMG' tags?
>
> Much thanks in advance. 
>   

-- 
To unsubscribe or change your settings for TWSocket mailing list
please goto http://lists.elists.org/cgi-bin/mailman/listinfo/twsocket
Visit our website at http://www.overbyte.be

Reply via email to