[web2py:34586] Re: Retrieve URLs in redirect chain

LB22 Wed, 04 Nov 2009 08:23:36 -0800

Hi,

OK to be a little less ambiguous (apologies for that), the redirects
of the urls I'm probing happen within scripts, like those used in
affiliate links, or url shorteners. But it just so happens that when
say a user clicks a link, they don't only get redirected from point A
to point B, but actually get sent through point A.1, point A.2, and
then point A.3.


The purpose of this is to gather the domains to determine which ones
to add to a whitelist.

Thanks

On Nov 4, 4:05 pm, mdipierro <mdipie...@cs.depaul.edu> wrote:
> what do you mean by "masked". Do you have to parse pages or is this
> done via HTTP 303/307?
>
> On Nov 4, 9:54 am, LB22 <latn.bl...@googlemail.com> wrote:
>
>
>
> > Hi Everyone,
>
> > I'm not sure this is strictly a web2py question, but I figured I'd
> > pose the question anyway, so here it goes.
>
> > I had a task of going through several hundred urls to find out where
> > they lead, so I used URLLIB2 in a web2py app to get and display the
> > end destinations of the "masked" links. This part was easy.
>
> > The problem is that many of the links hop through more than one URL
> > before reaching the destination. So now I'm trying to retrieve the
> > URLs that are in the redirect chain. I'm not quite sure how to do
> > this.
>
> > In a nutshell, I want to get each requested URL in the journey to the
> > end URL, given the start URL.
>
> > Does anyone know how to achieve this?
>
> > Thanks
>
> > LB22
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"web2py-users" group.
To post to this group, send email to web2py@googlegroups.com
To unsubscribe from this group, send email to 
web2py+unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/web2py?hl=en
-~----------~----~----~----~------~----~------~--~---

[web2py:34586] Re: Retrieve URLs in redirect chain

Reply via email to