On 1 November 2012 14:18, Tim Tomes <[email protected]> wrote:
> Are you trying to do what ewhois.com does with the analytics and adsense IDs?

Kind of, but in a way where you hold the database and tell it which
sites to index so you are not limited to just what the online sites
have pre-crawled.

Robin

> I was trying to script the same thing. However, parsing all of that
> data was a pain considering a developer can implement it several
> different ways. Sites like ewhois.com must have access to some sort of
> API for collecting that data. Let me know how it goes, because if you
> succeed, I'd like to bring you in on a project I am working on.
>
>
> On Thu, Nov 1, 2012 at 5:36 AM, Robin Wood <[email protected]> wrote:
>> I'm building a tool to scrape websites and pull out tracking codes so
>> I can see which sites are related based on who is tracking them.
>>
>> Google codes are good for this as they identify the tracker not the
>> site, Woopra tracking identifies the domain not the tracker so there
>> is no way back to the person/group tracking the site. What other web
>> tracking systems are out there which can be used to identify the
>> tracker rather than the site?
>>
>> In case that doesn't make sense, this is Woopra code:
>>
>> function woopraReady(tracker){
>>     tracker.setDomain('yourdomain.com');
>>     tracker.setIdleTimeout(300000);
>>     tracker.track();
>> }
>>
>> which identifies "yourdomain.com" but this is google
>>
>> try {
>> var pageTracker = _gat._getTracker("UA-7503551-1");
>> pageTracker._trackPageview();
>> } catch(err) {}
>>
>> which identifies the tracker.
>>
>> Robin
>> _______________________________________________
>> Pauldotcom mailing list
>> [email protected]
>> http://mail.pauldotcom.com/cgi-bin/mailman/listinfo/pauldotcom
>> Main Web Site: http://pauldotcom.com
>
>
>
> --
> Tim Tomes
> http://lanmaster53.com/
> _______________________________________________
> Pauldotcom mailing list
> [email protected]
> http://mail.pauldotcom.com/cgi-bin/mailman/listinfo/pauldotcom
> Main Web Site: http://pauldotcom.com
_______________________________________________
Pauldotcom mailing list
[email protected]
http://mail.pauldotcom.com/cgi-bin/mailman/listinfo/pauldotcom
Main Web Site: http://pauldotcom.com

Reply via email to