Hello,

I'm using a somewhat nifty trick to identify when Google believes a page 
was published. (Yes, I get this is not *exactly* what is meant by that 
date, but it's close enough -- and it's what I'm after.) If I search for 
allinurl:http://www.whatever.com and add &as_qdr=y15 to the end of the 
executed query string (e.g., in the address bar), Google prefixes the 
content section on the results page -- the characters immediately beneath 
the green URL -- with the date I mention above.
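
To make the trick concrete, here is a minimal Python sketch that only assembles the query URL described above; it does not send any request. The function name is hypothetical, and the "/search" path and the "q" parameter name are assumptions about the address-bar URL rather than anything documented here.

```python
from urllib.parse import urlencode


def build_dated_search_url(page_url, years=15):
    """Build (but do not fetch) a search URL for allinurl:<page_url>.

    ASSUMPTION: the results page lives at /search and takes the query in "q".
    The as_qdr=y<years> parameter is the one described in the text above.
    """
    params = {
        "q": f"allinurl:{page_url}",
        "as_qdr": f"y{years}",
    }
    # urlencode percent-encodes ":" and "/" inside the query value.
    return "http://www.google.com/search?" + urlencode(params)


if __name__ == "__main__":
    print(build_dated_search_url("http://www.whatever.com"))
```

Pasting the printed URL into a browser should be equivalent to running the search by hand and then appending &as_qdr=y15.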

Now this date often appears automatically if the date is (obviously) 
included in the post. For example, a search for a New York Times article 
using "allinurl:" needs no as_qdr=y[xx] to force Google to output the date 
-- it's just there.
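
When the date is present, pulling it off the front of the snippet text could look like the sketch below. This is only an illustration: it assumes the date appears as "Mon D, YYYY" (e.g., "Jan 5, 2012") at the very start of the snippet, which is a guess at the format, and it relies on English month abbreviations being parsed by the current locale. The function name is hypothetical.

```python
import re
from datetime import datetime

# ASSUMPTION: the snippet starts with a date such as "Jan 5, 2012".
_LEADING_DATE = re.compile(r"^\s*([A-Z][a-z]{2} \d{1,2}, \d{4})\b")


def extract_leading_date(snippet):
    """Return the leading date of a snippet as a datetime.date, or None."""
    match = _LEADING_DATE.match(snippet)
    if match is None:
        return None
    try:
        # %b parses abbreviated month names in the current locale.
        return datetime.strptime(match.group(1), "%b %d, %Y").date()
    except ValueError:
        # Looked like a date but was not one (e.g., an unknown month name).
        return None


if __name__ == "__main__":
    print(extract_leading_date("Jan 5, 2012 ... Some article text"))
    print(extract_leading_date("Some article text with no date"))
```

Snippets without a recognizable leading date return None, so they can be set aside for a second pass.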

What I'd like to do is programmatically extract this date for hundreds of 
URLs I have in a database. To do so, I've considered using the deprecated 
"AJAX API" (e.g., sending a GET request to 
"http://ajax.googleapis.com/ajax/services/search/web?v=1.0&q=allinurl:http://www.whatever.com") 
and appending "&as_qdr=...". 

However, it's been documented that this API simply ignores such query 
modifiers. Instead, Google (and other forum posters) suggest that you 
include "daterange:x-y" in the query, where x is the beginning date 
and y is the ending date (both given as Julian day numbers, not Gregorian 
calendar dates). For example, they suggest simply appending 
"%20daterange:x-y" rather than adding another parameter with "&...". 
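
For reference, the daterange approach can be sketched in Python as follows. The helper names are hypothetical. The conversion relies on the fact that Python's proleptic Gregorian ordinal plus 1721425 gives the integer Julian day number (January 1, 2000 is 2451545). The endpoint is the one quoted above, and the sketch only builds the URL without calling it.

```python
from datetime import date
from urllib.parse import quote

# date.toordinal() counts days from 0001-01-01 (ordinal 1);
# adding this offset yields the integer Julian day number.
JDN_OFFSET = 1721425

AJAX_ENDPOINT = "http://ajax.googleapis.com/ajax/services/search/web?v=1.0&q="


def julian_day_number(d):
    """Convert a Gregorian calendar date to its integer Julian day number."""
    return d.toordinal() + JDN_OFFSET


def daterange_term(start, end):
    """Format the daterange:x-y query term for two Gregorian dates."""
    return f"daterange:{julian_day_number(start)}-{julian_day_number(end)}"


def build_ajax_query_url(page_url, start, end):
    """Build the request URL with daterange appended inside the q value."""
    query = f"allinurl:{page_url} {daterange_term(start, end)}"
    # Keep ":" and "/" readable; the space becomes %20 as suggested above.
    return AJAX_ENDPOINT + quote(query, safe=":/")


if __name__ == "__main__":
    print(build_ajax_query_url(
        "http://www.whatever.com", date(2000, 1, 1), date(2012, 1, 1)
    ))
```

Note that the daterange term travels inside the q value, separated from the allinurl: term by an encoded space, instead of being passed as its own "&" parameter.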

*While this works in filtering the results to the specified dates, it does 
NOT force the publish/index/whatever date to be prepended to the "content" 
field in the returned JSON.*

I'm stuck and thus asking for help. At this point the only thing I can 
think to do is programmatically make calls to http://www.google.com as if I 
were a browser, but this is of course against Google's ToS and likely to be 
(automatically) prohibited unless I take all sorts of care to truly emulate 
a human searcher.

So, kind forum readers -- any ideas?

Thanks,
Patrick

-- 
-- 
You received this message because you are subscribed to the Google
Groups "Google AJAX APIs" group.
To post to this group, send email to
[email protected]
To unsubscribe from this group, send email to
[email protected]
To view this message on the web, visit
https://groups.google.com/d/msg/google-ajax-search-api/-/NcpQdgODMWsJ
For more options, visit this group at
http://groups.google.com/group/google-ajax-search-api?hl=en?hl=en