Re: [analog-help] Search query from images.google.com

2005-02-25 Thread Aengus
On Friday, February 25, 2005 3:36 PM [GMT],
Hugh Morris [EMAIL PROTECTED] wrote:

 Could someone tell me if it is possible to get the search query from
 requests referred from images.google.com (and the other image search
 engines).

 The request strings are a bit untidy, like:


http://images.google.com/imgres?imgurl=http://www.washto2004.org/net/external/images/sunday/BNSF%2520PHOTOS/MOUNTAIN%2520PEAK.JPGimgrefurl=http://www.washto2004.org/net/external/images/sunday/BNSF%2520PHOTOS/h=1200w=1600sz=696tbnid=A3gVV6ed4DkJ:tbnh=112tbnw=149start=1prev=/images%3Fq%3Dmountain%2Bpeak%26hl%3Den%26lr%3D%26sa%3DG

Your search string is in the last part of that referring URL -
prev=/images%3Fq%3Dmountain%2Bpeak%26hl%3Den%26lr%3D%26sa%3DG

That's an escaped version of the original quesrystring -
prev=/images?q=mountain+peakhl=enlr=sa=G

If I add
SEARCHENGINE http://images.google.com/* prev
to my analog.cfg file, then I get this entry in my Search Query Report:
1  /images?q=mountain peakhl=enlr=sa=g

And I get these 2 entries in my Search Word Report:
1  peakhl=enlr=sa=g
1  /images?q=mountain

I don't know if you could clean the results up any more than that
without preprocessing.

Aengus

+
|  TO UNSUBSCRIBE from this list:
|http://lists.meer.net/mailman/listinfo/analog-help
|
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
+


Re: [analog-help] Search query from images.google.com

2005-02-25 Thread Hugh Morris

On 2005-02-25 16:13 Aengus wrote:
On Friday, February 25, 2005 3:36 PM [GMT], Hugh Morris
[EMAIL PROTECTED] wrote:

Could someone tell me if it is possible to get the search query
from requests referred from images.google.com (and the other image
search engines).
The request strings are a bit untidy, like:

http://images.google.com/imgres?imgurl=http://www.washto2004.org/net/external/images/sunday/BNSF%2520PHOTOS/MOUNTAIN%2520PEAK.JPGimgrefurl=http://www.washto2004.org/net/external/images/sunday/BNSF%2520PHOTOS/h=1200w=1600sz=696tbnid=A3gVV6ed4DkJ:tbnh=112tbnw=149start=1prev=/images%3Fq%3Dmountain%2Bpeak%26hl%3Den%26lr%3D%26sa%3DG
Your search string is in the last part of that referring URL - 
prev=/images%3Fq%3Dmountain%2Bpeak%26hl%3Den%26lr%3D%26sa%3DG

That's an escaped version of the original quesrystring - 
prev=/images?q=mountain+peakhl=enlr=sa=G

If I add SEARCHENGINE http://images.google.com/* prev to my
analog.cfg file, then I get this entry in my Search Query Report: 1
/images?q=mountain peakhl=enlr=sa=g
And I get these 2 entries in my Search Word Report: 1
peakhl=enlr=sa=g 1  /images?q=mountain
I don't know if you could clean the results up any more than that 
without preprocessing.

Aengus
Thanks for that. It was interesting to see the report with that line 
added to the cfg file but it won't be very practical for daily use.

rhetoricalI wonder if it would be difficult to modify analog to deal 
with those encoded query strings?/rhetorical
+
|  TO UNSUBSCRIBE from this list:
|http://lists.meer.net/mailman/listinfo/analog-help
|
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
+