[analog-help] help with extracting browser data

2002-01-27 Thread Robert Brenstein

Can someone suggest a working logformat to identify the browser correctly?

Following are examples of typical log entries in a log file (monthly 
log circa 200 mb). I am showing only the end part of each line, the 
part that is problematic:

200 3456 ScoutAbout -
200 657151 Mozilla/3.01 (compatible;) -
200 9010 Mozilla/4.7 [de] (WinNT; I)
200 8502 Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0) Fetch API Request -
200 13722 Mozilla/4.7 [de] (WinNT; I) http://www/
200 11635 Mozilla/4.0 (compatible; MSIE 6.0; Windows 98) 
http://www/zeigen.html?seite=212
200 0 Mozilla/4.0+(compatible;+MSIE+5.5;+Windows+98) 
http://www.dfn.de/links/schule.html
200 27139 Mozilla/4.0 (compatible; MSIE 5.0; Windows 98; DigExt)
200 13435 FAST-WebCrawler/3.3 ([EMAIL PROTECTED];
200 29811 FAST-WebCrawler/3.3 ([EMAIL PROTECTED]; 
http://fast.no/support.php?c=faqs/crawler) -

The number of words identifying browser varies and there are no tabs 
separating the browser info from referred data. I try

%c %b %B %f
%c %b %A %f
%c %b -
%c - -

but this takes only word 1 of the browser info as browser data and 
uses the rest as referrer. I tried

%c %b %B http://%f

but that results in most entries reported as corrupted. The docs are 
somewhat skimpy about dealing with varying information.

Using the AUTO logformat does not work, in case someone wonders. I tried.

Robert
+
|  This is the analog-help mailing list. To unsubscribe from this
|  mailing list, go to
|http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
|  List archives are available at
|http://www.mail-archive.com/analog-help@lists.isite.net/
|http://lists.isite.net/listgate/analog-help/archives/
|http://www.tallylist.com/archives/index.cfm/mlist.7
+



Re: [analog-help] help with extracting browser data

2002-01-27 Thread Stephen Turner

On Mon, 28 Jan 2002, Robert Brenstein wrote:

 Can someone suggest a working logformat to identify the browser correctly?
 
 Following are examples of typical log entries in a log file (monthly 
 log circa 200 mb). I am showing only the end part of each line, the 
 part that is problematic:
 
 200 3456 ScoutAbout -
 200 657151 Mozilla/3.01 (compatible;) -
 200 9010 Mozilla/4.7 [de] (WinNT; I)
 200 8502 Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0) Fetch API Request -
 200 13722 Mozilla/4.7 [de] (WinNT; I) http://www/
 200 11635 Mozilla/4.0 (compatible; MSIE 6.0; Windows 98) 
 http://www/zeigen.html?seite=212
 200 0 Mozilla/4.0+(compatible;+MSIE+5.5;+Windows+98) 
 http://www.dfn.de/links/schule.html
 200 27139 Mozilla/4.0 (compatible; MSIE 5.0; Windows 98; DigExt)
 200 13435 FAST-WebCrawler/3.3 ([EMAIL PROTECTED];
 200 29811 FAST-WebCrawler/3.3 ([EMAIL PROTECTED]; 
 http://fast.no/support.php?c=faqs/crawler) -
 

You can't, because there is no way for analog to know where the browser ends
and the referrer begins. You will have to tell your web server to log in a
more sensible format (delimit the browser with quotes, for example); and if
you need to analyse these logs, you will have to write a preprocessor to put
them in a better format.

-- 
Stephen Turner, Cambridge, UKhttp://homepage.ntlworld.com/adelie/stephen/
This is Henman's 8th Wimbledon, and he's only lost 7 matches. BBC, 2/Jul/01

+
|  This is the analog-help mailing list. To unsubscribe from this
|  mailing list, go to
|http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
|  List archives are available at
|http://www.mail-archive.com/analog-help@lists.isite.net/
|http://lists.isite.net/listgate/analog-help/archives/
|http://www.tallylist.com/archives/index.cfm/mlist.7
+



Re: [analog-help] pulling the logformat from apache

2002-01-27 Thread j.bottoms

I have three logs.  the error log is irrelavent though.  so i have an access
log  and a ssl_engine_log.  but my httpd file calls for only one log file to
be created.
Is my format being broken up into two files is that what is ocurring?

the apache conf file for the logformat is as follows

LogFormat %h %l %u %t \%r\ %s %b \%{Referer}i\ \%
 {User-Agent}i\
 combined
LogFormat %h %l %u %t \%r\ %s %b common
LogFormat %{Referer}i - %U referer
LogFormat %{User-agent}i agent
LogFormat %h %l %u %t \%r\ %s %b \%{Referer}i\ \%
{User-Agent}i\
 %{SSL_PROTOCOL}x %{SSL_CIPHER}x ssl-combined
#CustomLog /usr/local/apache/logs/access_log common
#CustomLog /usr/local/apache/logs/referer_log referer
#CustomLog /usr/local/apache/logs/agent_log agent
#CustomLog /usr/local/apache/logs/ssl_request_log %t %
h %{SSL_PROTOCOL}x
 %{SSL_CIPHER}x \%r\ %b
#CustomLog /usr/local/apache/logs/access_log combined
CustomLog /usr/local/apache/logs/access_log ssl-combined


+
|  This is the analog-help mailing list. To unsubscribe from this
|  mailing list, go to
|http://lists.isite.net/listgate/analog-help/unsubscribe.html
|
|  List archives are available at
|http://www.mail-archive.com/analog-help@lists.isite.net/
|http://lists.isite.net/listgate/analog-help/archives/
|http://www.tallylist.com/archives/index.cfm/mlist.7
+