Well to answer my own question...

My Netscape/4.06 seems to be reported as a user-agent string of
'Mozilla/4.06' and so there is a 'transciption' being done by analog...but
its results seem correct :-)

But just to show where log analysis can take you, looking at the requests I
see they come from private address space and so it seems something on an
internal network is generating these requests. Now the task is to find out
what and why!

As for page counts, my best rationalisation is that the high overnight count
is spiders and so analog is correctly showing page requests that aren't
being recorded by the page view 'bug'. Then over day proxies are causing the
page bug to record higher page views than seen by the servers. It's the best
rationalisation I've come up with so far!

I've had analog dump out the log lines it sees as being corrupt and they do
indeed seem to be truncated and account for about 5% of the log lines which
seems high. Now to understand why the servers would be doing this!

.../Iain



-----Original Message-----
Sent: 21 February 2009 14:03
To: 'Support for analog web log analyzer'
Subject: RE: [analog-help] Problem with page counts

Well Cygwin is a big help, thanks... Only it now raises more questions!

One thing which is odd is that analog is reporting quite high usage of
Netscape 4 which seemed odd and so caused me to look further. So analog
says: 

5 172430        2.00% Netscape 
  170370        1.98%   Netscape/4 
  167023        1.94%     Netscape/4.06 
  3281  0.04%     Netscape/4.0 
  41                        Netscape/4.77 
  3                         Netscape/4.5 
  16                        Netscape/4.76 
  2                         Netscape/4.61 
  1                         Netscape/4.05 
  3                         Netscape/4.7 
  1645  0.02%   Netscape/7 
  1643  0.02%     Netscape/7.2 
  2                         Netscape/7.1 
  414                     Netscape/8 
  371                       Netscape/8.1 
  43                        Netscape/8.1.3 

Most of it seems to be Netscape 4.06 which indeed would be old. So I tried:

grep 'Netscape/' *.log > netscape.log

I then used Excel to summarise netscape.log and come up with...

user-agent      Total
Netscape/7.1    2               [matches analog]
Netscape/7.2    1695            [analog says 1695]
Netscape/8.0.4  5               [missing from analog]
Netscape/8.1    387             [analog says 371]
Netscape/8.1.3  43              [matches analog]
Grand Total             2132            [way off as analog sees lots of
Netscape/4 traffic]

grep does not find any 'Netscape/4' strings at all. Note some counts
correspond: Netscape/8.1.3 is 43 under both counts, Netscape/7.1 is 2 under
both counts.

Is there user-agent signature mapping going on within analog that is
relating some string[s] other than 'Netscape/4' to be Netscape v4 user
agents? These figures will be used to derive browser compatibility tests and
so I'll be challenged on my Netscape 4 figures and so want to be certain :-)

Thx.../Iain


-----Original Message-----
From: analog-help-boun...@lists.meer.net
[mailto:analog-help-boun...@lists.meer.net] On Behalf Of Stephen Turner
Sent: 20 February 2009 20:53
To: Support for analog web log analyzer
Subject: Re: [analog-help] Problem with page counts

2009/2/20 Iain Hunneybell <i...@ipmarketing.co.uk>:
>
> Sadly I have no UNIX host to hand and these are Gig files and so I 
> can't head/tail/grep easily. Windows grep dies... I'll write something 
> to parse the files so I can have a real look at the records...
>

Can you install Cygwin?

--
Stephen Turner
+-----------------------------------------------------------------------
+-
|  TO UNSUBSCRIBE from this list:
|    http://lists.meer.net/mailman/listinfo/analog-help
|
|  Analog Documentation: http://analog.cx/docs/Readme.html  List
| archives:  http://www.analog.cx/docs/mailing.html#listarchives
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
+-----------------------------------------------------------------------
+-


+------------------------------------------------------------------------
|  TO UNSUBSCRIBE from this list:
|    http://lists.meer.net/mailman/listinfo/analog-help
|
|  Analog Documentation: http://analog.cx/docs/Readme.html
|  List archives:  http://www.analog.cx/docs/mailing.html#listarchives
|  Usenet version: news://news.gmane.org/gmane.comp.web.analog.general
+------------------------------------------------------------------------

Reply via email to