Voytek,

The standard answer to many questions in this list is "Run analog with
settings on" which in your cas will probably show an include or exclude
statement which is causing the difficulty. I've run analog for a while now,
and all the problems I have had have come down to me configuring it wrongly.
So, you can either run analog -settings ...+other parameters you use or
start with a stripped down config file. Don't forget that analog will always
read the default config file (which may have include / exclude parms) unless
you tell it not to (-G on command line)

Good luck

Simon West

-----Original Message-----
From: Voytek Eymont [mailto:[EMAIL PROTECTED]]
Sent: 24 March 2001 00:42
To: [EMAIL PROTECTED]
Subject: [analog-help] A4/2: says 'unwanted', but, I want them


I've been running A for a year or longer on some virtual sites
we host, first on LGW, now, on Apache.

Originally had A ver 3 something, now, version 4, though, still use
definitions from A v 3

One of my web site has been subject to somewhat incread usage as of last
week; the daily Apache logs jumped to, on one particular day, to 100MB
log, and, it now at about 30MB/day, with about 180,000 lines per log. 

This analysis was produced by analog4.0/OS2. 

but, I seem to be getting zero, or very low hits out of the Analog.
though, I get a lot of unwanted lines...

what is going wrong ?

test CFG used to process a single file:

# Configuration file for analog 3.3
# See http://www.statslab.cam.ac.uk/~sret1/analog/
# See http://www.sbt.net.au/analog/docs/
#
# Auto-generated by \analog\ana2.cmd on 25 Oct 1999 16:48:18
# Auto-generated configuration file, DO NOT EDIT THIS FILE
debug on
LOGFILE \users\race.com.au\logs\www.race.com.au.13-03-2001*
OUTFILE \users\race.com.au\web\analog\index.html
#
# Auto-generated configuration file ends here 


file: www.race.com.12-03-2001
lines: 171675
web site URL string is present in 160442 lines

A processing GZiped log says:

F: Closing logfile \users\race.com\logs\www.race.com.12-03-2001.gz
S: Successful requests: 16535
S: Redirected requests: 0
S: Failed requests: 639
S: Requests returning informational status code: 0
S: Status code not given: 0
S: Unwanted lines: 154495
S: Corrupt lines: 6
S: Earliest entry in logfile: 12/Mar/01:0000
S: Latest entry in logfile: 12/Mar/01:1653
F: Opening \users\race.com\web\analog\index.html as output file
F: Closing \users\race.com\web\analog\index.html

re-process after external un-Gzipping

F: Closing logfile \users\race.com\logs\www.race.com.12-03-2001
S: Successful requests: 16535
S: Redirected requests: 0
S: Failed requests: 639
S: Requests returning informational status code: 0
S: Status code not given: 0
S: Unwanted lines: 154495
S: Corrupt lines: 6
S: Earliest entry in logfile: 12/Mar/01:0000
S: Latest entry in logfile: 12/Mar/01:1653
F: Opening \users\race.com\web\analog\index.html as output file
F: Closing \users\race.com\web\analog\index.html

next file:

F: Closing configuration file air.cfg
F: Opening \analog\analog3.3\lang\us.lng as language file
F: Closing language file \analog\analog3.3\lang\us.lng
F: Opening \analog\analog3.3\domains.tab as domains file
F: Closing domains file \analog\analog3.3\domains.tab
F: Opening \users\race.com\logs\www.race.com.13-03-2001.gz as logfil
e
F:   Using gzip -c -d to uncompress it
F: Closing logfile \users\race.com\logs\www.race.com.13-03-2001.gz
S: Successful requests: 0
S: Redirected requests: 0
S: Failed requests: 0
S: Requests returning informational status code: 0
S: Status code not given: 0
S: Unwanted lines: 0
S: Corrupt lines: 0
F: Opening \users\race.com\web\analog\index.html as output file
F: Closing \users\race.com\web\analog\index.html

re-process

F: Closing configuration file air.cfg
F: Opening \analog\analog3.3\lang\us.lng as language file
F: Closing language file \analog\analog3.3\lang\us.lng
F: Opening \analog\analog3.3\domains.tab as domains file
F: Closing domains file \analog\analog3.3\domains.tab
F: Opening \users\race.com\logs\www.race.com.13-03-2001 as logfile
F: Closing logfile \users\race.com\logs\www.race.com.13-03-2001
S: Successful requests: 0
S: Redirected requests: 0
S: Failed requests: 0
S: Requests returning informational status code: 0
S: Status code not given: 0
S: Unwanted lines: 0
S: Corrupt lines: 0
F: Opening \users\race.com\web\analog\index.html as output file
F: Closing \users\race.com\web\analog\index.html

Analog say 'zilch', but, this one has 180,000 lines:
wc   www.race.com.13-03-2001

180130 3149495 33682423 www.race.com.13-03-2001

the url string is present in: 162686

20/03/01   4:56   32977553           0  www.race.com.12-03-2001
20/03/01   6:32   33862553           0  www.race.com.13-03-2001

I have another 10 or 12 logs with similar predicament......

PS: the URL above is NOT the real URL



Voytek Eymont
SBT Information Systems Pty Ltd
http://www.sbt.net.au/links/
phone +61-2 9310-1144 fax +61-2 9310-1118 

------------------------------------------------------------------------
This is the analog-help mailing list. To unsubscribe from this
mailing list, send mail to [EMAIL PROTECTED]
with "unsubscribe" in the main BODY OF THE MESSAGE.
List archived at http://www.mail-archive.com/[email protected]/
------------------------------------------------------------------------
------------------------------------------------------------------------
This is the analog-help mailing list. To unsubscribe from this
mailing list, send mail to [EMAIL PROTECTED]
with "unsubscribe" in the main BODY OF THE MESSAGE.
List archived at http://www.mail-archive.com/[email protected]/
------------------------------------------------------------------------

Reply via email to