hello analog people,

I wrote a couple weeks or so ago asking about how I could parse various
logfile formats, to load into a data warehouse.  I wasn't a very good C
programmer so I picked up Perl to do this, and that helped me understand the
C expression matching logic actually.  My perl script matches on all the
fixed log formats that Stephen matches on, and then writes the following to
a new file :

1.  a header column : for example: Host Ident Authuser TimeDate Protocol
RequestedFile HTTP Code Bytes ReferrerSite Browser

2.  each line of the logfile parsed according the the example fields above

* the script does not match netscape, webstar, or eclf formats

So, like I said I'm not sure how useful other would find this (some might
say its pretty simple).  I was using it as the prelude to loading each field
of data in a logfile, into tables within a data warehouse for historical
storage and clickstream analysis.  Analog is great, but I wanted to keep the
level of data as low as possible, no sums or aggregations needed.

I got sidetracked and stopped working on it the other day and so don't have
any functionality in this program for parsing out SessionIDs from a URL
header (which would be cool) etc.  Email if you would like me to send you a
copy.

Nate

[EMAIL PROTECTED]

------------------------------------------------------------------------
This is the analog-help mailing list. To unsubscribe from this
mailing list, send mail to [EMAIL PROTECTED]
with "unsubscribe" in the main BODY OF THE MESSAGE.
List archived at http://www.mail-archive.com/analog-help@lists.isite.net/
------------------------------------------------------------------------

Reply via email to