D'oh!

You were looking for total (and types) of pages... that's a little more complicated...

First off, to answer your question about why doc_list is not working, I discovered that you have to run htdig with -t to get it to create the file.

Then you'll need to post process the output with Perl or tool of choice.

Ted Stresen-Reuter
Administrator/Webmaster

Clevernet
(Schlegel Ulrich 000420736K S.L.N.E. B35787027)
C/ Cronista Benitez Inglot 7-bajo
35011 Las Palmas de Gran Canaria

Tel.:  011 34 928 28 90 27
Fax:   011 34 928 25 92 37
Cell: 011 34 696 07 25 17
Chicago: 312-239-0810

http://www.clevernet.biz
http://www.tedmasterweb.com

On May 25, 2006, at 7:37 PM, G. T. Stresen-Reuter wrote:

I'm not sure how that config option is intended to work, but you might find this command line option useful: htstat. Here is the output from my htstat:
htstat: Total documents: 119
htstat: Total words: 136384
htstat: Total unique words: 11151

Might be just what the doctor ordered!

Ted Stresen-Reuter
Administrator/Webmaster

Clevernet
(Schlegel Ulrich 000420736K S.L.N.E. B35787027)
C/ Cronista Benitez Inglot 7-bajo
35011 Las Palmas de Gran Canaria

Tel.:  011 34 928 28 90 27
Fax:   011 34 928 25 92 37
Cell: 011 34 696 07 25 17
Chicago: 312-239-0810

http://www.clevernet.biz
http://www.tedmasterweb.com

On May 25, 2006, at 7:25 PM, Hugh Caley wrote:

I set up an htdig installation for our corporate intranet; it works very well and everyone is quite happy. Recently one of the web admins asked me if we could get a count of the numbers of the types of pages (.htm, .asp, etc) that htdig actually crawls.

I added:

doc_list: /tmp/documents.txt

to htdig.conf, and I thought I would just parse that file for the information, but it didn't get generated for some reason. Do I need to do something else to create this file?

Is there a more elegant way to do this sort of thing?

Thanks,

Hugh

-- Hugh Caley | Unix Systems Administrator | CIS
AFFYMETRIX, INC. | 6550 Vallejo St. Ste 100 | Emeryville, CA 94608
Tel: 510-428-8537 | [EMAIL PROTECTED]



-------------------------------------------------------
All the advantages of Linux Managed Hosting--Without the Cost and Risk! Fully trained technicians. The highest number of Red Hat certifications in
the hosting industry. Fanatical Support. Click to learn more
http://sel.as-us.falkag.net/sel? cmd=lnk&kid=107521&bid=248729&dat=121642
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general




-------------------------------------------------------
All the advantages of Linux Managed Hosting--Without the Cost and Risk!
Fully trained technicians. The highest number of Red Hat certifications in
the hosting industry. Fanatical Support. Click to learn more
http://sel.as-us.falkag.net/sel? cmd=lnk&kid=107521&bid=248729&dat=121642
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general




-------------------------------------------------------
All the advantages of Linux Managed Hosting--Without the Cost and Risk!
Fully trained technicians. The highest number of Red Hat certifications in
the hosting industry. Fanatical Support. Click to learn more
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to