Re: [htdig] doclist, perl db.docdb access

1999-09-11 Thread Geoff Hutchison


At 1:16 PM -0500 9/10/99, Bill Carlson wrote:
>I have access to the database, but the hashed information doesn't seem
>right. For example, the key should be the URL in question, yet when
>running doclist.pl for example, the output is something like:

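The problem is the common_url_parts and url_part_aliases attributes, 
which encode and compress the URLs stored in the database. The 
control characters you are seeing are part of that encoding, and the 
contrib scripts don't decode it. In short, the Perl code hasn't kept 
pace with the C++ code.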
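
As a rough illustration of what a script would have to undo, here is a
minimal sketch in Python. It assumes each control character in a stored
key stands for one entry of the common_url_parts list, numbered in order
from some starting code. The starting value, the example list, and the
function names are all made up for illustration; the real mapping
(including how url_part_aliases is applied) is whatever the C++ code
defines and should be checked there.

```python
# Hypothetical illustration only: the code values and the parts list
# below are assumptions, not ht://Dig's actual codec.
FIRST_CODE = 7  # assumed value of the first substitution character


def build_table(parts, first_code=FIRST_CODE):
    """Assign one control character to each URL part, in list order."""
    return {chr(first_code + i): part for i, part in enumerate(parts)}


def decode_key(key, table):
    """Expand every substitution character found in a stored key."""
    return "".join(table.get(ch, ch) for ch in key)


# Stand-in for the configured common_url_parts value (made up here).
parts = ["http://", ".html"]
table = build_table(parts)

# A key as it might appear in the database, control characters included.
stored = "\x07www.example.org/index\x08"
print(decode_key(stored, table))
```

Characters with no entry in the table pass through unchanged, so a key
stored without compression would come back as it went in.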

Ideally, we should have a shared C++ library with Perl bindings to 
it. The Perl code would then keep pace with the C++ code as long as 
there aren't significant changes requiring binding updates. The 3.2 
development tree is already beginning to be built on a shared library 
setup.

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.



[htdig] doclist, perl db.docdb access

1999-09-10 Thread Bill Carlson


Hello,

I am running into problems with the contrib Perl scripts. I understand
that various fields have been added to the docdb that aren't in some of
the scripts; I have accounted for those.

I have access to the database, but the hashed information doesn't seem
right. For example, the key should be the URL in question, yet when
running doclist.pl for example, the output is something like:

^Gwww.somewhere.org/index.html^S

where those are control characters that only show when piping through
less.

I modified the script to use BerkeleyDB instead of GDBM_File, but saw no
change.

Any pointers?

Thanks,

Bill Carlson

Systems Programmer    [EMAIL PROTECTED]     |  Opinions are mine,
Virtual Hospital      http://www.vh.org/    |  not my employer's.
University of Iowa Hospitals and Clinics    |



