Hi,
is there a way of "editing" the htdig documents database after the
dig is finished? We tried the BerkeleyDB tools that are included
in the htdig distribution but e.g. db_dump refuses to do anything
with any of the database files. The error messages always look like
this:
db_dump:
At 9:35 AM +0100 12/6/00, [EMAIL PROTECTED] wrote:
Is there anything wrong with our db files? htsearch seems to be able
to use them, though. Am I missing something?
No, but I don't think you want to use the db_dump programs to deal
with them. In particular, ht://Dig "serializes" the documents
At 8:54 AM +0100 12/6/00, André LAGADEC wrote:
I think that htdig doesn't like the HTML code "!--//" and "//--": it
sees the beginning of the comment but not the end, and ignores the rest
of the page's HTML code.
This is probably correct from the output you sent.
Am I right? Any other ideas? What can
I just love getting to know new software. There's always some form of
teething pain. Yesterday, I started running my first set of reasonably
large htdig/htmerge processes. Came in today to find the Linux server
(which is running nothing besides basic Mandrake processes and, of course,
htdig)
On Wed, 6 Dec 2000 [EMAIL PROTECTED] wrote:
You can always include a file in the config file, e.g.:
exclude_urls: `/path/to/patterns`
Ok, so that works for exclude_urls, as well (... maybe we just shoulda
tried that in the first place). Fine.
It works for any config attribute:
I can answer this one!!!
I have Redhat Linux 5 and I have never, ever crashed it, even when
using htDig for both my internal and external web site parsings. This
sounds like something deeper, like a hard disk error or bad RAM that
crashed when htDig ran across it. I would recommend a good set
On Wed, 6 Dec 2000, David Gewirtz wrote:
In any case, I couldn't telnet into the Linux box, couldn't run my KDE
console, nada.
I've never seen Linux hang like that before. Almost makes me wish for NT.
I have only rarely seen a server dead enough that it wouldn't accept
outside connections
Hey all,
I'm new to this list but have gone through the FAQ and have been looking
through the Search Engine for people with a similar if not the exact
same problem.
I am using RedHat 6.2 with GCC 2.95.2 with GNU ld 2.9.5, and I have
libstdc++ 2.9.0-30 installed (latest version). This is
According to Jun Dong ([EMAIL PROTECTED]):
Thanks for your tips.
In the Slackware 7.0 packages there are no LC_CTYPE, LC_*, etc. files under
/usr/lib/locale/de or deutsch.
Under /usr/lib/locale/de there is only the directory LC_MESSAGES.
I have copied the directory de_DE, which includes all the LC_* files, from
According to Foerst, Daniel P.:
I am using RedHat 6.2 with GCC 2.95.2 with GNU ld 2.9.5, and I have
libstdc++ 2.9.0-30 installed (latest version). This is htdig-3.1.5
I am not able to figure out what is going wrong. Any assistance you can
lend is greatly appreciated!
...
I run the
On Wed, 6 Dec 2000, David Gewirtz wrote:
* Is there a way to tell what files got chomped by the fsck and have
lost+found nodes?
Nope. That's why they're "lost and found." You can, however, take a look
at what's in there.
* Is there a way to check a log for htdig?
Not unless you were
Hypothetical Situation:
I have an SQL database table of links I wish to present someone visiting
my site. However, I would like to make these links searchable from my
site. Normally, if these links were static, I would just list them in
the htdig.conf file.
Is there any way to have start_url
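One possible approach, building on the backquoted file include mentioned earlier in this listing (which a poster says works for any config attribute): export the links from the database to a plain file, one URL per line, and point start_url at that file. The sketch below assumes SQLite and a hypothetical table "links" with a column "url"; neither name comes from the original post, and a different database would need its own driver.

```python
import sqlite3
from pathlib import Path


def export_start_urls(db_path, out_path, table="links", column="url"):
    """Write every non-empty URL from the table to out_path, one per line.

    The table and column names are hypothetical defaults, not taken from
    the original post. Returns the number of URLs written.
    """
    conn = sqlite3.connect(db_path)
    try:
        # Identifiers cannot be bound as parameters, so they are quoted
        # inline; only pass trusted table and column names.
        query = f'SELECT "{column}" FROM "{table}" ORDER BY "{column}"'
        rows = conn.execute(query).fetchall()
    finally:
        conn.close()

    # Drop NULL and blank entries and trim stray whitespace.
    urls = [row[0].strip() for row in rows if row[0] and row[0].strip()]

    text = "\n".join(urls) + ("\n" if urls else "")
    Path(out_path).write_text(text, encoding="utf-8")
    return len(urls)
```

Run this before each dig (for example from the same script that starts htdig), then reference the generated file from htdig.conf with the backquote syntax shown for exclude_urls above. Whether this fits depends on how often the table changes, since the file is only as fresh as the last export.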
David Gewirtz wrote:
I just love getting to know new software. There's always some form of
teething pain. Yesterday, I started running my first set of reasonably
large htdig/htmerge processes. Came in today to find the Linux server
(which is running nothing besides basic Mandrake processes
At 5:59 PM -0600 12/6/00, Heriberto Cantu wrote:
It was quick work, so it probably needs a second review and the completion
of the synonyms.es file.
I think it is a good idea to have this package on the www.htdig.org site,
but I couldn't find a way to upload it.
You can try ftp://www.htdig.org/upload/
16 matches