Gabriele Bartolini writes:
> je suis Gabriele ... and I write you from Italy. I am building up a
Ciao Gabriele :-)
> program that checks a site and stores all the tags it found in every
> document crawled. I argueed with Geoff lots of times about the best
> database to use ... We have come up that the best solutioni was to use
> mySQL. I have read in your previous message to the htdig3-dev that you have
> encountered some problems with it. Can you please give me more details on
> it? Indeed I have created a simple mysql interface in order to manage the C
> API interface.
There are three main problems with MySQL :
. No transactions
. No locks
. Size of field indexed max 255 char
Also you must take care to use the 'binary' keyword if you want your
search to be case sensitive. If you can cope with the three bottlenecks,
MySQL is very fast and compact.
> As I am at the beginning of the work, and now me and Geoff are focusing our
> attention on implementing a persistent connection mechanism on ht://dig
> (and ht://check), I would appreciate any kind of suggestions from you.
I didn't know about ht://check. Is it possible to get it somewhere ?
Are you planning to provide some effort to modify ht://dig so that it
can use a SQL database instead of Berkeley DB ?
Cheers,
--
Loic Dachary
ECILA
100 av. du Gal Leclerc
93500 Pantin - France
Tel: 33 1 56 96 09 80, Fax: 33 1 56 96 09 61
e-mail: [EMAIL PROTECTED] URL: http://www.senga.org/
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.