According to Abdullah Ramazanoglu:
> Since this is a quick feedback mail, I didn't subscribe to any 
> list. Sorry if you feel your inbox is being littered.
> Also, instead of sending a mail to each party, I'm sending a 
> composite mail to all parties at once, so some parts may not be 
> of interest to you.

No problem.  We appreciate this sort of feedback, even though in
this case these are mostly known problems.  Most of these are fixed
in the htdig-3.2.0b4-081901.tar.gz snapshot.  Please give it a try.
(The 082601 snapshot seems to be truncated, so use the next to last one,
which is the most recent working version.)

You can still post to htdig-general, or even htdig-dev (subject to
moderator approval), even if you're not subscribed, so that's still the
best place to post htdig-related correspondance.

> htdig= 3.2.0b3
> 
> cygwin= 1.3.2-1 and latest set as of Aug 2001 especially, 
> gcc-2.95.3-5 and peer libs, flex-2.5.4-1, autoconf-2.52-1, 
> make-3.79.1-4
...
> Modifications to htdig :
> 
> ==============================================================
> 
> 1. htsearch/Display.cc :
> Add,
> #include <limits.h>
> #include <float.h>
> 
> Notes: This step is necessary for both RedHat and Cygwin 
> environments. RedHat solves (in their src.rpm) this by just 
> adding "#include <values.h>" but values.h is being phased out, 
> (limits.h and float.h recommended instead) and does not compile 
> in cygwin environment. (Cygwin has a "dummy" values.h) The 
> following steps are necessary only for Cygwin environment.

Fixed in the recent snapshot.  At least, float.h is added.  Is limits.h
actually needed on Cygwin?

> ==============================================================
> 
> 2. htfuzzy/EndingsDB.cc :
> 
> Comment-out the following :
> // This is an attempt to get around compatibility problems
> // with the included regex
> // 
> #ifdef HAVE_BROKEN_REGEX
> // 
> #include <regex.h>
> // 
> #else
> // 
> #include "regex.h"
> // 
> #endif
> 
> And add this :
> #include "../htlib/regex.h"
> 
> ==============================================================
> 
> 3. htfuzzy/Fuzzy.cc :
> 
> Instead of:
> #include "Regex.h"
> 
> Use:
> #include "../htlib/Regex.h"
> #include "Regex.h"
> 
> ==============================================================
> 
> 4. htfuzzy/Regex.cc :
> 
> Instead of:
> #include "Regex.h"
> 
> Use:
> #include "../htlib/Regex.h"
> #include "Regex.h"

These 3 problems should be fixed in the recent snapshot.  The cause
was the ambiguity between regex.h and Regex.h on a case-insensitive
file system.  The latter header has been renamed to Regexp.h.

> ==============================================================
> 
> 5. htdb/htdb_load.cc & htdb/htdb_dump.cc & htdb/htdb_stat.cc :
> 
> In each of these 3 programs, add the following to the end of 
> "include" block:
> 
> #ifdef HAVE_GETOPT_H
> #include <getopt.h>
> #endif

Fixed in the recent snapshot.

> ==============================================================
> ==============================================================
> 
> If you're not interested in htdig please be informed that what 
> follows on is relevant to htdig only.
> 
> And some bugs in htdig:
> 
> 1- Sort-by-title does not sort. It is not related to locale 
> settings, because my locale (tr_TR) works properly, and it does 
> not sort in C or en_US locale as well.

Fixed in the recent snapshot.

> ==============================================================
> 
> 2- In templates, $(EXCERPT) HTML escaping works in a different 
> (and unwanted to me) way: It HTML escapes when not being asked 
> to. And it HTML-escapes the HTML code itself when asked to. Example:
> 
> $(TITLE) correctly produces: "This is � umlaut & � cedilla"
> $&(TITLE) correctly produces: "This is &uuml; umlaut &amp; 
> &ccedil; cedilla"
> which are what we would expect.
> 
> But,
> $(EXCERPT) produces: "This is &uuml; umlaut &amp; &ccedil; cedilla"
> and,
> $&(EXCERPT) produces : "This &lt;b&gt;is&lt;/b&gt; &uuml; umlaut 
> &amp; &ccedil; cedilla"
> 
> Which are unexpected. As it is, $(EXCERPT) is behaving as if 
> $&(EXCERPT), and $&(EXCERPT) is html escaping the html code itself.
> 
> I toyed around Display.cc to hard-disable SGML encoding totally, 
> without luck so far. (It mangles up some local characters. They 
> are defined in my locale, and gets html-escaped allright, but 
> not every browser understand html-escapes for local characters, 
> so shows garbage) Disabling html-escaping in all other fields 
> {i.e. use $(TITLE) instead of $&(TITLE)} solved the problem, but 
> disabling it in EXCERPT is not possible apparently. Any 
> suggestions are welcome)

The EXCERPT template variable can't be used as $&(EXCERPT) because
it may contain HTML tags in it, which you don't want encoded as,
e.g. &lt;strong&gt;.  So, you must use it as $(EXCERPT), but to prevent
any unencoded "<", "&", etc. from being directly inserted in with the
HTML, the plain text parts of the excerpt are SGML encoded.

The mangling of accented characters in non-ISO-8859-1 locales,
by htsearch's SGML encoding, is a known problem, but we don't have a
solution as yet.  The only fix right now is to hack the HtSGMLCodec class
not to convert any locale-specific accented characters (those above 127),
but to still convert the plain ASCII characters that need it.  I think
we need to add a configuration attribute to control whether this is done.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to