Hi I want to index webpages containing the ’ entity and i got excerpts with:
’
and the entity isn't displayed.
I have played with
translate_amp: true
translate_lt_gt: true
translate_quot: true
translate_latin1: true
but this doen't seems to be related to what i need. The problem seems to
be that the ’ is not in the list of known entities in
htdig/SGMLEntities.cc. I could add it but the code for this entity is
decimal 8217 and that won't fit in an unsigned char as defined:
static struct
{
char *entity;
unsigned char equiv;
} entities[] =
{
{ "lt", '<' } ,
.....
Can somebody confirm my feeling and guide me to a solution ?
Thank You
--
Marcel de Riedmatten
signature.asc
Description: Ceci est une partie de message numériquement signée
------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________ ht://Dig general mailing list: <[email protected]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

