Hi 

I want to index webpages containing the ’ entity and i got
excerpts with:

’ 

and the entity isn't displayed. 


I have played with 

translate_amp: true
translate_lt_gt: true
translate_quot: true
translate_latin1: true

but this doen't seems to be related to what i need. The problem seems to
be that the ’ is not in the list of known entities in
htdig/SGMLEntities.cc. I could add it but the code for this entity is
decimal 8217 and that won't fit in an unsigned char as defined:

static struct
{
    char                        *entity;
    unsigned char       equiv;
} entities[] =
  {
    { "lt",           '<' } ,

.....


Can somebody confirm my feeling and guide me to a solution ? 

Thank You

-- 
Marcel de Riedmatten

Attachment: signature.asc
Description: Ceci est une partie de message numériquement signée

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
ht://Dig general mailing list: <[email protected]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to