Hello

I have posted a (wish)bug a couple of days ago regarding HTML
translations performed by htdig (#484345). I should ave brought the
issue on the list first (as Gilles Detillieux suggessted), so I'll just
bring up the issue now.

htdig does supports (afaik) 3 translations:

1. lg & gt (< >)
2. amp (&)
3. quot (")

However, there are some more escapes that I think would be helpful to
have.

For instance, ' (apostrophe '). 
Gilles said ' is not supported in HTML - that is correct; however, 
xhtml1.0 brings in XML well formed documents - in XML, you cannot use '
- ' is escaped as '

XHTML1.0 notes can be found at: http://www.w3.org/TR/xhtml1/ 
look at A2. Entity sets - special characters
http://www.w3.org/TR/xhtml1/DTD/xhtml-special.ent

The problem is that there are many more escape sequences (in the
&something; style); there are some ways to do it
1. by having a translation table - in which case htdig will translate
everything there so that htsearch will not misescape them while
displaying results (e.g. from a XHTML source file which has say €
when searching the browser will display € instead of the euro sign
cause htsearch escapes € into €).
2. Eliminate translations from htdig; htsearch will have to stop
escaping what is found in the DB in the &something; form.

I think the second way is better.

I'm not sure if I explained clearly I'll try to explain again if
necessary.

Is it possible/desirable ?

Thank you,
Ionut Nistor
[EMAIL PROTECTED]



_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to