Torsten Neuer wrote:
> Depends on what ht://Dig is developing towards. ;-)
> 
> I'd rather split things up for correct HTML 4.0 and language support.
> <LINK> is somewhat more complex than <AREA> with regard to language
> support.  <LINK> adds the "hreflang" attribute to denote the language
> of the referred document whereas such an attribute is unknown to <AREA>.
 
The <LINK> tag is even more complex than that. From the HTML 4.0
specification (in the META section):

  Note. When a property specified by a META element takes a value that
  is a URI, some authors prefer to specify the meta data via the LINK
  element. Thus, the following meta data declaration:

    <META name="DC.identifier"
          content="ftp://ds.internic.net/rfc/rfc1866.txt">

  might also be written:

    <LINK rel="DC.identifier"
          type="text/plain"
          href="ftp://ds.internic.net/rfc/rfc1866.txt">

In other words, we have to worry about <META> tags masquerading as
<LINK> tags!
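
To make the problem concrete, here is a rough sketch (in Python, not
ht://Dig's own C++) of one way a parser might tell the two apart. The
heuristic is purely an assumption for illustration: a <LINK> whose "rel"
tokens are all link types listed in HTML 4.0 is treated as a real link,
and anything else (such as "DC.identifier") as possible meta data. The
class name and the two result lists are made up for this example.

```python
from html.parser import HTMLParser

# Link types listed in the HTML 4.0 specification. Treating every other
# "rel" value as meta data is an assumption made for this sketch only.
HTML4_LINK_TYPES = {
    "alternate", "stylesheet", "start", "next", "prev", "contents",
    "index", "glossary", "copyright", "chapter", "section",
    "subsection", "appendix", "help", "bookmark",
}


class LinkClassifier(HTMLParser):
    """Sort <LINK> elements into ordinary links and likely meta data."""

    def __init__(self):
        super().__init__()
        self.links = []     # href values of ordinary links
        self.metadata = []  # (rel, href) pairs that look like meta data

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "link":
            return
        attributes = dict(attrs)
        href = attributes.get("href")
        if not href:
            return
        rel = attributes.get("rel") or ""
        tokens = rel.lower().split()
        # No "rel" at all, or only known link types: an ordinary link.
        if all(token in HTML4_LINK_TYPES for token in tokens):
            self.links.append(href)
        else:
            self.metadata.append((rel, href))


parser = LinkClassifier()
parser.feed(
    '<LINK rel="DC.identifier" type="text/plain" '
    'href="ftp://ds.internic.net/rfc/rfc1866.txt">'
    '<LINK rel="next" href="ch2.html">'
)
print(parser.links)
print(parser.metadata)
```

A real indexer would probably need a configurable list rather than a
hard-coded one, since authors are free to use other "rel" values.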

> To ease the task of automatically switching between multiple languages
> (i.e. stopword lists, dictionaries etc.) in the future it probably will
> be better to handle both DTD elements separately.

The change isn't *that* hard--you'd just separate the case statements
and duplicate the current code. I'm going to leave the change as-is, with
the <AREA> and <LINK> tags parsed by the same code. But patches for more
involved HTML parsing are welcome. ;-) This will get a little easier
when the Transport object is more robust and the HTML parser can fetch
the linked meta-data.
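
For anyone who wants to try the split, the idea looks roughly like this.
It is a Python sketch of the structure only, not the actual ht://Dig
parser code; the class and method names are hypothetical. The one
difference between the two branches is that the <LINK> handler records
"hreflang", which <AREA> does not have.

```python
from html.parser import HTMLParser


class SplitTagParser(HTMLParser):
    """Handle <AREA> and <LINK> in separate branches (hypothetical names)."""

    def __init__(self):
        super().__init__()
        # Each entry is (tag, href, language); language is None for <AREA>.
        self.found = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        # The two tags used to share one branch; here each gets its own.
        if tag == "area":
            self._handle_area(attributes)
        elif tag == "link":
            self._handle_link(attributes)

    def _handle_area(self, attributes):
        href = attributes.get("href")
        if href:
            self.found.append(("area", href, None))

    def _handle_link(self, attributes):
        href = attributes.get("href")
        if href:
            # hreflang names the language of the referred document.
            self.found.append(("link", href, attributes.get("hreflang")))


parser = SplitTagParser()
parser.feed(
    '<AREA shape="rect" coords="0,0,10,10" href="a.html">'
    '<LINK rel="alternate" hreflang="de" href="b.html">'
)
print(parser.found)
```

Once the language is recorded per link, switching stopword lists or
dictionaries could be keyed off that third field.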

-Geoff