At 11:31 AM +0200 9/5/00, Quim Sanmarti wrote:
>Well, AFAIK the flags we currently deal with are defined as a fixed 
>bit mask in the HtWordReference class, and are limited to
>       text

("text" here just means the absence of all flags)

>       capital (BTW, what is this?)

If the word is not completely lowercase, then this flag is set. This 
makes it possible to restrict a search to capitalized words (like 
proper names).
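
As a rough illustration of that rule, here is a minimal sketch. The names FLAG_TEXT, FLAG_CAPITAL and capital_flag, and the bit position used, are made up for this example and are not the actual HtWordReference constants. It also reads "not completely lowercase" as "contains at least one uppercase letter", which is an assumption.

```cpp
#include <cctype>
#include <string>

// Hypothetical values for illustration only; the real constants are
// defined in HtWordReference and may differ.
constexpr unsigned int FLAG_TEXT = 0;           // "text" is the absence of all flags
constexpr unsigned int FLAG_CAPITAL = 1u << 0;  // assumed bit position

// Returns FLAG_CAPITAL if the word contains any uppercase letter,
// otherwise FLAG_TEXT (no flags set).
unsigned int capital_flag(const std::string& word) {
    for (unsigned char c : word) {
        if (std::isupper(c)) {
            return FLAG_CAPITAL;
        }
    }
    return FLAG_TEXT;
}
```

With this sketch, capital_flag("htdig") yields no flags, while capital_flag("Geoff") and capital_flag("XML") both set the capital bit.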

>As you can see, they are mostly HTMLish, as htdig currently indexes 
>HTML (or whatever translated internally to HTML by parsers).

We have left a number of bits open. Our idea is that these can be 
user-configured or parser-configured. In other words, an XML parser 
could set appropriate flags as it goes, storing information about its 
flagset somewhere else.
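
One way the parser-configured idea could look is sketched below. This is not existing htdig code: the FlagSet class, its method names, and the choice of which bits count as open are all assumptions made for the example. It hands out unused bits by name and keeps the name-to-bit table so it can be stored somewhere else.

```cpp
#include <map>
#include <stdexcept>
#include <string>

// Hypothetical registry: a parser claims open bits of the flag word and
// records what each one means.
class FlagSet {
public:
    // first_free_bit and total_bits are assumptions, not values taken
    // from htdig; total_bits must not exceed the width of unsigned int.
    FlagSet(unsigned first_free_bit, unsigned total_bits)
        : next_(first_free_bit), total_(total_bits) {}

    // Returns the mask for `name`, allocating the next open bit on first use.
    unsigned int define(const std::string& name) {
        auto it = masks_.find(name);
        if (it != masks_.end()) {
            return it->second;
        }
        if (next_ >= total_) {
            throw std::overflow_error("no open flag bits left");
        }
        unsigned int mask = 1u << next_++;
        masks_[name] = mask;
        return mask;
    }

    // The name-to-mask table, i.e. the flagset description to store elsewhere.
    const std::map<std::string, unsigned int>& table() const { return masks_; }

private:
    unsigned next_;
    unsigned total_;
    std::map<std::string, unsigned int> masks_;
};
```

An XML parser could then call something like define("title-element") as it meets each element type and OR the returned mask into the word's flags, saving table() alongside the index.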

In short, we have left the flags open to future expansion with XML in 
mind, but I don't think we should target this for 3.2. My assumption 
is that the 3.3 effort will include work in this direction.

-Geoff

