Dar wrote:

>  However, that does have its 
>own problems.  You end up with pieces of elements you don't expect.  I 
>ran a quick test with Tagalog and getting the last word got part of a 
>font element and the trailing </p>.  Of course, since it is all ASCII, 
>you can program around that.


That's quite right. In my experience so far, these trailing and fronting 
elements all occur very regularly and can be filtered out as one wishes. A line 
or two makes it all neat and manipulable. There is also the problem that 
unicode-derived ascii will produce "one" legible word for Transcript, no matter 
how many characters are involved.  A filter is required to space out betwen ";" 
and "&" which is also not a big deal. I am just not sure exactly what sort of 
functions Thomas wants to apply to his unicode chunks.
_______________________________________________
use-revolution mailing list
use-revolution@lists.runrev.com
http://lists.runrev.com/mailman/listinfo/use-revolution

Reply via email to