To build the new filter, I essentially read the HTML 4.01 spec, built a character-by-character state machine to separate out tags etc., parsed the tags, and then processed the attributes (depending on the tag). The first phase of this looks like it will be a total pig for CSS2 (look at the style sheet for Thought Crime). Does an off-the-shelf CSS2 parser exist (in Java, not using too many third-party libraries) that we could use without too much difficulty?
--
Matthew Toseland
toad at amphibian.dyndns.org
amphibian at users.sourceforge.net
Freenet/Coldstore open source hacker.
Employed full time by Freenet Project Inc. from 11/9/02 to 11/1/03
http://freenetproject.org/
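
For illustration only, the first phase described above (a character-by-character state machine that separates tags from the text around them) might look roughly like the Java sketch below. This is not the actual filter code: the class name TagSplitter, the split method, and the three states are hypothetical, and the sketch assumes well-formed input, ignoring comments, CDATA, and scripts.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the real Freenet filter: splits HTML into
// text runs and "<...>" tag tokens by walking the input one char at a time.
final class TagSplitter {
    private enum State { TEXT, TAG, QUOTED }

    static List<String> split(String html) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        State state = State.TEXT;
        char quote = 0; // the quote character that opened the current attribute value

        for (int i = 0; i < html.length(); i++) {
            char c = html.charAt(i);
            switch (state) {
                case TEXT:
                    if (c == '<') {
                        flush(tokens, current); // emit the text run before the tag
                        state = State.TAG;
                    }
                    current.append(c);
                    break;
                case TAG:
                    current.append(c);
                    if (c == '>') {
                        flush(tokens, current); // emit the complete tag
                        state = State.TEXT;
                    } else if (c == '"' || c == '\'') {
                        quote = c;              // a '>' inside quotes must not end the tag
                        state = State.QUOTED;
                    }
                    break;
                case QUOTED:
                    current.append(c);
                    if (c == quote) {
                        state = State.TAG;
                    }
                    break;
            }
        }
        flush(tokens, current); // trailing text, or an unterminated tag
        return tokens;
    }

    // Adds the buffered token to the list (if non-empty) and clears the buffer.
    private static void flush(List<String> tokens, StringBuilder current) {
        if (current.length() > 0) {
            tokens.add(current.toString());
            current.setLength(0);
        }
    }

    public static void main(String[] args) {
        for (String token : split("<a href=\"x>y\">link</a> done")) {
            System.out.println(token);
        }
    }
}
```

Each tag token would then go to a separate tag parser, and its attributes would be handled according to the tag name. The QUOTED state is there so that a '>' inside an attribute value does not end the tag early; a tokenizer for CSS2 would need noticeably more states than this.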
