Re: [whatwg] The problem of duplicate ID as a security issue

Alexey Feldgendler Thu, 16 Mar 2006 03:45:59 -0800

On Wed, 15 Mar 2006 19:26:03 +0600, Mihai Sucan <[EMAIL PROTECTED]>wrote:

Sandboxes are quite special things, so we'll need a DOMSandbox anyway.But instead of adding things like getElementById() to the DOMSandboxinterface, I tend to make the "fake document" which is visible frominside the sandbox a member of the sandbox itself. The call will looklike sandbox.document.getElementById().

As Ric said, having <sandbox>es treated "too similar" to a document isoverkill.

A DOMDocument interface has to be exposed to the contained scripts anyway,ahy not also make it accessible from the outside?

(A wild thought: maybe enforce ID uniqueness only for <!DOCTYPE html>?)

I think enforcing ID uniqueness in standards mode would be good, butthat would still probably break (very?) few pages. Those web authorsshould have to "live with it", because they want standards-compliantsites.

I'm not speaking about enforcing ID uniqueness at the time of parsing thepage, but only at the time of calling getElementById(). I believe it willbreak very few pages, if any.

I know that many web applications have bugs like this: they have a CSSrule like "#titlebar { font-weight: bold; }" and a single titlebar on thepage. After that, the requirements change, and they have more than onetitlebar on a page. To make the rule apply to all titlebars, they givethem all the same ID (instead of using class, not ID, in CSS rules). Whilesuch documents are non-connforming, they should not, in my opinion, causeparse errors even in standards mode. Here is why: duplicate IDs are wrong,but it's obvious what the author means, and it's easy to do "what theauthor intended".

Usually in such applications the scripts don't call getElementById() forthose ID values which occur more than once. If they occasionally do, it'sreally a programming bug. I don't believe that there are applications thatreally rely on the particular behavior in this case, though I admit thatthere are possibly some that have this bug unnoticed and still work. Ithink that this case should trigger an exception in standards modebecause, for this bug, there is no obvious fix to apply, and we don't know"what the author meant" -- does he want to do something to the firstelement with the specified ID, the second, or both.

Side note and wild guess: We are probably forgeting that the beauty ofthe web is actually allowing everyone to contribute, be it bad code orbetter code. Wanting something *that* strict is like disproving one ofthe essential concepts contributing to the success of the web.

Simply picking the last matching node is actually hiding a bug and lettingit go unnoticed. (Why the last one? Why not the first, for example?)

And, by the way, blog entries aren't the only place where sandboxingcan be appliied in blogs. For example, LiveJournal allows user-definedjournal styles which are written by the users in a self-inventedprogramming language which outputs HTML. That HTML goes through theHTML cleaner afterwards, of course. Manny people would love to adddynamic menus, AJAX comments folding etc to their styles. This could bepartly solved with a set of predefined "toys", but actually the entireLiveJournal styling system is about user-initiated development. Thosewith programming skills write new styles, and other users may take anduse them.

I did not see LiveJournal, so I don't know what kind of features theyoffer.
<sandbox> would probably do "the trick" (would help a lot with securityin this case also).

Yes, I think so. Actually, my activity around the sandboxing idea has beeninspired by several recent security incidents with LiveJournal and itsstyling system which failed to filter out some patterns of dangerous HTML.

Take HTML, for example, it's a markup language greatly appreciated bymany and despised by others. Even you said in one reply to this thread"today's HTML sucks" - advocating for the need of allowing user-scriptsin pages, for having table sorting, popup menus, etc. A few minuteslater in another reply you say "we already have a great markup language,which is HTML" - advocating for allowing users to write HTML, instead ofcustom markup.

Yeah, really, I sound a bit contradictory. Actually, in my opinion, HTMLis better than most of ad-hoc markup languages, and HTML with scripts isstill better than just HTML.

And another thing: HTML 5 is about to make HTML pages more powerful, thereare going to be menus, datagrids and such, but most of these features areuseless without scripting, aren't they? For example, a datagrid isn'treally sortable at client side without a script, which makes it useless inblogs and CMS unless they allow some scripting.

So, <sandbox> may be designed to help tighting-up security on the web,but we should also try to think of how's it actually in usage,side-effects, etc. It definitely solves problems, but will it causeother problems? How important are they?

Of course, there is a lot more to think and talk about. I suppose thereare going to be problems with particular buggy implementations ofsandboxing and exploits specifically targetted at holes in suchimplementations. I suspect that web application authors and siteadministrators will be hesitant to allow user scripting even in sandboxesbecause of the possible browser bugs. Though, because sandboxes can beuseful even if scripting inside them is completely disallowed, I hope thatthe use of sandboxes becomes somewhat popular even before siteadministrators decide to allow scripting.



-- Opera M2 9.0 TP2 on Debian Linux 2.6.12-1-k7

* Origin: X-Man's Station at SW-Soft, Inc. [ICQ: 115226275]<[EMAIL PROTECTED]>

Re: [whatwg] The problem of duplicate ID as a security issue

Reply via email to