Selectors API Method Names

Lachlan Hunt Fri, 22 Jun 2007 22:26:11 -0700

Hi,

The naming of the methods in the Selectors API specification has been heavily debated. I understand the reason for this debate. There is a long history of bad naming (XMLHttpRequest, AJAX, etc.) and there is clearly a desire to avoid that again. However, it's virtually impossible to come up with the perfect name, since everyone has a different opinion of what that may be.

My rationale in resolving this issue has not depended upon the majority opinion. Arguments based on nothing more than personal preference have not been given much weight, simply because everyone has different opinions and it's impossible to rank one above another. All arguments based on logic, reasoning or technical issues have been taken into account and judged on their own merit, not the credentials of the people making the arguments.

With that in mind, and given that the arguments that have been put forward are sometimes mutually exclusive, please understand that whatever the final decision, it is simply not possible to please everyone, but I hope we can all accept it.



*Rationale for Naming Principles*

A short name is important for several reasons. Since these methods are designed, and behave, as a superset of the existing getElementsBy* methods, it is likely that their use will significantly replace the use of previous methods.

Based on evidence from many widely used JS libraries, and other feedback, we know that authors generally prefer shorter names over longer names. This is particularly true for methods that will be frequently used.

Not only does a shorter name reduce the amount of typing, it can help to improve the readability of the source code, which in turn makes code easier to maintain. Given these reasons, I have concluded that, within reason, short names are a fundamental requirement that must be adhered to.

Ideally, the chosen method names would be relatively clear and unambiguous with regard to their purpose, usage and return value. However, this must be put into perspective; the names do not need to describe these aspects perfectly.

One of the few names that meets this specific requirement perfectly is getElementsByGroupOfSelectors, but that name is clearly too long. An appropriate balance needs to be found between clarity and convenience.

With a somewhat intuitive name, it is expected that frequent use by authors will alleviate any remaining concerns about the constant need to refer to documentation arising from any ambiguity in the name. For example, despite the name being completely non-descriptive, many authors frequently use $() as an alias for document.getElementById() without the need to constantly look up what it means.
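
As a minimal sketch of the kind of alias described above: the exact definition differs between libraries, and makeDollar and stubDocument are hypothetical names used only so the snippet runs outside a browser.

```javascript
// Build a $-style alias bound to a given document-like object.
// In a browser you would pass the real `document`.
function makeDollar(doc) {
  return function $(id) {
    return doc.getElementById(id);
  };
}

// Hypothetical stub standing in for a browser document.
const stubDocument = {
  nodes: { header: { id: "header", tagName: "DIV" } },
  getElementById(id) {
    return this.nodes[id] || null;
  },
};

const $ = makeDollar(stubDocument);
console.log($("header").tagName); // DIV
console.log($("missing"));        // null
```

Nothing in the name $ says what it does, yet the call sites stay readable once the convention is known.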

The name must not clash with any existing DOM API, nor be too similar to an API that could cause confusion amongst authors. It is also somewhat important to choose a name that is less likely to clash with a future API, though this is difficult because it is impossible to predict the future. It can, however, be helped by choosing a relatively unambiguous name. For example, choosing selectAll() as one of the names would cause significant confusion because the name is commonly associated with, and very similar to, text selection APIs.

For the two methods, it is desirable that the chosen names are somewhat related to each other. However, because there is a need to maintain code readability, the two chosen names cannot be too similar to each other, because that would reduce the maintainability of code.

It has been argued that the chosen names should be in line with the conventions of existing DOM APIs, including:


* getElementById
* getElementsByName
* getElementsByTagName
* getElementsByTagNameNS
* getElementsByClassName (proposed in HTML5)

The convention is to use getElement* for methods that return a single node and getElements* for methods that return multiple nodes. However, given that we need two separate methods for these APIs, this is not desirable as it conflicts with the need for readability.

For example, choosing getElementBySelector() and getElementsBySelector() would not be good choices because they only differ by a single character. This would make it easier to make a mistake by typing the wrong method, and also make it more difficult to recognise the error when debugging code.
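
One rough way to put a number on "too similar" is the edit distance between the two names in a pair. The sketch below uses the standard Levenshtein algorithm; editDistance is a hypothetical helper, and the metric itself is an assumption made for illustration, not something used in reaching the decision.

```javascript
// Classic dynamic-programming edit distance (Levenshtein):
// the minimum number of single-character insertions, deletions
// or substitutions needed to turn string a into string b.
function editDistance(a, b) {
  // prev[j] holds the distance between the first i-1 chars of a
  // and the first j chars of b.
  let prev = Array.from({ length: b.length + 1 }, (_, j) => j);
  for (let i = 1; i <= a.length; i++) {
    const curr = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr[j] = Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost);
    }
    prev = curr;
  }
  return prev[b.length];
}

const pairs = [
  ["getElementBySelector", "getElementsBySelector"],
  ["getElement", "getElements"],
  ["getElement", "getElementList"],
  ["selectElement", "selectAllElements"],
];
for (const [one, many] of pairs) {
  console.log(one, many, editDistance(one, many));
}
```

The first two pairs come out at a distance of 1, while the pairs that add a distinct word come out at 4, which is the difference a reader has to spot when scanning code.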

There is precedent for not following this convention for similar methods. DOM Level 3 XPath defines the evaluate() method for evaluating XPath expressions. (Although, that method is slightly different because it has a variable return type.)

In addition, Microsoft's .NET uses selectSingleNode() and selectNodes() for their proprietary XPath implementation. Unfortunately, this also means that those and similar names cannot be used for these APIs.



*Summary of Naming Principles*

* Short
* Somewhat descriptive of the functionality
* Clear, concise and relatively unambiguous
* Avoid clashes with other APIs due to ambiguous naming
* Easy to type (can't rely on autocomplete)
* Easy to read


*Rejections*

The following names have been rejected for the reasons detailed below.

* match()                             matchAll()
* matchSelector()                     matchAllSelectors()
* matchSelectors()                    matchAllSelectors()

match() is already used for regular expressions matching on Strings in ECMAScript and it is considered better to use a less ambiguous name. It is also not clear whether match() would return a single or multiple elements, without having to assume based on the other being matchAll().
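
For reference, a short sketch of the existing String match() behaviour in ECMAScript. The sample string and patterns are made up for illustration.

```javascript
const text = "one two three";

// Without the g flag, match() returns the first match
// together with its capture groups.
const first = text.match(/t(\w+)/);
console.log(first[0], first[1]); // two wo

// With the g flag, it returns every full match as an array of strings.
const all = text.match(/t\w+/g);
console.log(all.join(",")); // two,three

// With no match at all, it returns null.
console.log(text.match(/xyz/)); // null
```

The existing method already switches between a single result and several depending on a flag, which is part of why reusing the name for element lookup would be ambiguous.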

The matchSelector/matchAllSelectors variants have been rejected because the names seem to be misleading. They create the impression that the former matches against a single selector and the latter against multiple selectors, instead of returning single and multiple results, respectively.


* select()                            selectAll()
* selectOne()                         selectAll()
* selectFirst()                       selectAll()
* selectSingle()                      selectAll()
* selectSingleNode()                  selectNodes()
* selectNode()                        selectNodeList()

These select*() variations are either in direct conflict with, or very similar to, existing APIs, and their use would result in confusion amongst authors.


* get()                               getAll()
* getOne()                            getAll()

These were rejected because they are not descriptive at all, and they are too ambiguous. They were well received by almost no one.


* getElementBySelector()              getElementsBySelector()
* getElementBySelectors()             getElementsBySelectors()
* getElementByCSSSelector()           getElementsByCSSSelector()
* getElementBySelector()              getElementListBySelector()
* getElementBySelectors()             getElementListBySelectors()
* getElementByGroupOfSelectors()      getElementsByGroupOfSelectors()
* getElementByGroupOfSelectors()      getElementListByGroupOfSelectors()

These were all rejected because they are far too long. While they are very clear and mostly follow the established convention, they are not concise and do not satisfy the length requirement.
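
To make the length comparison concrete, the sketch below counts the characters in a few of the pairs listed in this message. pairLength is a hypothetical helper, and raw character count is only a crude proxy for typing effort.

```javascript
// Candidate pairs taken from the lists in this message.
const candidates = [
  ["matchOne", "matchAll"],
  ["selectElement", "selectAllElements"],
  ["getElementBySelector", "getElementsBySelector"],
  ["getElementByGroupOfSelectors", "getElementsByGroupOfSelectors"],
];

// Total characters an author types to spell out both names once.
function pairLength([single, multiple]) {
  return single.length + multiple.length;
}

// Copy before sorting so the original order is left untouched.
const byLength = [...candidates].sort((a, b) => pairLength(a) - pairLength(b));

for (const pair of byLength) {
  console.log(pairLength(pair), pair.join(" / "));
}
```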


* nodeBySelector()                    nodeListBySelector()
* getNode()                           getNodes()
* getNode()                           getAllNodes()
* getNode()                           getNodeList()
* getNodeBySelector()                 getNodeListBySelector()
* getNodeByExpr()                     getNodeListByExpr()
* getBySelector()                     getBySelectorAll()

These variants using node instead of element were rejected for a few reasons. Selectors only select elements, not all types of nodes, and it doesn't seem likely that selectors would be extended to select non-element nodes in the future. These also break the established convention of using getElement, and there is no reasonable justification for doing so in these cases.


* css()                               cssAll()
* cssQuery()                          cssQueryAll()
* matchCSS()                          matchCSSAll()

Selectors aren't just for CSS, as this API clearly demonstrates. Although there is a common association between selectors and CSS, there is no reason to encourage this misconception. The names create the impression that they deal with CSS styles, rather than selecting elements.

Although there has been a previous JavaScript implementation of cssQuery, this is not considered sufficient justification for using the ambiguous name.



*Candidates*

After careful consideration, I've narrowed down the remaining options to these seven pairs.


* matchSingle()                       matchAll()
* matchOne()                          matchAll()
* getElement()                        getElementList()
* getElement()                        getElements()
* selectElement()                     selectElementList()
* selectElement()                     selectAllElements()
* chooseOne()                         chooseAll()

These are summarised with their pros and cons below:

* matchSingle()/matchOne()/matchAll()

These names are short, easy to type and easy to read. The choice between matchOne and matchSingle would effectively come down to personal preference. Although matchOne is shorter, it's not significantly better than matchSingle.

The advantage of using matchSingle and matchAll is that there is an existing JavaScript implementation of these methods in Dean Edwards' Base2 library. While the implementation could be considered evidence in support of these names, it must be noted that these names were implemented simply because they were the names in the spec at the time.

The names aren't completely clear and unambiguous and it must be noted that these names did not receive wide acceptance when they were put in the draft, and so choosing these names probably wouldn't be the most productive choice.


* getElement()/getElementList()/getElements()

The advantage of these methods is that they somewhat follow the established convention, although not completely because they don't specify BySelector (or equivalent). Given that these APIs are effectively a superset of existing getElement* methods, it makes some sense to use names that recognise that.

The problem with choosing getElement and getElements is that they are too similar to each other, which reduces code readability, and so using getElementList would be a workaround for that issue.


* selectElement()/selectElementList()/selectAllElements()

Overall, these names are relatively good. While they are not the shortest alternative, they are not too long; they are relatively easy to type and are easy to read; and they are clear and concise. By using the word "select", which is easily associated with Selector, they are somewhat more descriptive than the getElement variations.

One problem is that the use of "select*" is similar to the .NET XPath methods (selectSingleNode/selectNodes), though the use of Element instead of Node reduces the confusion slightly. Those are also proprietary methods that aren't used outside of .NET. (The DOM3XPath standard uses evaluate() instead.)

Several people expressed a preference for select() and selectAll(), though they inevitably had to be rejected due to clashes and ambiguity with the name. Using selectElement and selectAllElements instead seems like a good compromise that solves the problem.


* chooseOne()/chooseAll()

The word "choose" is an alternative verb to the word "select"; however, it's a slightly more ambiguous term. While these are shorter, the advantage of length isn't quite enough to sacrifice the clarity of the name.



*Conclusion*

After carefully considering all of these reasons, I have updated the spec to use selectElement() and selectAllElements(), based on the arguments given above.
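
A toy sketch of how call sites might read with the chosen names. This is not the spec's algorithm: makeSelectable is a hypothetical stand-in for a document, its matcher understands only type selectors, and it assumes that the single-result method yields the first match or null.

```javascript
// Hypothetical stand-in for a document: a flat list of element-like
// objects. The matcher understands only type selectors such as "p";
// a real implementation would accept any group of selectors.
function makeSelectable(elements) {
  const matches = (el, selector) =>
    el.tagName.toLowerCase() === selector.toLowerCase();
  return {
    // First matching element in list order, or null (assumed behaviour).
    selectElement(selector) {
      return elements.find((el) => matches(el, selector)) || null;
    },
    // Every matching element, in list order.
    selectAllElements(selector) {
      return elements.filter((el) => matches(el, selector));
    },
  };
}

const doc = makeSelectable([
  { tagName: "H1", text: "Title" },
  { tagName: "P", text: "First" },
  { tagName: "P", text: "Second" },
]);

console.log(doc.selectElement("p").text);       // First
console.log(doc.selectAllElements("p").length); // 2
console.log(doc.selectElement("ul"));           // null
```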


http://dev.w3.org/cvsweb/~checkout~/2006/webapi/selectors-api/Overview.html?content-type=text/html;%20charset=UTF-8

--
Lachlan Hunt
http://lachy.id.au/
