Re: [RDA-L] Part 2: Efficiency of DBMS operations Re: [RDA-L] [BIBFRAME] RDA, DBMS and RDF

Karen Coyle Mon, 14 May 2012 17:53:17 -0700

Note to the majority of readers on RDA-L: you should feel no guilt in skipping the rest of this thread. It has veered off into a technical discussion that you may simply have no time (or use) for - kc


On 5/14/12 12:50 PM, Simon Spero wrote:

    On Mon, May 14, 2012 at 10:45 AM, Karen Coyle <li...@kcoyle.net
    <mailto:li...@kcoyle.net>> wrote:
     What happened with the MARC format is that when we moved it into
    actual databases it turned out that certain things that people
    expected or wanted didn't really work well. For example, many
    librarians expected that you could *[a]* /replicate a card catalog
    display/ with *[b]* /records/ /displaying in order by the/
    /heading that was searched/. That is really hard to do (*[c]* /and
    not possible to do efficiently/) using *[d]* /DBMS/ functionality,
    which is based on *[e]* /retrieved sets/ not /linear ordering/,
    and *[f]* /especially using keyword searching/.  [emphasis and
    labels added]
BLUF: Not all DBMS are Relational; it is possible to efficiently retrieve records in order from many different types of DBMS, including Relational databases.
[c] and [d] make the claim that it is impossible to retrieve records efficiently in some desired order using DBMS functionality. This is justified by [e] which claims that the source of this necessary inefficiency is that DBMS functionality is based on "retrieved sets" not "linear ordering".

No, that is not what I meant. Of course you can retrieve records in a given order, and we do all the time. It's about using the headings in the MARC records to establish that order. So here's the question I put to Mac:


***

let's say you have a record with 3 subject headings:

Working class -- France
Working class -- Dwellings -- France
Housing -- France

In a card catalog, these would result in 3 separate cards, and therefore, should you look all through the subject card catalog, you would see the book in question 3 times.

In a keyword search limited to subject headings, most systems would retrieve this record once and display it once. That has to do with how the DBMS resolves from indexes to records. So even though a keyword may appear more than once in a record, the record is only retrieved once.

In your catalog, which displays the subject headings on a line with the author and title:

1) will each of these subject headings appear in the display?

2) does that mean that the bibliographic record (represented by the author and title) will display 3 times in the list of retrievals?


***

I could add to that: if the record had four subject headings:

Working class -- France
Working class -- Dwellings -- France
Housing -- France
Housing -- Europe

Then under what circumstances in your system design would the user see all four subject entries (heading plus bib data) in a single display?

That's part of the question. The card catalog had a separate physical entry for each "entry point" or heading associated with the bibliographic description. Do we have a reasonably efficient way to imitate this behavior using keyword (or keyword in heading, or left-anchored string searching) in an online library catalog? (followed by: is there any reason to do that?)
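
To make the difference concrete, here is a minimal sketch, assuming a hypothetical two-table SQLite schema (the table names, column names, and the author/title values are invented for illustration and do not come from any system discussed in this thread). A substring `LIKE` stands in for a keyword search; it is not meant to be efficient. The first query returns each matching record once, as most keyword searches do; the second returns one row per matching heading, which is closer to one card per entry point.

```python
import sqlite3

# Hypothetical schema: one bibliographic record, four subject headings.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE bib (bib_id INTEGER PRIMARY KEY, author TEXT, title TEXT);
    CREATE TABLE subject_heading (
        bib_id INTEGER REFERENCES bib(bib_id),
        heading TEXT
    );
""")
conn.execute(
    "INSERT INTO bib VALUES (1, 'Hypothetical Author', 'Hypothetical Title')"
)
conn.executemany(
    "INSERT INTO subject_heading VALUES (1, ?)",
    [
        ("Working class -- France",),
        ("Working class -- Dwellings -- France",),
        ("Housing -- France",),
        ("Housing -- Europe",),
    ],
)

# Record-oriented retrieval: the record appears once, however many
# of its headings contain the keyword.
records = conn.execute("""
    SELECT DISTINCT b.bib_id, b.title
    FROM bib b JOIN subject_heading s ON s.bib_id = b.bib_id
    WHERE s.heading LIKE '%france%'
""").fetchall()

# Entry-oriented retrieval: one row per matching heading, each carrying
# the bib data, sorted by heading (plain byte order, not filing rules).
entries = conn.execute("""
    SELECT s.heading, b.author, b.title
    FROM subject_heading s JOIN bib b ON b.bib_id = s.bib_id
    WHERE s.heading LIKE '%france%'
    ORDER BY s.heading
""").fetchall()

print(len(records), "record;", len(entries), "entries")
for heading, author, title in entries:
    print(heading, "/", author, "/", title)
```

Note that "Housing -- Europe" never appears in either result, because it does not contain the keyword. That is the limitation of keyword retrieval that the rest of this message is about.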

But I think another part is the difference between retrieval, in the database sense of the term ("give me all of the records with the word *france* in a subject heading") vs. the kind of alphabetical linear access that the card catalog provided, which allows you to begin at:


France -- United States -- Commerce

and soon arrive at

Frances E. Willard Union (Yakima, Wash.)

I don't think you can get from one to the other in most online catalogs because the set of records that you can see is determined by the search that retrieves only those records with *france* in it.

I've designed a browse in DBMSs using a left-anchored search that retrieves one heading (the first one hit) in a heading index followed by a long series of "get next" commands. Naturally, "next" has to also be next in alphabetical order, so the index you are traversing has to be in alphabetical order. I should say: alphabetical order that is retained even as records are added, modified or deleted. I think this may be more feasible in some DBMSs than others.
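
A rough sketch of that kind of browse, assuming SQLite and a hypothetical `heading_index` table (the table and function names are invented here). The primary key gives an ordered B-tree index that the DBMS keeps sorted as rows are added or deleted, and "get next" is approximated by asking for the headings that sort after the last one shown. The ordering is plain byte order, which is an assumption of the sketch and not the same thing as library filing rules.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The PRIMARY KEY on a TEXT column is backed by an ordered index.
conn.execute("CREATE TABLE heading_index (heading TEXT PRIMARY KEY)")
conn.executemany(
    "INSERT INTO heading_index VALUES (?)",
    [
        ("Working class -- France",),
        ("Hamlet. German.",),
        ("Frances E. Willard Union (Yakima, Wash.)",),
        ("Housing -- France",),
        ("France -- United States -- Commerce",),
        ("Hamlet. French.",),
    ],
)


def browse(conn, start, page_size=2):
    """Left-anchored entry: first headings sorting at or after `start`."""
    rows = conn.execute(
        "SELECT heading FROM heading_index "
        "WHERE heading >= ? ORDER BY heading LIMIT ?",
        (start, page_size),
    ).fetchall()
    return [row[0] for row in rows]


def get_next(conn, last_seen, page_size=2):
    """'Get next': headings sorting strictly after the last one shown."""
    rows = conn.execute(
        "SELECT heading FROM heading_index "
        "WHERE heading > ? ORDER BY heading LIMIT ?",
        (last_seen, page_size),
    ).fetchall()
    return [row[0] for row in rows]


first_page = browse(conn, "France")
second_page = get_next(conn, first_page[-1])
print(first_page)
print(second_page)
```

Because no keyword filter is applied, the browse walks from "France -- United States -- Commerce" to "Frances E. Willard Union (Yakima, Wash.)" and on into the Hamlet headings, regardless of what words they contain.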

However, what is obviously missing here is a display of the bib record that goes with the heading (all of that "ISBD" stuff). It's possible that DBMSs can do this fine today, but in my olden days when I suggested to the DBA that we'd need to "get next," display that heading, then retrieve and display the bibliographic record that went with it, 20 times in order to create a page of display, I practically had to revive the DBA with a bucket of cold water.
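
For comparison, a sketch of how a page of heading-plus-bib entries might be fetched in one query instead of 20 separate "get next, then fetch record" round trips. This assumes SQLite and a hypothetical schema (`bib`, `heading`, and `browse_page` are names invented for this example, as are the author and title values). It says nothing about how the systems of that era would have performed.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE bib (bib_id INTEGER PRIMARY KEY, author TEXT, title TEXT);
    CREATE TABLE heading (
        heading TEXT,
        bib_id INTEGER REFERENCES bib(bib_id)
    );
    -- Ordered index on the heading text, so the browse can walk it.
    CREATE INDEX heading_order ON heading (heading, bib_id);
""")
conn.executemany(
    "INSERT INTO bib VALUES (?, ?, ?)",
    [
        (1, "Hypothetical Author A", "Hypothetical Title A"),
        (2, "Hypothetical Author B", "Hypothetical Title B"),
    ],
)
conn.executemany(
    "INSERT INTO heading VALUES (?, ?)",
    [
        ("Working class -- France", 1),
        ("Working class -- Dwellings -- France", 1),
        ("Housing -- France", 1),
        ("Housing -- Europe", 1),
        ("France -- United States -- Commerce", 2),
    ],
)


def browse_page(conn, start, page_size=20):
    """One page of (heading, author, title) entries in heading order."""
    return conn.execute(
        """
        SELECT h.heading, b.author, b.title
        FROM heading h JOIN bib b ON b.bib_id = h.bib_id
        WHERE h.heading >= ?
        ORDER BY h.heading, h.bib_id
        LIMIT ?
        """,
        (start, page_size),
    ).fetchall()


page = browse_page(conn, "Housing")
for heading, author, title in page:
    print(heading, "/", author, "/", title)
```

In this sketch record 1 shows up four times on the page, once under each of its headings, which is the card-catalog behavior described above.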

Mac's system also cannot take the display from France--US--etc to Frances E. Willard because the headings it has to work with have been retrieved on a keyword search, thus only headings with the term *france* in them are displayed. It also does display non-retrieved headings for that same bibliographic record. It does not do what the card catalog did, which is display every heading from every record in alphabetical order. When the headings have been retrieved on a keyword, the headings that do not have that keyword do not appear in the display.

All that to say that if we are not going to display our records in alphabetical order by their headings, then I'm not sure if creating headings during cataloging makes all that much sense. Or at least, not the kinds of headings that we do create, which are designed to be viewed in alphabetical order. You are supposed to see "Hamlet" before you see


Hamlet. French.
Hamlet. German.
Hamlet. German. 1919

Maybe you don't see "Hamlet" first, but the logic of adding on to the right hand side of the heading implies that the order conveys something to the user that facilitates finding what he is looking for.

Thus, I question the creation of headings that are designed to be encountered in alphabetical order unless we adopt an ordered display around those headings. And if we think it is important to adopt such a display, we need to understand the implications for system design.


I hope this isn't too confusing,


Simon, I hardly know where to begin. :-)

kc

Simon
* In some situations involving multiple tables, some systems may return records in a different order if no specific order is requested. This is due to decisions that the DBMS makes on the fastest way of answering the query. Since not asking for results to be returned in a specific order tells the system that you don't care about ordering, the system may choose to use different algorithms when running your query. This extra freedom to optimize is why the order of results is unspecified by default.
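
A small illustration of the footnote, assuming SQLite and a hypothetical `heading` table (names invented for this example). The query without `ORDER BY` is shown only for contrast; its row order is whatever the system finds convenient and should not be relied on, so the sketch makes no claim about it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE heading (heading TEXT)")
headings = ["Hamlet. German. 1919", "Hamlet. French.", "Hamlet. German."]
conn.executemany("INSERT INTO heading VALUES (?)", [(h,) for h in headings])

# No ORDER BY: the DBMS is free to return rows in any order.
unordered = [r[0] for r in conn.execute("SELECT heading FROM heading")]

# Explicit ORDER BY: the order is part of the request, so it is guaranteed.
ordered = [
    r[0] for r in conn.execute("SELECT heading FROM heading ORDER BY heading")
]
print(ordered)
```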


--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet
