Patrick Galbraith wrote:
Wow,

I turn away from my mail for a day or two and come back to find a deluge of all these passionate emails and fervor. You'd think this was a discussion on religion or politics. I suppose in a sense it is.

The worst thing about writing a book is that I often miss these discussions!

Jim, if you had a magic wand, what would be the best way to store BLOBs -- say, if you were writing a DB app from scratch -- vs. the way it is now implemented (pointer in server record)? What is the "right" way, and how do you deal with all the issues people bring up, such as backup, that you contend are issues only because they were implemented wrongly in the first place?

I sometimes wonder whether, if one's RDBMS is any good, it should eat its own dogfood by using itself for its own information, as is the case with the information schema. The age-old question of whether to store BLOBs in the DB or not seems to fall into this category of philosophical arguments.

Good to hear from you, Patrick.  I understand we'll see you this weekend.

Having gotten into somewhat of a professional rut of writing RDBMS after RDBMS after RDBMS, I faced that question in Rdb/ELN, Interbase, Netfrastructure/Falcon, and now Nimbus. This is how my thinking has evolved.

In Rdb/ELN and Interbase/Firebird, small blobs and large blob headers were intermingled with table rows on data pages, preferably on the same page as the owning row. For large blobs, the header was either a vector of page numbers of the blob pages, or a vector of pointer pages to data pages. The theory was that when the blob was reasonably small, it could be fetched for free.
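The two layouts described above can be sketched roughly as follows. This is a toy illustration, not Interbase's actual on-disk format; all names, and the single-level page vector, are invented for the example.

```python
PAGE_SIZE = 4096

class SmallBlob:
    """Blob small enough to live on the data page next to its row."""
    def __init__(self, data: bytes):
        assert len(data) <= PAGE_SIZE
        self.data = data

class LargeBlobHeader:
    """Header kept on the data page; the payload lives on blob pages."""
    def __init__(self, page_numbers: list):
        # For truly huge blobs this would instead be a vector of
        # pointer pages, each holding page numbers of data pages.
        self.page_numbers = page_numbers

def store_blob(data: bytes, allocate_page):
    """Store a small blob inline; spill a large one to dedicated pages."""
    if len(data) <= PAGE_SIZE:
        return SmallBlob(data)
    pages = []
    for off in range(0, len(data), PAGE_SIZE):
        # allocate_page writes one page of data, returns its page number
        pages.append(allocate_page(data[off:off + PAGE_SIZE]))
    return LargeBlobHeader(pages)
```

The "fetched for free" payoff is visible in the small case: reading the row's page already brought the blob into memory.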

The actual record contained a blob id (a record number) -- essentially a pointer to the blob. When the user requested a blob, he actually got the blob id; the blob data itself was fetched only when referenced. (Rdb and Interbase/Falcon, like Java, have a corner of the API dedicated to blob handling, to enable handling of blobs too large to fit in memory.)
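The blob-id indirection plus a streaming handle might look like the sketch below. The class and method names are invented, not the Rdb or Interbase API; the point is that the row stores only the id, and the blob is consumed a segment at a time rather than as one in-memory value.

```python
class BlobStore:
    """Toy blob store: rows hold blob ids, not blob data."""
    def __init__(self):
        self._blobs = {}   # blob id -> bytes
        self._next_id = 1

    def create(self, data: bytes) -> int:
        blob_id = self._next_id
        self._next_id += 1
        self._blobs[blob_id] = data
        return blob_id     # this id is what the record actually contains

    def open(self, blob_id: int, segment_size: int = 4096):
        """Yield the blob one segment at a time, like a blob handle,
        so it never has to fit in memory all at once."""
        data = self._blobs[blob_id]
        for off in range(0, len(data), segment_size):
            yield data[off:off + segment_size]
```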

The counter argument is that intermingling blobs reduced the density of rows on a page, slowing down large scans that were unlikely to touch the blobs at all. Netfrastructure (from which Falcon derived) was first and foremost a Web application platform in which blobs figured prominently as jpegs, Java classes, Word documents, and the like. On the other hand, there were other mechanisms that came into play:

   * Images (jpegs, pngs, etc.) and static downloadable things (PDFs)
     were fetched and cached by the Web server module.
   * Things like Word documents (used by humans to manage content)
     needed to be kept, but were translated to wholesome HTML on upload
   * Java classes were referenced at most once per server instance

This translated into many frequently referenced tables containing blobs that were themselves infrequently referenced. A good example worth looking at is the Images table. During upload, images were extracted from documents, stored in the Images table, and each image replaced with a reference to the Images table. During page generation, an image reference was replaced with a file reference to the image on the Web server, and the image name appended to a custom HTTP header. The Web server module would run through the header, determine which images/downloads were already resident on the Web server, and do a database query to fetch the ones that weren't. This guaranteed that the Web server would fetch an image blob at most once. On the other hand, it meant that the page generator made more than a few references to the Images table but never actually referenced the image blob itself. A little complex, perhaps, but it worked like a charm. Image names were assigned by a trigger (a popular drizzle non-feature) from a sequence.
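The Web server module's side of that protocol can be sketched as below. This is an invented illustration of the mechanism described above (the function name, header format, and cache shape are all assumptions): scan the custom header for image names and query the database only for images not already resident.

```python
def sync_images(header_value: str, cache: dict, fetch_from_db) -> int:
    """Ensure every image named in the custom header is resident on
    the Web server; return how many had to be fetched. Because the
    cache is checked first, each image blob is fetched at most once."""
    fetched = 0
    for name in filter(None, header_value.split(",")):
        if name not in cache:
            cache[name] = fetch_from_db(name)  # the one database query
            fetched += 1
    return fetched
```

Note that the page generator never touches the blob: it only emits names into the header, and the module on the other end does the (rare) blob reads.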

So Netfrastructure and Falcon have two independent data sections per table -- one for rows and one for blobs. They tend to be co-linear by happenstance, but nothing goes out of its way to ensure that. From an on-disk structure perspective, the row and blob data sections were identical, though Falcon uses different code paths to achieve critical de-optimization (oh, well).

I should probably note that all of the above systems -- Rdb/ELN, Interbase, Firebird, Netfrastructure, and Falcon -- are careful during update operations to leave unmodified blobs in place. Only when a record or record version is garbage collected are unloved blobs deleted. Specifically, this means that multiple record versions can (and almost always do) point to a single blob.
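That update discipline reduces to a simple invariant, sketched here with invented names: record versions share blob pointers, and a blob is deleted only when no surviving version references it.

```python
def collect_blobs(versions: list, blobs: dict) -> None:
    """Garbage-collect blobs referenced by no remaining record version.
    Each version is a dict with a "blob_id" key (or None); blobs maps
    blob id -> blob data. Updates never copy blobs, so several versions
    typically name the same id."""
    live = {v["blob_id"] for v in versions if v["blob_id"] is not None}
    for blob_id in list(blobs):
        if blob_id not in live:
            del blobs[blob_id]  # an "unloved" blob, finally deleted
```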

Nimbus retains the Falcon philosophy of separate data spaces for rows and blobs. Unlike its precursors, however, the two types of data section are handled differently. The primary reason is that records have versions, a format version number, and subtypes (Nimbus implements the semantic data model). Blobs have none of these and benefit from a simpler structure.

I'd be happy to go on at even greater length at the conference this weekend, particularly if someone else is buying. Paul, do you remember who picked up the last round at the UC?

I think it's sad that drizzle is opting out of the Web, but database guys have been rather thick from the beginning. It's just so hard to think past the card reader...

029 anyone?

--
Jim Starkey
President, NimbusDB, Inc.
978 526-1376

_______________________________________________
Mailing list: https://launchpad.net/~drizzle-discuss
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~drizzle-discuss
More help   : https://help.launchpad.net/ListHelp