Re: Persistent Objects Using SQL

Stevan Little Sun, 30 May 2010 14:15:24 -0700


On May 30, 2010, at 2:34 AM, Darren Duncan wrote:

Stevan Little wrote:
On May 29, 2010, at 11:20 PM, Darren Duncan wrote:
2. Besides the ability to introspect or perform powerful searcheson your objects using SQL/etc, I see another big advantage ofusing database storage without serialization as portability. Youcan have applications written in different programming languagessharing the same database and the same objects, because they don'tcontain Perl-specific data formats.
KiokuDB mostly uses JSON and JSPON as the storage format, which isnot Perl specific. The serialization format we store in isdependent on the Moose class definition, so in that way it is notterribly portable.
An advantage of not using serialization like JSON, but ratherstoring each object attribute as a database member attribute, isthat the DBMS itself can then most easily be defined to enforce theconsistency of your objects, so someone accessing the database bysome way other than KiokuDB, or using a buggy version of KiokuDB, isless likely to be able to corrupt the data. As for how to get thedatabase to do that, one general answer is CHECK constraints, thoughthat is a fallback to where terser/simpler kinds of constraintsdon't do the job.

I think you misunderstand, KiokuDB is *not* just a JSON serializationservice, it breaks up the object graph on a per-instance basis andstores each instance separately. It uses JSPON as a way to handlereferences from one object to another.

It is not like MongoDB which stores the entire "document" at once andhas no (built in) way to refer from one "document" to another. Infact, MongoDB is more like a traditional relational DB in that allit's relationships are stored implicitly and laid out by the user.KiokuDB on the other hand stores all the relations as explicit andresolves them for you when they are extracted.

I think perhaps you need to take a much closer look at KiokuDB becauseI suspect you have not done so and so are pointing out issues that youperceive it to have, but in fact, it does not.

A relational database can map to an object structure of anylanguage fairly easily. Add attributes/columns for mutuallyheterogeneous data, like when you would add object attributes, andadd tuples/rows for mutually homogeneous data, like when you woulduse arrays or sets.
And then you get the impedance mismatch. You are ignoringinheritance, which is not really possible in a relational model.
I wasn't ignoring inheritance, but rather was just being terse bygiving examples rather than every relevant detail.


Fair enough.

As for inheritance, a relational model can handle that just fine.
You also have several options for how to lay it out, depending onwhat you're going for.
One option in the general case is to have a distinct database relvar/table per each instantiatable class, which has one attribute/columnper class attribute, plus an extra attribute/column to hold an IDvalue for the object. When a class composes a role or inherits froma class, the attributes defined in the others plus those defineddirectly in the first class would each have a correspondingattribute in the relvar/table attribute/column, so that eachattribute of the object of that class has a place to be stored. Andso, when multiple classes compose the same attributes, theircorresponding relvars/tables all have common-named/typed attributes/columns corresponding to said.
Another option in the general case is to also have a database relvar/table for each role or non-instantiatable class as well, which isthen the only one having the attributes/columns that thecorresponding declares, and then the relvars/tables mentioned in theprevious paragraph then wouldn't have these but instead would havematching ID values to relate records in them to ones in the others.
Generally speaking, with the exception perhaps of Moose classeswhere every single object can have different names or kinds etc ofattributes, rather than those being class-defined, I would think thebest design is for the database to have exactly the same granularityof component data as the Moose objects do. Just where each objectcan have different attributes, then the database could probably bedesigned like a key-value store, but that's less ideal.

Yes, sorry allow me to clarify, the relational model does not doinheritance easily or cleanly. What you describe above (a key-valuestore) no longer has as much value as a relational DB. The othercommon inheritance mappings all have equally bad tradeoffs involved.The result is you either have a sub-standard DB schema or a non-idealobject graph.

It also does not deal well with polymorphism since the ID (theobject's identity) is essentially fixed to a table (usually mapped toa class). But this is a well worn topic and there is no need to beatthis dead horse one more time.

One should think about the database schema like they think abouttheir code. It is just as reasonable to change the schema as it isto change what classes you have or what attributes they have. Theschema *is* code, and the data it holds is like objects of classes.No more, and no less.

That is a very DB centric viewpoint and I disagree with youcompletely. Changing a schema during development is one thing,changing it after deployment, after you have started to collect data,etc. is another thing entirely and very much a non-trivial task.

Remember, objects are graphs not sets of tuples.
And graphs can be represented as sets of tuples, such as wheretuples have 2 attributes that name connected graph nodes. For thatmatter, objects only *represent* graphs themselves.

Sure, but your relationships are implicit and require outside-the-DBprogramming to make them real.


( 1, 'Foo', 2 )
( 2, 'Bar', 1 )

These two tuples create a cyclical graph, but the RDBMS doesn't helpme reconstruct that graph, that I must do in my code. KiokuDB storesthings as graphs and when you extract them, you get the graphs back.

I mean, in reality we really don't need anything more then functionswith a single argument to write code. Things like numeric literals,conditionals, loop constructs and local variables are not reallynecessary. But why would you want to use Church Numerals to do mathand limit yourself to pure lambda calculus?

Now, all that I've had to say here isn't meant to diminish that theJSON serialization approach is useful and probably a best fit formany usage scenarios.

Again, look more closely at KiokuDB and take a look at JSPON. KiokuDBis not just a dumb JSON store, instead it *uses* JSON/JSPON as astorage format, that is all.

But at the same time, relational databases are very powerful andtheir strengths, of ensuring that data is consistent and making iteasier to search, should be utilized, where it makes sense to doso. Using a relational database, without exploiting the featuresthat make them uniquely powerful, is like wasting the tools you have.

Well, yes I agree, but while we might be not (by default) be utilizingthe full power of the RDBMS, we are not throwing the whole thing out.Again, take a look at what is actually done then we can have a realconversation about it.

Perhaps a reasonable analogy is people who use Perl 5 but writetheir Perl code as if they were using Perl 4, and were fakingreferences rather than using real references, structures, andobjects. Sometimes I think of that when I hear of people justdumping objects as a serialized string in a relational database.


No, that is not a reasonable analogy.

KiokuDB does not fake references at all, it stores real true explicitreferences, not implicit tuple relationships, I think you have itbackwards.

Sometimes a relational DB is not the right tool for the job. Sometimesthe data structures being used wouldn't make sense if they wereflattened into a relational model. Sometimes there is no real businessvalue in having the data in a relational model and there is only valuehaving said data in object form.

Using a RDBMS for storing serialized objects using KiokuDB allows usto take full advantage of the reliability, flexibility and scalabilityof a RDBMS platform without being stunted by the impedance mismatchthat would typically come with a more traditional ORM tool.


- Stevan

Re: Persistent Objects Using SQL

Reply via email to