Re: [The Java Posse] Re: java server app of the week suggesion - Cassandra

Alan Kent Tue, 02 Mar 2010 15:43:24 -0800

Viktor Klang wrote:

My not so extensive experience has told me that it depends on the kindof schema you're building.For something like a Twitter-clone you probably won't run into thisunless you've done some bad planning,
but I definitely would agree with you that it could be(come) a big issue.
I'd love to hear if someone's had problems with this and what theirdomain/use was.


Regarding schema evolution...

The good old (err, ancient!) Z39.50 protocol used by libraries fordistributed search is actually quite good for this. Lots of sillythings in the protocol specification itself, but the semantic model isquite good. There are abstraction layers between the physicalrepresentation, what you query on, and what you retrieve (and more).The abstraction of the query model for example allows you to send thesame query to different collections even if the schema was notidentical. They just had to support the same query fields used by thequery to evaluate the query. Actually, its even more general than that- you can set it up to return zero matches for unknown query fieldsrather than aborting with an error. This allowed introduction of newerversions of database schemas with backwards compatibility to oldapplications. (I am simplifying a bit here!)

We still use Z39.50 today in the non-SQL database system we develop atwork (TeraText.com). We have customers who want to log a continuousflow of arriving information (e.g. syslog messages), retiring off oldcontent. E.g. create a new database each week and keep the last 26databases around for 6 months of historical data. Then query across theappropriate subset of databases to find results. Z39.50 makes it easyto introduce schema changes into next week's database while still beingable to search across all the older databases as well. (Obviously onlythe new database would find matches on searches specifying newer queryfields.)

Schema changes are typically not frequent, but when some new query comesalong that the customer wants to be able to do, the ability to introducenew fields is very useful - especially if its a high volume of content.Rebuilding all the old databases to retrospectively add new indexes cantake a long time and would potentially take the service off line, makingit not so desirable.


Alan

--
You received this message because you are subscribed to the Google Groups "The Java 
Posse" group.
To post to this group, send email to javapo...@googlegroups.com.
To unsubscribe from this group, send email to 
javaposse+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/javaposse?hl=en.

Re: [The Java Posse] Re: java server app of the week suggesion - Cassandra

Reply via email to