With a distributed data store, the data would become a logical
object - no adding or removal of machines would affect the data.
This is an ideal that would remove a tremendous maintenance
burden from many sites ---- well, at least the one's I have worked
at as far as I can see.

Two things:

1) Hadoop is the wrong technology. It's not designed to support transactional operations.

2) Transactional operations are, in general, your Big Obstacle for doing anything in the way of a distributed storage manager.

It's possible you could make both of the above "go away" if you were planning for a DW platform in which transactions weren't important. However, that would have to become an incompatible fork of PostgreSQL.

AFAIK, the Yahoo platform does not involve Hadoop at all.

--Josh


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Reply via email to