In a nutshell, what are the essential differences between PyTables (a Python
wrapper around HDF5), HBase (built on Hadoop) and Google's BigTable?
Dinesh
-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference
Don
>
> Well, I think that the HDF5 case is similar than the NetCDF for this
> scenario: if you need to efficiently retrieve measurements that are
> near in time, the best would be to save them in that order. However, in
> order to take advantage of this (disk-sorted) arrangement, you will
> need to
A Monday 12 May 2008, Nick Bower escrigué:
> > 2. Has anyone contended with managing schema differences between
> > sources? In other words, if I have say 500 loggers, each logging
> > slightly different schemas (ie 100 different columns and so
> > different table definitions), the suggested Pytabl
A Monday 12 May 2008, Nick Bower escrigué:
> Hi - just investigating pytables from storing data from many
> distributed remote logging stations, each logging about 100 channels
> at 1 second frequency (a fair bit).
>
> My questions;
>
> 1. How does one handle ordering timeseries data within a table
A Sunday 11 May 2008, David Wilson escrigué:
> Here is more detail on the error (*TypeError: invalid type for
> ``rating`` column: *
> ) I get. Thanks for your help:
>
> ##
> dcw$ python test1-insert.py
>
> movie3.h5 (File) ''
> Last modif.: 'Sat May 10 14:25:08 200
A Sunday 11 May 2008, Francesc Alted escrigué:
> A Sunday 11 May 2008, Ivan Vilata i Balaguer escrigué:
> > Dinesh B Vadhia (el 2008-05-10 a les 10:10:29 -0700) va dir::
> > > I'm using the OS filesystem to store 32,000 images files. I'm
> > > now going to move them into a datastore and the choice
> 2. Has anyone contended with managing schema differences between sources?
> In other words, if I have say 500 loggers, each logging slightly different
> schemas (ie 100 different columns and so different table definitions), the
> suggested Pytables way of binding static data definitions for each