AFAIK in Accumulo we can use "locality groups" to group sets of columns together on disk which would make it more like a column-oriented database. Considering that "locality groups" are per column family, I was wondering what if we treat column families like column qualifiers (creating one column family per each qualifier) and assigning each to a different locality group. This way all the data in a given column will be next to each other on disk which makes it easier for analytical applications to query the data.
Any thoughts? Thanks, Mohammad
