Re: Accumulo as a Column Storage

Josh Elser Thu, 19 Oct 2017 15:06:51 -0700

Yup, that's the intended use case. You have the flexibility to determinewhat column families make sense to group together. Your only "cost" inchanging your mind is the speed at which you can re-compact your data.

There is one concern which comes to mind. Though making many localitygroups does increase the speed at which you can read from specificcolumns, it decreases the speed at which you can read from _all_columns. So, you can do this trick to make Accumulo act more like acolumnar database, but beware that you're going to have an impact if youstill have a use-case where you read more than just one or two columnsat a time.


Does that make sense?

On 10/19/17 5:50 PM, Mohammad Kargar wrote:

AFAIK in Accumulo we can use "locality groups" to group sets of columnstogether on disk which would make it more like a column-orienteddatabase. Considering that "locality groups" are per column family, Iwas wondering what if we treat column families like column qualifiers(creating one column family per each qualifier) and assigning each to adifferent locality group. This way all the data in a given column willbe next to each other on disk which makes it easier for analyticalapplications to query the data.
Any thoughts?

Thanks,
Mohammad

Re: Accumulo as a Column Storage

Reply via email to