[ANNOUNCE] Apache 1.6.1 Released

2014-10-03 Thread Corey Nolet
The Apache Accumulo project is happy to announce its 1.6.1 release.

Version 1.6.1 is the most recent bug-fix release in its 1.6.x release line.
This version includes numerous bug fixes and performance improvements over
previous versions. Existing users of 1.6.x are encouraged to upgrade to
this version. Users new to Accumulo are encouraged to start with this
version as well.

The Apache Accumulo sorted, distributed key/value store is a robust,
scalable, high performance data storage system that features cell-based
access control and customizable server-side processing.  It is based on
Google's BigTable design and is built on top of Apache Hadoop, Apache
Zookeeper, and Apache Thrift.

The release is available at http://accumulo.apache.org/downloads/ and
release notes at http://accumulo.apache.org/release_notes/1.6.1.html.


Thanks.

- The Apache Accumulo Team


Fwd: [ANNOUNCE] Apache 1.6.1 Released

2014-10-03 Thread Josh Elser

+ user@a.a.o

 Original Message 
Subject: [ANNOUNCE] Apache 1.6.1 Released
Date: Fri, 3 Oct 2014 13:59:24 -0400
From: Corey Nolet 
Reply-To: d...@accumulo.apache.org
To: annou...@apache.org, "d...@accumulo.apache.org" 

The Apache Accumulo project is happy to announce its 1.6.1 release.

Version 1.6.1 is the most recent bug-fix release in its 1.6.x release line.
This version includes numerous bug fixes and performance improvements over
previous versions. Existing users of 1.6.x are encouraged to upgrade to
this version. Users new to Accumulo are encouraged to start with this
version as well.

The Apache Accumulo sorted, distributed key/value store is a robust,
scalable, high performance data storage system that features cell-based
access control and customizable server-side processing.  It is based on
Google's BigTable design and is built on top of Apache Hadoop, Apache
Zookeeper, and Apache Thrift.

The release is available at http://accumulo.apache.org/downloads/ and
release notes at http://accumulo.apache.org/release_notes/1.6.1.html.


Thanks.

- The Apache Accumulo Team



Determining tablets assigned to table splits, and the number of rows in each tablet

2014-10-03 Thread Dylan Hutchison
This is for Accumulo 1.6.  Suppose we have the table splits

c

g

w


Does anyone know how to determine

   1. *the number of tablets assigned to each table split range?  *
   For this example, this is the number of tablets in the ranges (-Inf,c),
   (c,g), (g,w), (w,Inf).  Or is the design 1-1, that is, for each table split
   range there is exactly one tablet?
   2. *the number of rows inside all the tablets occupying a table split
   range?  *
   For this example, this is the total number of rows among all tablets in
   the ranges (-Inf,c), (c,g), (g,w), (w,Inf).

We use this count to verify how well manually set table splits are load
balancing in the tables.

Some context: I wrote functions that found these numbers two years ago
working on D4M in Accumulo 1.5.  I took the dark route of using non-public
Accumulo API to get TabletServer information, get TabletStats information,
and find the matchings to a table's splits by scanning the extents listed
in the METATABLE.  I can share the code if anyone is curious.  It's not
pretty, but it did the job.

Moving forward as we aim to upgrade to Accumulo 1.6, we should determine
the tablet split information the right way, not by reverse engineering
Accumulo.  Any suggestions?

Thanks,
Dylan Hutchison

-- 
www.cs.stevens.edu/~dhutchis