Hi Jim,

I can say that Accumulo will work on Azure's blob store and their data
lake store. These are a result of testing I'm involved with at
Hortonworks (dayjob). I know that these filesystems are tested to an
appropriate degree, proving that they do provide the things that
Accumulo needs.

As a refresher, the things we need from a filesystem are: performance
(Accumulo's write performance is pretty dominated by I/O) and
durability guarantees (when we call sync() on a file, the data we just
wrote better be there).

For WebHDFS, I think you would both hurt for performance and I would
be surprised if it actually provided the durability correctness. My
understanding is that WebHDFS is more meant to allow non-Java clients
easy access to HDFS (as a one-off) rather than act as a fully-fledged
access layer.

- Josh

On Fri, Apr 14, 2017 at 10:16 AM, James Hughes <[email protected]> wrote:
> Hi all,
>
> I know folks have asked about Accumulo on S3 before (1).
>
> Has anyone tried running Accumulo on Azure's blob storage or data lake
> solutions (2)?  (Or perhaps more generally, has anyone tried Accumulo on
> WebHDFS?)
>
> As more background, I have deployed Accumulo on HDP clouds in Azure, and
> that works great.  I'm interested in using the blob / data lake storage for
> benefits with scaling, etc.
>
> Thanks in advance,
>
> Jim
>
> 1.  http://apache-accumulo.1065345.n5.nabble.com/Accumulo-on-s3-td16737.html
> 2.
> https://docs.microsoft.com/en-us/azure/data-lake-store/data-lake-store-integrate-with-other-services

Reply via email to