I have experience with HDInsight (it used to be a Hortonworks Hadoop PaaS; now
Azure has its own distro with the same look and feel).  Just a quick
background: HDInsight offers workload-based cluster offerings - Spark,
Hadoop (MR), Kafka, Hive LLAP, HBase, etc.  It has disaggregated compute and
storage: it leverages configurable cloud-native storage (Azure Blob Storage
or Azure Data Lake Store) plus virtual machines.  The exceptions are Kafka,
which uses network-attached disks for storage, and the HBase WAL, which is
on attached disks.  The secure flavor (Kerberized, and the only multi-user
flavor) leverages a PaaS service called Azure Active Directory Domain
Services (Active Directory under the hood).

The product team, if you reached out to them, would discourage running
Accumulo on HDInsight, and installing it anyway would affect support and
SLAs.  I would reach out to the Azure cloud solution architect that
Microsoft has assigned to the customer and get an email back from the
product team to share with the customer, if the customer needs it.

I would go with VMs and, if they are using the secure version (Enterprise
Security Package), I would domain-join those VMs to Azure Active Directory
Domain Services.

Anagha Khanolkar


On Wed, Dec 23, 2020 at 12:23 PM Christopher <[email protected]> wrote:

> I have not had experience with HDInsight. My first thoughts are that if it
> provides Hadoop and ZooKeeper for you, then that's a few less things to
> worry about from a maintenance perspective for your Accumulo cluster. On
> the other hand, if you can't run Accumulo nodes colocated with Hadoop
> DataNodes, then I wonder if you're losing some performance due to lack of
> data locality (on top of any performance hit from being in a virtual
> environment).
>
> On Wed, Dec 23, 2020 at 12:19 PM Roberts, Geoffry [USA] <
> [email protected]> wrote:
>
>> All,
>>
>> A quick question on something I’ve never tried before:
>>
>> Does anyone have any experience with setting up Accumulo with HDInsight?
>> Can it be done?  Or am I better off just using a few Linux VMs, which is my
>> first inclination and definitely my comfort zone?
>>
>> The employer has me on MS Azure.  I am setting up an Accumulo cluster
>> there.  I notice Az offers a Hadoop thing called HDInsight.  I looked into
>> possibly using it—it has Zookeeper—for the H & Z part of my installation
>> but as yet I don’t see how to bring Accumulo into the picture.
>>
>> Any thoughts are appreciated.
>>
>
