Hi,

To let multiple users share a single Hive instance, we know that we
should use a standalone metastore service. But what about the CLI (and
other clients) and the hiveserver services? What is the best practice
for the server layout of a production Hive instance?

1. I think the Hive metastore, hwi, and hiveserver services are all
Hadoop clients, so they should run on servers that are not part of the
Hadoop cluster. We should prepare a dedicated server for them, or one
server per service, depending on the workload.
2. For CLI users, because the CLI has an embedded hiveserver that can
connect to the metastore service directly, we can install the Hive CLI
on their workstations, along with the same Hadoop/Hive binaries and
configuration files.
3. JDBC and ODBC clients must connect to a hiveserver, which can only
handle one query at a time, so we must start one hiveserver service
for each client. Only the JDBC/ODBC driver is needed on the client; no
Hive or Hadoop binaries are needed there.
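
To make point 3 concrete, here is a rough sketch of what a JDBC client
would look like under that layout. It is only an illustration, not a
tested setup: I am assuming the driver class
org.apache.hadoop.hive.jdbc.HiveDriver and the jdbc:hive:// URL scheme,
and the host name hive-gw.example.com, the base port 10000, and the
"one port per client" numbering are placeholders I made up.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

class HiveJdbcSketch {
    // Assumption: driver class shipped with the hiveserver JDBC driver.
    static final String DRIVER = "org.apache.hadoop.hive.jdbc.HiveDriver";
    // Assumption: the first hiveserver listens here, the next on +1, and so on.
    static final int BASE_PORT = 10000;

    // Hypothetical helper: each client gets its own hiveserver port.
    static int portForClient(int clientIndex) {
        return BASE_PORT + clientIndex;
    }

    // Builds the JDBC URL for one hiveserver instance.
    static String hiveServerUrl(String host, int port, String database) {
        return "jdbc:hive://" + host + ":" + port + "/" + database;
    }

    public static void main(String[] args) throws Exception {
        // Only the JDBC driver jars are needed on the client classpath.
        Class.forName(DRIVER);
        String url = hiveServerUrl("hive-gw.example.com", portForClient(0), "default");
        try (Connection conn = DriverManager.getConnection(url, "", "");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SHOW TABLES")) {
            while (rs.next()) {
                System.out.println(rs.getString(1));
            }
        }
    }
}
```

With this numbering, the second client would use portForClient(1), so
each client talks to its own hiveserver process.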

Am I missing anything?

Regards,

Xiaobo Gu
