Ismaël Mejía created BEAM-9554: ---------------------------------- Summary: Improve connection reuse on HBaseIO.ReadAll Key: BEAM-9554 URL: https://issues.apache.org/jira/browse/BEAM-9554 Project: Beam Issue Type: Improvement Components: io-java-hbase Reporter: Ismaël Mejía Assignee: Ismaël Mejía
The recent refactor of HBase.ReadAll in BEAM-9279 creates new connections in the @ProcessElement method (once per element), in the case that a pipeline is used on streaming mode this could be costly so we should find a way to cache and reuse connections to avoid both slow start of reads and saturating the clusters. Notice that this is an ongoing issue for DoFn based IOs that manifested first on Writes for JdbcIO BEAM-7230 and was recently discussed too in the context of the CassandraIO refactor: https://github.com/apache/beam/pull/10546#issuecomment-580619044 -- This message was sent by Atlassian Jira (v8.3.4#803005)