Ismaël Mejía created BEAM-9554:
----------------------------------

             Summary: Improve connection reuse on HBaseIO.ReadAll
                 Key: BEAM-9554
                 URL: https://issues.apache.org/jira/browse/BEAM-9554
             Project: Beam
          Issue Type: Improvement
          Components: io-java-hbase
            Reporter: Ismaël Mejía
            Assignee: Ismaël Mejía


The recent refactor of HBaseIO.ReadAll in BEAM-9279 creates a new connection in 
the @ProcessElement method, that is, once per element. For a pipeline running in 
streaming mode this could be costly, so we should find a way to cache and reuse 
connections to avoid both slow read start-up and saturating the clusters.
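
One possible shape for such a cache is sketched below, only to make the idea 
concrete. It is not the approach taken in Beam: SharedConnectionCache and all of 
its methods are hypothetical names, and a plain AutoCloseable stands in for the 
HBase Connection so that the sketch has no dependencies. The cache hands out one 
shared connection per key and closes it when the last holder releases it.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

/** Hypothetical reference-counted connection cache; not part of Beam or HBase. */
final class SharedConnectionCache<K, C extends AutoCloseable> {

  /** A cached connection plus the number of callers currently holding it. */
  private static final class Entry<C> {
    final C connection;
    int refCount;

    Entry(C connection) {
      this.connection = connection;
    }
  }

  private final Map<K, Entry<C>> entries = new HashMap<>();

  /** Returns the connection cached under key, creating it on first use. */
  public synchronized C acquire(K key, Supplier<C> factory) {
    Entry<C> entry = entries.get(key);
    if (entry == null) {
      entry = new Entry<>(factory.get());
      entries.put(key, entry);
    }
    entry.refCount++;
    return entry.connection;
  }

  /** Drops one reference; closes the connection once nobody holds it. */
  public synchronized void release(K key) {
    Entry<C> entry = entries.get(key);
    if (entry == null) {
      return; // Unknown key: nothing to release.
    }
    entry.refCount--;
    if (entry.refCount == 0) {
      entries.remove(key);
      try {
        entry.connection.close();
      } catch (Exception e) {
        throw new IllegalStateException("Failed to close connection for " + key, e);
      }
    }
  }

  /** Number of distinct connections currently open. */
  public synchronized int size() {
    return entries.size();
  }
}
```

Under these assumptions, a DoFn could hold such a cache in a static field, call 
acquire from its @Setup method and release from its @Teardown method, and key the 
cache on something that identifies the target cluster, so that @ProcessElement 
only ever uses a connection that is already open.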

Note that this is an ongoing issue for DoFn-based IOs. It first manifested on 
writes in JdbcIO (BEAM-7230) and was also discussed recently in the context of 
the CassandraIO refactor: 
https://github.com/apache/beam/pull/10546#issuecomment-580619044



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
