Re: [Discuss] State Backend use external HBase storage

sjk Thu, 17 Nov 2016 02:24:13 -0800

Hi, Chen Qin
I fount this issue. Does it kicked off?  What’s the current progress?
https://issues.apache.org/jira/browse/FLINK-4266 
<https://issues.apache.org/jira/browse/FLINK-4266>


> On Nov 16, 2016, at 19:35, Till Rohrmann <trohrm...@apache.org> wrote:
> 
> Hi Jinkui,
> 
> the file system state backend and the RocksDB state backend can be
> configured (and usually should be) such that they store their checkpoint
> data on a reliable storage system such as HDFS. Then you also have the
> reliability guarantees.
> 
> Of course, one can start adding more state backends to Flink. At some point
> in time there was the idea to write a Cassandra backed state backend [1],
> for example. Similarly, one could think about a HBase backed state backend.
> 
> [1]
> http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Cassandra-statebackend-td12690.html
> 
> 
> Cheers,
> Till
> 
> On Wed, Nov 16, 2016 at 3:10 AM, shijinkui <shijin...@huawei.com> wrote:
> 
>> Hi, All
>> 
>> At present flink have three state backend: memory, file system, rocksdb.
>> MemoryStateBackend will tansform the snapshot to jobManager, 5MB limited
>> default. Even setting it bigger, that not suitable for very big state
>> storage.
>> HDFS can meet the reliability guarantee, but It's slow. File System and
>> RocksDB are fast, but they are have no reliability guarantee.
>> Three state backend all have no reliability guarantee.
>> 
>> Can we have a Hbase state backend, providing reliability guarantee of
>> state snapshot?
>> For user, only new a HbaseStateBackend object, provide hbase parameter and
>> optimization configure.
>> Maybe Hbase or other distributed key-value storage is heavyweight storage,
>> we only use hbase client to read/write asynchronously.
>> 
>> -Jinkui Shi
>>

Re: [Discuss] State Backend use external HBase storage

Reply via email to