Na Li created SENTRY-1883:
-----------------------------
Summary: Optimizing Sentry to HDFS protocol
Key: SENTRY-1883
URL: https://issues.apache.org/jira/browse/SENTRY-1883
Project: Sentry
Issue Type: Improvement
Reporter: Na Li
Priority: Minor
Currently Sentry uses serialized Thrift structures to send a lot of
information from the Sentry Server to the HDFS namenode plugin for the HDFS
sync.
We should think of ways to optimize this protocol in several ways:
- Rather then streaming huge snapshots in a single message we should
provide streaming protocol with smaller messages and later reassembly on
the HDFS side.
- Most of the information passed are long strings with common prefixes.
We should be able to apply simple compression techniques (e.g. prefix
compression) or even run a full compression on the data before sending.
- We should consider using non-thrift data structures for passing the
info and just use Thrift as a transport mechanism.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)