Hi Team,

We have built a CDC pipeline with Apache Hudi and Debezium. It
works very well in production.

However, we have an in-house Ambari cluster with a Hive metastore for all
ETL purposes, and Athena for all analytics purposes. To make the Hudi
tables work in Athena, we preserve only the latest version and create the
table in Parquet format.

Right now the Hive metastore gets updated by Hudi itself. But to keep the
Athena metastore in sync, we have written a separate script. That does not
look like the right approach, as only the affected partitions need to be
updated on the Athena side.
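
To illustrate the kind of incremental sync meant here, below is a minimal
Python sketch, not our actual script. It assumes the list of partitions
touched by a commit is already available as Hive-style relative paths
(for example "dt=2021-01-01"), and it only builds the Athena DDL text;
submitting it to Athena is left out. The function names, table name, and
S3 location are hypothetical.

```python
from typing import Dict, List


def parse_partition_path(path: str) -> Dict[str, str]:
    """Split a Hive-style path such as 'dt=2021-01-01/hour=05' into {column: value}."""
    spec: Dict[str, str] = {}
    for segment in path.strip("/").split("/"):
        key, sep, value = segment.partition("=")
        if not sep or not key:
            raise ValueError(f"not a Hive-style partition segment: {segment!r}")
        spec[key] = value
    return spec


def build_add_partition_ddl(table: str, base_location: str,
                            partition_paths: List[str]) -> str:
    """Build one ALTER TABLE ... ADD IF NOT EXISTS statement for the given partitions.

    Only the partitions passed in are registered, so the rest of the table
    is not rescanned. All partition values are quoted as strings (assumption).
    """
    if not partition_paths:
        raise ValueError("no affected partitions to sync")
    clauses = []
    # De-duplicate and sort so the generated statement is deterministic.
    for path in sorted(set(partition_paths)):
        spec = parse_partition_path(path)
        cols = ", ".join(f"{k} = '{v}'" for k, v in spec.items())
        location = f"{base_location.rstrip('/')}/{path.strip('/')}/"
        clauses.append(f"PARTITION ({cols}) LOCATION '{location}'")
    return f"ALTER TABLE {table} ADD IF NOT EXISTS\n  " + "\n  ".join(clauses)


if __name__ == "__main__":
    # Hypothetical table and bucket names, for illustration only.
    print(build_add_partition_ddl(
        "analytics.orders",
        "s3://example-bucket/orders",
        ["dt=2021-01-02", "dt=2021-01-01", "dt=2021-01-01"],
    ))
```

The resulting statement could then be run against Athena once per commit,
so that only the partitions written by that commit are registered.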

Please suggest the right approach here.

            Thanks and Regards,
        S SYED ABDUL KATHER
