Hi, I have recently been working on an HDFS system consumer for Samza. The work has two major parts: (1) properly partitioning an HDFS directory and (2) consuming from HDFS files.
I have attached the design doc to the Jira ticket (https://issues.apache.org/jira/browse/SAMZA-967). It would be great to hear your feedback. Thanks, Hai