Hi, I wanted to use Hbase Table as the source for my Hadoop Streaming MapReduce Jobs. However, the executable script (in Python) I am writing can only read data from STDIN. I found out that I need to use a custom Input format class (returning key/value pairs of Text class) if I want to integrate HBase with Hadoop Streaming.
Is using the custom input format class only way for this use case or there is any other alternative ??? Prakhar