Using Sqoop, I’ve successfully imported a few rows from a table that has a BLOB column. As described in the Sqoop documentation, the import created a ‘_lobs’ directory containing files such as large_obj_attempt_201503141229_83736_m_000004_00.lob for *some* of the rows.
Questions: 1) According to the documentation, only BLOBs larger than 16 MB are written to this directory, correct? 2) How do I know which row a given file belongs to? In short, how do I use the LOB files that were created: with Hive, Pig, or native MapReduce? Any sample code would be greatly appreciated.
