Hi all, This is the[1] hdfs connector for hazelcast jet. Here the problem is hazelcast jet is using `org.apache.hadoop.mapred` package and gora is using `org.apache.hadoop.mapreduce` package.
These are two APIs exposed by hadoop to create hadoop jobs and etc. Eventhough these two are similar in functionality, both the packages are shiped with hadoop. See this answer[2], So I thought of writing an InputFormat for gora with the support of older package(`org.apache.hadoop.mapred`). Please correct me if I'm going in the wrong direction. Your Feedback is appreciated. [1] https://docs.hazelcast.org/docs/jet/0.7/manual/#hdfs [2] https://stackoverflow.com/a/7600339 Thanks and regards Lahiru On Fri, Apr 26, 2019 at 6:32 PM Lahiru Jayasekera <[email protected]> wrote: > Hi Madhawa, > Sorry for the late reply. Sure I'll try that and let you know. > > Thanks > > On Tue, Apr 23, 2019 at 2:24 PM Madhawa Kasun Gunasekara < > [email protected]> wrote: > >> Hi Lahiru, >> >> Good initiative. >> It seems like we need to use hadoop-hdfs and hadoop common with version >> 2.8.3. Try adding these dependencies, and exclude the old dependency from >> the gora, or otherwise, we can do a version upgrade in gora for Hadoop. at >> the moment we use Hadoop version 2.5.2 in gora but I prefer to upgrade the >> Hadoop versions in the gora to 2.8.3. >> >> Recently hazelcast developers released 3.0 version, We can try that also. >> >> Thanks, >> Madhawa >> >> >> On Sat, Apr 6, 2019 at 9:53 PM Lahiru Jayasekera <[email protected] >> > >> wrote: >> >> > Hi all, >> > I have come up with a poc for hazelcast jet execution engine support for >> > gora. >> > >> > >> https://github.com/LahiruJayasekara/gora/blob/poc-hazelcast-jet/gora-tutorial/src/main/java/org/apache/gora/tutorial/log/HazelcastJetPOC.java >> > >> > Here I have wrote a custom source for jet. What this does is read the >> > AccessLog table created via LogManager example, and feed the PageView >> > objects to jet. >> > >> > I have also looked into using hdfs connectors of jet. But I encountered >> a >> > problem with conflicting hadoop versions. InputFormat class used in >> gora is >> > from pacjage 'org.apache.hadoop.mapreduce', but in jet it is ' >> > org.apache.hadoop.mapred.InputFormat'. >> > >> > Anyway the proposed poc works as expected. Please add your suggestions >> > here. >> > >> > Thanks and regards >> > >> > -- >> > Lahiru Jayasekara >> > Batch 15 >> > Faculty of Information Technology >> > University of Moratuwa >> > 0716492170 >> > >> > > > -- > Lahiru Jayasekara > Batch 15 > Faculty of Information Technology > University of Moratuwa > 0716492170 > -- Lahiru Jayasekara Batch 15 Faculty of Information Technology University of Moratuwa 0716492170

