[ https://issues.apache.org/jira/browse/NUTCH-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355939#comment-14355939 ]
Lewis John McGibbney commented on NUTCH-1936: --------------------------------------------- Hi [~ashwini.tokekar] bq. I have a request can you please share more details about what are your expected outcomes from this project. The Hadoop API is used in a pervasive fashion throughout Nutch trunk codebase. Right now the entire codebase relies upon Hadoop 1.X. The idea is to move EVERY tool, and every instance of every class which uses Hadoop 1.X API over to Hadoop 2.X. I would suggest that you take time to look into the existing issues open for similar tasks e.g. https://wiki.apache.org/nutch/GoogleSummerOfCode#Jira_Issues > GSoC 2015 - Move Nutch to Hadoop 2.X > ------------------------------------ > > Key: NUTCH-1936 > URL: https://issues.apache.org/jira/browse/NUTCH-1936 > Project: Nutch > Issue Type: Task > Components: build > Reporter: Lewis John McGibbney > Labels: gsoc2015 > Fix For: 2.4, 1.11 > > > The Nutch PMC > [discussed|http://www.mail-archive.com/dev%40nutch.apache.org/msg16250.html] > ideas for a good 2015 GSoC project. It appears that porting the (trunk) > codebase to [Hadoop 2.X|http://hadoop.apache.org/docs/stable/] seems to an > attractive option and one which would present an excellent learning > experience for a summer student. > A more comprehensive description of this issue should be included within > either a mentor-defined project description or a successful student > application. -- This message was sent by Atlassian JIRA (v6.3.4#6332)