Please go through the following

https://github.com/apache/hadoop/blob/trunk/BUILDING.txt

and a specific command to generate the distribution which can be run after
your changes.
mvn package -Pdist -DskipTests -Dtar -Dmaven.javadoc.skip=true

Hope this helps.




On Thu, Jun 23, 2022 at 9:41 PM Rahul Bhardwaj <rahul265...@gmail.com>
wrote:

> I am following this wiki
> <https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/SingleCluster.html>
>  to
> build and run hadoop locally in pseudo-dsitributed mode. But I am unable to
> figure out how to build my changes and generate similar binaries so that I
> can test my changes locally. Is there some documentation on how to do this?
>
> On Mon, 13 Jun 2022 at 00:26, Brahma Reddy Battula <bra...@apache.org>
> wrote:
>
>> Hi Rahul,
>>
>> Welcome to hadoop world.
>>
>> Apart from the gautham mentioned, you can check the following also.
>> https://livebook.manning.com/book/hadoop-in-action/part-1/
>>
>> Go through the following wiki for contributions
>> https://cwiki.apache.org/confluence/display/HADOOP/How+To+Contribute
>>
>>
>> Please subscribe to the hadoop mailing list[1], and shoot your queries
>> there from next time.
>>
>>
>>
>>
>>
>> 1. https://hadoop.apache.org/mailing_lists.html
>>
>> On Sun, Jun 12, 2022 at 10:42 PM Gautham Banasandra <gaur...@apache.org>
>> wrote:
>>
>>> Hi Rahul,
>>>
>>> I was looking for something more detailed and low-level like how the code
>>> > for the various services in HDFS is organized, entrypoints etc.
>>>
>>> I found this book useful to get a good idea of Hadoop in general - Apache
>>> Hadoop™ YARN: Moving beyond MapReduce and Batch Processing with Apache
>>> Hadoop™ 2 [Book] (oreilly.com)
>>> <
>>> https://www.oreilly.com/library/view/apache-hadooptm-yarn/9780133441925/
>>> >.
>>>
>>> In my opinion, you get into Open Source contributions by just doing so.
>>> You
>>> don't have to know HDFS in detail to start contributing to it. Now that
>>> you've gone through the Hadoop documentation, try setting up Hadoop in
>>> pseudo-distributed mode. If you notice any glitch, try fixing it and send
>>> out a PR. You never know what issue you'll find. I ran into this when I
>>> tried compiling Hadoop on Windows - [HDFS-15385] Upgrade boost library to
>>> 1.72 - ASF JIRA (apache.org)
>>> <https://issues.apache.org/jira/browse/HDFS-15385> (And yes, this was my
>>> first PR to Hadoop). Then use Docker and set up the Hadoop cluster with
>>> multiple nodes. Once you're able to do this, try browsing
>>> issues.apache.org
>>> and you'll find tons of issues that you can work on. There's always so
>>> much
>>> work to do in Open Source and the thing that I like the most is that
>>> "there's no deadline on anything" :) So, you can really work on some
>>> awesome stuff, own it, perfect it and share it with the world.
>>>
>>> Best of luck.
>>>
>>> Thanks,
>>> --Gautham
>>>
>>> On Sun, 12 Jun 2022 at 16:34, Rahul Bhardwaj <rahul265...@gmail.com>
>>> wrote:
>>>
>>> > Hi all,
>>> > I am a newbie wanting to start contributing to the hadoop ecosystem. I
>>> want
>>> > to start by contributing to HDFS and was looking for resources to
>>> > understand the architecture and I just found this -
>>> >
>>> >
>>> https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html
>>> > which is a fairly high level documentation. I was looking for something
>>> > more detailed and low-level like how the code for the various services
>>> in
>>> > HDFS is organized, entrypoints etc. Can someone point me to such
>>> resources?
>>> > Also is there a slack workspace for such discussions? Not sure if this
>>> > mailing list is the right forum for such doubts.
>>> >
>>>
>>
>>
>> --
>>
>>
>>
>> --Brahma Reddy Battula
>>
>

-- 



--Brahma Reddy Battula

Reply via email to