Re: Spring for hadoop

Radim Kolar Tue, 22 Jan 2013 16:08:24 -0800

i have solution integrating spring beans and spring batch directly intohadoop core. its far more advanced then spring data hadoop support withpojo patch.

in my solution every component of mapreduce can be hadoop bean. You willget spring batch integrated directly into mapper, which means that youcan run multiple steps in one mapper pass and because of async writedone by spring batch you will get about 3x higher write performance. ihave rewriten HDFS which has way faster writes. Spring batch componentreplaces standard hadoop job manager (that thing with web gui) andspring integration is used for advanced stuff like multiresourcescheduling. You can write simple java bean for every new resource youwant to add into system and another bean for logic for assigning jobsbased on that particular resource.

I submitted few patches to hadoop but they were not interesting enoughto get into core. If you want to buy my hadoop with integrated spring,let me know.

Re: Spring for hadoop

Reply via email to