Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2016-01-07 Thread Zhe Zhang
Sorry for getting back to the thread late.

Since Ming/Daryn have not raised further concerns, I'm +0 on including EC
in 2.9. The upside is that we can perhaps position EC in 2.9 as a "beta"
feature to gain some production experiences, and graduate it from beta in
3.0. The downside is as Elliot stated, this will delay the upgrade to 2.9
for some customers.

On Tue, Dec 8, 2015 at 2:54 PM, Zhe Zhang  wrote:

> Hi Vinod,
>
> Thanks for the update. After seeing the comment from Elliot I checked with
> Ming Ma and Daryn Sharp offline, to get feedback based on their large scale
> deployments (Twitter and Yahoo).
>
> Based on my reading, Ming doesn't have a strong preference between 2.9 and
> 3.0, and Daryn is still having some questions about the stability of EC
> code.
>
> Maybe we should give Ming/Daryn some more time to reply to this thread?
>
> I think we should also address Elliot's comments above regarding major vs.
> minor releases, how that impacts the 2.9 upgrade timing etc.
>
> Thanks,
> ---
> Zhe Zhang
>
> On Tue, Dec 8, 2015 at 2:04 PM, Vinod Kumar Vavilapalli <
> vino...@apache.org> wrote:
>
>> Forgot to update this thread. I branched off 2.8 last week. So, we can
>> now go ahead and do a merge of HDFS-7285 into branch-2 (version 2.9) like
>> we discussed before.
>>
>> Thanks
>> +Vinod
>>
>>
>> > On Nov 3, 2015, at 4:40 PM, Vinod Kumar Vavilapalli <
>> vino...@hortonworks.com> wrote:
>> >
>> > That makes sense.
>> >
>> > Thanks for the discussion everyone, let’s stick to this tentative plan
>> of EC for 2.9.
>> >
>> > I just updated the Roadmap wiki to reflect the same.
>> >
>> > +Vinod
>> >
>> >
>> >> On Nov 2, 2015, at 4:26 PM, Zheng, Kai  wrote:
>> >>
>> >> Yeah, so for the issues we recently resolved on trunk and are
>> addressing as follow-on tasks in Phase I, we would label them with "erasure
>> coding" and maybe also set the target version as "2.9" for the convenience?
>> >>
>> >> -Original Message-
>> >> From: Jing Zhao [mailto:ji...@apache.org]
>> >> Sent: Tuesday, November 03, 2015 8:04 AM
>> >> To: hdfs-dev@hadoop.apache.org
>> >> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge
>> HDFS-7285 (erasure coding) branch to trunk]
>> >>
>> >> +1 for the plan about Phase I & II.
>> >>
>> >> BTW, maybe out of the scope of this thread, just want to mention we
>> should either move the jira under HDFS-8031 or update the jira component as
>> "erasure-coding" when making further improvement or fixing bugs in EC. In
>> this way it will be easier for later backporting EC to 2.9.
>> >>
>> >> On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B <
>> vinayakumarb.apa...@gmail.com
>> >>> wrote:
>> >>
>> >>> +1 for the idea.
>> >>> On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
>> >>>
>> >>>> Sounds good to me. When it's determined to include EC in 2.9
>> >>>> release, it may be good to have a rough release date as Zhe asked,
>> >>>> so accordingly the scope of EC can be discussed out. We still have
>> >>>> quite a few of things as Phase I follow-on tasks to do before EC can
>> >>>> be deployed in a production system. Phase II to develop non-striping
>> >>>> EC for cold data would possibly
>> >>> be
>> >>>> started after that. We might consider to include only Phase I and
>> >>>> leave Phase II for next release according to the rough release date.
>> >>>>
>> >>>> Regards,
>> >>>> Kai
>> >>>>
>> >>>> -Original Message-
>> >>>> From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
>> >>>> Sent: Tuesday, November 03, 2015 5:41 AM
>> >>>> To: hdfs-dev@hadoop.apache.org
>> >>>> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge
>> >>>> HDFS-7285 (erasure coding) branch to trunk]
>> >>>>
>> >>>> +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we
>> >>>> +plan to
>> >>>> have 2.8 and 2.9 releases.
>> >>>>
>> >>>> Regards,
>> >>>> Uma
>> >>>>
>> >>>> On 11/2/15, 11:49 AM, "

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-12-08 Thread Zhe Zhang
Hi Vinod,

Thanks for the update. After seeing the comment from Elliot I checked with
Ming Ma and Daryn Sharp offline, to get feedback based on their large scale
deployments (Twitter and Yahoo).

Based on my reading, Ming doesn't have a strong preference between 2.9 and
3.0, and Daryn is still having some questions about the stability of EC
code.

Maybe we should give Ming/Daryn some more time to reply to this thread?

I think we should also address Elliot's comments above regarding major vs.
minor releases, how that impacts the 2.9 upgrade timing etc.

Thanks,
---
Zhe Zhang

On Tue, Dec 8, 2015 at 2:04 PM, Vinod Kumar Vavilapalli 
wrote:

> Forgot to update this thread. I branched off 2.8 last week. So, we can now
> go ahead and do a merge of HDFS-7285 into branch-2 (version 2.9) like we
> discussed before.
>
> Thanks
> +Vinod
>
>
> > On Nov 3, 2015, at 4:40 PM, Vinod Kumar Vavilapalli <
> vino...@hortonworks.com> wrote:
> >
> > That makes sense.
> >
> > Thanks for the discussion everyone, let’s stick to this tentative plan
> of EC for 2.9.
> >
> > I just updated the Roadmap wiki to reflect the same.
> >
> > +Vinod
> >
> >
> >> On Nov 2, 2015, at 4:26 PM, Zheng, Kai  wrote:
> >>
> >> Yeah, so for the issues we recently resolved on trunk and are
> addressing as follow-on tasks in Phase I, we would label them with "erasure
> coding" and maybe also set the target version as "2.9" for the convenience?
> >>
> >> -Original Message-----
> >> From: Jing Zhao [mailto:ji...@apache.org]
> >> Sent: Tuesday, November 03, 2015 8:04 AM
> >> To: hdfs-dev@hadoop.apache.org
> >> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285
> (erasure coding) branch to trunk]
> >>
> >> +1 for the plan about Phase I & II.
> >>
> >> BTW, maybe out of the scope of this thread, just want to mention we
> should either move the jira under HDFS-8031 or update the jira component as
> "erasure-coding" when making further improvement or fixing bugs in EC. In
> this way it will be easier for later backporting EC to 2.9.
> >>
> >> On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B <
> vinayakumarb.apa...@gmail.com
> >>> wrote:
> >>
> >>> +1 for the idea.
> >>> On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
> >>>
> >>>> Sounds good to me. When it's determined to include EC in 2.9
> >>>> release, it may be good to have a rough release date as Zhe asked,
> >>>> so accordingly the scope of EC can be discussed out. We still have
> >>>> quite a few of things as Phase I follow-on tasks to do before EC can
> >>>> be deployed in a production system. Phase II to develop non-striping
> >>>> EC for cold data would possibly
> >>> be
> >>>> started after that. We might consider to include only Phase I and
> >>>> leave Phase II for next release according to the rough release date.
> >>>>
> >>>> Regards,
> >>>> Kai
> >>>>
> >>>> -Original Message-
> >>>> From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
> >>>> Sent: Tuesday, November 03, 2015 5:41 AM
> >>>> To: hdfs-dev@hadoop.apache.org
> >>>> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge
> >>>> HDFS-7285 (erasure coding) branch to trunk]
> >>>>
> >>>> +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we
> >>>> +plan to
> >>>> have 2.8 and 2.9 releases.
> >>>>
> >>>> Regards,
> >>>> Uma
> >>>>
> >>>> On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
> >>> wrote:
> >>>>
> >>>>> Forking the thread. Started looking at the 2.8 list, various
> >>>>> features¹ status and arrived here.
> >>>>>
> >>>>> While I understand the pervasive nature of EC and a need for a
> >>>>> significant bake-in, moving this to a 3.x release is not a good idea.
> >>>>> We will surely get a 2.8 out this year and, as needed, I can even
> >>>>> spend time getting started on a 2.9. OTOH, 3.x is long ways off,
> >>>>> and given all the incompatibilities there, it would be a while
> >>>>> before users can get their hands on EC if it were to be only on
> >>>>> 3.x. At best, this may force sites that wa

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-12-08 Thread Vinod Kumar Vavilapalli
Forgot to update this thread. I branched off 2.8 last week. So, we can now go 
ahead and do a merge of HDFS-7285 into branch-2 (version 2.9) like we discussed 
before.

Thanks
+Vinod


> On Nov 3, 2015, at 4:40 PM, Vinod Kumar Vavilapalli  
> wrote:
> 
> That makes sense.
> 
> Thanks for the discussion everyone, let’s stick to this tentative plan of EC 
> for 2.9.
> 
> I just updated the Roadmap wiki to reflect the same.
> 
> +Vinod
> 
> 
>> On Nov 2, 2015, at 4:26 PM, Zheng, Kai  wrote:
>> 
>> Yeah, so for the issues we recently resolved on trunk and are addressing as 
>> follow-on tasks in Phase I, we would label them with "erasure coding" and 
>> maybe also set the target version as "2.9" for the convenience?
>> 
>> -Original Message-
>> From: Jing Zhao [mailto:ji...@apache.org] 
>> Sent: Tuesday, November 03, 2015 8:04 AM
>> To: hdfs-dev@hadoop.apache.org
>> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 
>> (erasure coding) branch to trunk]
>> 
>> +1 for the plan about Phase I & II.
>> 
>> BTW, maybe out of the scope of this thread, just want to mention we should 
>> either move the jira under HDFS-8031 or update the jira component as 
>> "erasure-coding" when making further improvement or fixing bugs in EC. In 
>> this way it will be easier for later backporting EC to 2.9.
>> 
>> On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B >> wrote:
>> 
>>> +1 for the idea.
>>> On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
>>> 
>>>> Sounds good to me. When it's determined to include EC in 2.9 
>>>> release, it may be good to have a rough release date as Zhe asked, 
>>>> so accordingly the scope of EC can be discussed out. We still have 
>>>> quite a few of things as Phase I follow-on tasks to do before EC can 
>>>> be deployed in a production system. Phase II to develop non-striping 
>>>> EC for cold data would possibly
>>> be
>>>> started after that. We might consider to include only Phase I and 
>>>> leave Phase II for next release according to the rough release date.
>>>> 
>>>> Regards,
>>>> Kai
>>>> 
>>>> -Original Message-
>>>> From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
>>>> Sent: Tuesday, November 03, 2015 5:41 AM
>>>> To: hdfs-dev@hadoop.apache.org
>>>> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge 
>>>> HDFS-7285 (erasure coding) branch to trunk]
>>>> 
>>>> +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we 
>>>> +plan to
>>>> have 2.8 and 2.9 releases.
>>>> 
>>>> Regards,
>>>> Uma
>>>> 
>>>> On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
>>> wrote:
>>>> 
>>>>> Forking the thread. Started looking at the 2.8 list, various 
>>>>> features¹ status and arrived here.
>>>>> 
>>>>> While I understand the pervasive nature of EC and a need for a 
>>>>> significant bake-in, moving this to a 3.x release is not a good idea.
>>>>> We will surely get a 2.8 out this year and, as needed, I can even 
>>>>> spend time getting started on a 2.9. OTOH, 3.x is long ways off, 
>>>>> and given all the incompatibilities there, it would be a while 
>>>>> before users can get their hands on EC if it were to be only on 
>>>>> 3.x. At best, this may force sites that want EC to backport the 
>>>>> entire EC feature to older releases, at worst this will be repeat 
>>>>> the mess of 0.20 security release
>>>> forks.
>>>>> 
>>>>> If we think adding this to 2.8 (even if it switched off) is too 
>>>>> much risk per our original plan, let¹s move this to 2.9, there by 
>>>>> leaving enough time for stability, integration testing and bake-in, 
>>>>> and a realistic chance of having it end up on users¹ clusters soonish.
>>>>> 
>>>>> +Vinod
>>>>> 
>>>>>> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
>>>>>> 
>>>>>> wrote:
>>>>>> 
>>>>>> I think our plan thus far has been to target this for 3.0. I'm 
>>>>>> okay with  putting it in branch-2 if we've given a hard look at 
>>>>>> compatibility, but  I'll note thoug

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-04 Thread Elliott Clark
I don't really see the difference between 2.9 with a ton of scary changes
(EC is a lot more NN stuff than a usual release) and a 3.0.
What's the downside of getting a major version. It relaxes the compat a
little bit. It would allow some bake time before it's stable.

Put another way.  I'm upgrading our prod clusters to 2.7 now and 2.8 when
it comes out. That's because those releases are normal dot releases.
I would hold off a much longer time for a 2.9 with EC.

That to me really signals that it's a different type of release and that
should be messaged with a major version rev.

On Wed, Nov 4, 2015 at 4:06 PM, Andrew Purtell 
wrote:

> Yes we can mostly likely help you. Please come over to dev@bigtop.
>
>
> > On Nov 4, 2015, at 12:40 PM, Andrew Wang 
> wrote:
> >
> > We used to get help from Bigtop when it comes to integration testing. Do
> we
> > think that's possible for 2.8?
> >
> > On Wed, Nov 4, 2015 at 10:08 AM, Steve Loughran 
> > wrote:
> >
> >>
>  On 2 Nov 2015, at 23:11, Vinod Vavilapalli 
> >>> wrote:
> >>>
> >>> Yes, I’ve already started looking at 2.8.0, that is exactly how I ended
> >> up with this discussion on the state of EC.
> >>>
> >>> +Vinod
> >>>
> >>>
>  On Nov 2, 2015, at 3:02 PM, Haohui Mai  >>> ricet...@gmail.com>> wrote:
> >>>
> >>> Is it a good time to start the discussion on the issues of releasing
> 2.8?
> >>
> >> Before rushing to release 2.8, people should be trying in downstream
> apps
> >> today,. As well as identifying hdfs-client related issues, I've just
> >> discovered that the MiniYARNCluster has added lots of stack traces
> >> (YARN-4330), and I'm sure there are other regressions.
> >>
> >> It's generally not that hard to take a downstream project and try to
> build
> >> with a hadoop version of 2.8.0-SNAPSHOT; compilation and classpath
> problems
> >> will show up immediately; unit test regressions can be at least
> identified
> >> by switching between 2.7.1 and 2.8.0-SNAPSHOT.
>


Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-04 Thread Andrew Purtell
Yes we can mostly likely help you. Please come over to dev@bigtop. 


> On Nov 4, 2015, at 12:40 PM, Andrew Wang  wrote:
> 
> We used to get help from Bigtop when it comes to integration testing. Do we
> think that's possible for 2.8?
> 
> On Wed, Nov 4, 2015 at 10:08 AM, Steve Loughran 
> wrote:
> 
>> 
 On 2 Nov 2015, at 23:11, Vinod Vavilapalli 
>>> wrote:
>>> 
>>> Yes, I’ve already started looking at 2.8.0, that is exactly how I ended
>> up with this discussion on the state of EC.
>>> 
>>> +Vinod
>>> 
>>> 
 On Nov 2, 2015, at 3:02 PM, Haohui Mai >> ricet...@gmail.com>> wrote:
>>> 
>>> Is it a good time to start the discussion on the issues of releasing 2.8?
>> 
>> Before rushing to release 2.8, people should be trying in downstream apps
>> today,. As well as identifying hdfs-client related issues, I've just
>> discovered that the MiniYARNCluster has added lots of stack traces
>> (YARN-4330), and I'm sure there are other regressions.
>> 
>> It's generally not that hard to take a downstream project and try to build
>> with a hadoop version of 2.8.0-SNAPSHOT; compilation and classpath problems
>> will show up immediately; unit test regressions can be at least identified
>> by switching between 2.7.1 and 2.8.0-SNAPSHOT.


Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-04 Thread Andrew Wang
We used to get help from Bigtop when it comes to integration testing. Do we
think that's possible for 2.8?

On Wed, Nov 4, 2015 at 10:08 AM, Steve Loughran 
wrote:

>
> > On 2 Nov 2015, at 23:11, Vinod Vavilapalli 
> wrote:
> >
> > Yes, I’ve already started looking at 2.8.0, that is exactly how I ended
> up with this discussion on the state of EC.
> >
> > +Vinod
> >
> >
> > On Nov 2, 2015, at 3:02 PM, Haohui Mai  ricet...@gmail.com>> wrote:
> >
> > Is it a good time to start the discussion on the issues of releasing 2.8?
> >
>
> Before rushing to release 2.8, people should be trying in downstream apps
> today,. As well as identifying hdfs-client related issues, I've just
> discovered that the MiniYARNCluster has added lots of stack traces
> (YARN-4330), and I'm sure there are other regressions.
>
> It's generally not that hard to take a downstream project and try to build
> with a hadoop version of 2.8.0-SNAPSHOT; compilation and classpath problems
> will show up immediately; unit test regressions can be at least identified
> by switching between 2.7.1 and 2.8.0-SNAPSHOT.


Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-04 Thread Steve Loughran

> On 2 Nov 2015, at 23:11, Vinod Vavilapalli  wrote:
> 
> Yes, I’ve already started looking at 2.8.0, that is exactly how I ended up 
> with this discussion on the state of EC.
> 
> +Vinod
> 
> 
> On Nov 2, 2015, at 3:02 PM, Haohui Mai 
> mailto:ricet...@gmail.com>> wrote:
> 
> Is it a good time to start the discussion on the issues of releasing 2.8?
> 

Before rushing to release 2.8, people should be trying in downstream apps 
today,. As well as identifying hdfs-client related issues, I've just discovered 
that the MiniYARNCluster has added lots of stack traces (YARN-4330), and I'm 
sure there are other regressions.

It's generally not that hard to take a downstream project and try to build with 
a hadoop version of 2.8.0-SNAPSHOT; compilation and classpath problems will 
show up immediately; unit test regressions can be at least identified by 
switching between 2.7.1 and 2.8.0-SNAPSHOT.

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-04 Thread Zhe Zhang
To add to the above discussion on umbrella JIRA: the COMMON side changes of
EC have been tracked under HADOOP-11264 (before merge) and HADOOP-11842
(after merge).

---
Zhe Zhang

On Tue, Nov 3, 2015 at 4:40 PM, Vinod Vavilapalli 
wrote:

> That makes sense.
>
> Thanks for the discussion everyone, let’s stick to this tentative plan of
> EC for 2.9.
>
> I just updated the Roadmap wiki to reflect the same.
>
> +Vinod
>
>
> > On Nov 2, 2015, at 4:26 PM, Zheng, Kai  wrote:
> >
> > Yeah, so for the issues we recently resolved on trunk and are addressing
> as follow-on tasks in Phase I, we would label them with "erasure coding"
> and maybe also set the target version as "2.9" for the convenience?
> >
> > -Original Message-
> > From: Jing Zhao [mailto:ji...@apache.org]
> > Sent: Tuesday, November 03, 2015 8:04 AM
> > To: hdfs-dev@hadoop.apache.org
> > Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285
> (erasure coding) branch to trunk]
> >
> > +1 for the plan about Phase I & II.
> >
> > BTW, maybe out of the scope of this thread, just want to mention we
> should either move the jira under HDFS-8031 or update the jira component as
> "erasure-coding" when making further improvement or fixing bugs in EC. In
> this way it will be easier for later backporting EC to 2.9.
> >
> > On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B <
> vinayakumarb.apa...@gmail.com
> >> wrote:
> >
> >> +1 for the idea.
> >> On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
> >>
> >>> Sounds good to me. When it's determined to include EC in 2.9
> >>> release, it may be good to have a rough release date as Zhe asked,
> >>> so accordingly the scope of EC can be discussed out. We still have
> >>> quite a few of things as Phase I follow-on tasks to do before EC can
> >>> be deployed in a production system. Phase II to develop non-striping
> >>> EC for cold data would possibly
> >> be
> >>> started after that. We might consider to include only Phase I and
> >>> leave Phase II for next release according to the rough release date.
> >>>
> >>> Regards,
> >>> Kai
> >>>
> >>> -Original Message-
> >>> From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
> >>> Sent: Tuesday, November 03, 2015 5:41 AM
> >>> To: hdfs-dev@hadoop.apache.org
> >>> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge
> >>> HDFS-7285 (erasure coding) branch to trunk]
> >>>
> >>> +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we
> >>> +plan to
> >>> have 2.8 and 2.9 releases.
> >>>
> >>> Regards,
> >>> Uma
> >>>
> >>> On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
> >> wrote:
> >>>
> >>>> Forking the thread. Started looking at the 2.8 list, various
> >>>> features¹ status and arrived here.
> >>>>
> >>>> While I understand the pervasive nature of EC and a need for a
> >>>> significant bake-in, moving this to a 3.x release is not a good idea.
> >>>> We will surely get a 2.8 out this year and, as needed, I can even
> >>>> spend time getting started on a 2.9. OTOH, 3.x is long ways off,
> >>>> and given all the incompatibilities there, it would be a while
> >>>> before users can get their hands on EC if it were to be only on
> >>>> 3.x. At best, this may force sites that want EC to backport the
> >>>> entire EC feature to older releases, at worst this will be repeat
> >>>> the mess of 0.20 security release
> >>> forks.
> >>>>
> >>>> If we think adding this to 2.8 (even if it switched off) is too
> >>>> much risk per our original plan, let¹s move this to 2.9, there by
> >>>> leaving enough time for stability, integration testing and bake-in,
> >>>> and a realistic chance of having it end up on users¹ clusters soonish.
> >>>>
> >>>> +Vinod
> >>>>
> >>>>> On Oct 19, 2015, at 1:44 PM, Andrew Wang
> >>>>> 
> >>>>> wrote:
> >>>>>
> >>>>> I think our plan thus far has been to target this for 3.0. I'm
> >>>>> okay with  putting it in branch-2 if we've given a hard look at
> >>>>> compatibility, but  I'll n

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-03 Thread Vinod Vavilapalli
That makes sense.

Thanks for the discussion everyone, let’s stick to this tentative plan of EC 
for 2.9.

I just updated the Roadmap wiki to reflect the same.

+Vinod


> On Nov 2, 2015, at 4:26 PM, Zheng, Kai  wrote:
> 
> Yeah, so for the issues we recently resolved on trunk and are addressing as 
> follow-on tasks in Phase I, we would label them with "erasure coding" and 
> maybe also set the target version as "2.9" for the convenience?
> 
> -Original Message-
> From: Jing Zhao [mailto:ji...@apache.org] 
> Sent: Tuesday, November 03, 2015 8:04 AM
> To: hdfs-dev@hadoop.apache.org
> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 
> (erasure coding) branch to trunk]
> 
> +1 for the plan about Phase I & II.
> 
> BTW, maybe out of the scope of this thread, just want to mention we should 
> either move the jira under HDFS-8031 or update the jira component as 
> "erasure-coding" when making further improvement or fixing bugs in EC. In 
> this way it will be easier for later backporting EC to 2.9.
> 
> On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B > wrote:
> 
>> +1 for the idea.
>> On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
>> 
>>> Sounds good to me. When it's determined to include EC in 2.9 
>>> release, it may be good to have a rough release date as Zhe asked, 
>>> so accordingly the scope of EC can be discussed out. We still have 
>>> quite a few of things as Phase I follow-on tasks to do before EC can 
>>> be deployed in a production system. Phase II to develop non-striping 
>>> EC for cold data would possibly
>> be
>>> started after that. We might consider to include only Phase I and 
>>> leave Phase II for next release according to the rough release date.
>>> 
>>> Regards,
>>> Kai
>>> 
>>> -Original Message-
>>> From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
>>> Sent: Tuesday, November 03, 2015 5:41 AM
>>> To: hdfs-dev@hadoop.apache.org
>>> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge 
>>> HDFS-7285 (erasure coding) branch to trunk]
>>> 
>>> +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we 
>>> +plan to
>>> have 2.8 and 2.9 releases.
>>> 
>>> Regards,
>>> Uma
>>> 
>>> On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
>> wrote:
>>> 
>>>> Forking the thread. Started looking at the 2.8 list, various 
>>>> features¹ status and arrived here.
>>>> 
>>>> While I understand the pervasive nature of EC and a need for a 
>>>> significant bake-in, moving this to a 3.x release is not a good idea.
>>>> We will surely get a 2.8 out this year and, as needed, I can even 
>>>> spend time getting started on a 2.9. OTOH, 3.x is long ways off, 
>>>> and given all the incompatibilities there, it would be a while 
>>>> before users can get their hands on EC if it were to be only on 
>>>> 3.x. At best, this may force sites that want EC to backport the 
>>>> entire EC feature to older releases, at worst this will be repeat 
>>>> the mess of 0.20 security release
>>> forks.
>>>> 
>>>> If we think adding this to 2.8 (even if it switched off) is too 
>>>> much risk per our original plan, let¹s move this to 2.9, there by 
>>>> leaving enough time for stability, integration testing and bake-in, 
>>>> and a realistic chance of having it end up on users¹ clusters soonish.
>>>> 
>>>> +Vinod
>>>> 
>>>>> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
>>>>> 
>>>>> wrote:
>>>>> 
>>>>> I think our plan thus far has been to target this for 3.0. I'm 
>>>>> okay with  putting it in branch-2 if we've given a hard look at 
>>>>> compatibility, but  I'll note though that 2.8 is already looking 
>>>>> like quite a large release,  and our release bandwidth has been 
>>>>> focused on the 2.6 and 2.7 maintenance  releases. Adding another 
>>>>> multi-hundred JIRAs to 2.8 might make it too  unwieldy to get out 
>>>>> the door. If we bump EC past that, 3.0 might very well  be our 
>>>>> next release vehicle. I do plan to revive the 3.0 schedule some 
>>>>> time  next year. With EC and
>>>>> JDK8 in a good spot, the only big feature remaining  is classpath 
>>>>> isolation.
>>>>> 
>

RE: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-03 Thread Zheng, Kai
Thanks Andrew for pointing this. It sounds good. Yes we have umbrella JIRAs for 
the follow-on tasks, HDFS-8031 for the HDFS side, and HADOOP-11842 for the 
HADOOP side. 

-Original Message-
From: Andrew Wang [mailto:andrew.w...@cloudera.com] 
Sent: Tuesday, November 03, 2015 8:49 AM
To: hdfs-dev@hadoop.apache.org
Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 
(erasure coding) branch to trunk]

If we use an umbrella JIRA to categorize all the ongoing EC work, that will let 
us more easily change the target version later. For instance, if we decide to 
bump Phase II out of 2.9, then we just need to change the target version of the 
Phase II umbrella rather than all the subtasks.

On Mon, Nov 2, 2015 at 4:26 PM, Zheng, Kai  wrote:

> Yeah, so for the issues we recently resolved on trunk and are 
> addressing as follow-on tasks in Phase I, we would label them with "erasure 
> coding"
> and maybe also set the target version as "2.9" for the convenience?
>
> -Original Message-
> From: Jing Zhao [mailto:ji...@apache.org]
> Sent: Tuesday, November 03, 2015 8:04 AM
> To: hdfs-dev@hadoop.apache.org
> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge 
> HDFS-7285 (erasure coding) branch to trunk]
>
> +1 for the plan about Phase I & II.
>
> BTW, maybe out of the scope of this thread, just want to mention we 
> should either move the jira under HDFS-8031 or update the jira 
> component as "erasure-coding" when making further improvement or 
> fixing bugs in EC. In this way it will be easier for later backporting EC to 
> 2.9.
>
> On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B < 
> vinayakumarb.apa...@gmail.com
> > wrote:
>
> > +1 for the idea.
> > On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
> >
> > > Sounds good to me. When it's determined to include EC in 2.9 
> > > release, it may be good to have a rough release date as Zhe asked, 
> > > so accordingly the scope of EC can be discussed out. We still have 
> > > quite a few of things as Phase I follow-on tasks to do before EC 
> > > can be deployed in a production system. Phase II to develop 
> > > non-striping EC for cold data would possibly
> > be
> > > started after that. We might consider to include only Phase I and 
> > > leave Phase II for next release according to the rough release date.
> > >
> > > Regards,
> > > Kai
> > >
> > > -Original Message-
> > > From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
> > > Sent: Tuesday, November 03, 2015 5:41 AM
> > > To: hdfs-dev@hadoop.apache.org
> > > Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge
> > > HDFS-7285 (erasure coding) branch to trunk]
> > >
> > > +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we 
> > > +plan to
> > > have 2.8 and 2.9 releases.
> > >
> > > Regards,
> > > Uma
> > >
> > > On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
> > > 
> > wrote:
> > >
> > > >Forking the thread. Started looking at the 2.8 list, various 
> > > >features¹ status and arrived here.
> > > >
> > > >While I understand the pervasive nature of EC and a need for a 
> > > >significant bake-in, moving this to a 3.x release is not a good idea.
> > > >We will surely get a 2.8 out this year and, as needed, I can even 
> > > >spend time getting started on a 2.9. OTOH, 3.x is long ways off, 
> > > >and given all the incompatibilities there, it would be a while 
> > > >before users can get their hands on EC if it were to be only on 
> > > >3.x. At best, this may force sites that want EC to backport the 
> > > >entire EC feature to older releases, at worst this will be repeat 
> > > >the mess of 0.20 security release
> > > forks.
> > > >
> > > >If we think adding this to 2.8 (even if it switched off) is too 
> > > >much risk per our original plan, let¹s move this to 2.9, there by 
> > > >leaving enough time for stability, integration testing and 
> > > >bake-in, and a realistic chance of having it end up on users¹ clusters 
> > > >soonish.
> > > >
> > > >+Vinod
> > > >
> > > >> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
> > > >>
> > > >>wrote:
> > > >>
> > > >> I think our plan thus far has been to target this for 3.0. I'm 
> > > >>okay with  putting it in branch-2 

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Andrew Wang
If we use an umbrella JIRA to categorize all the ongoing EC work, that will
let us more easily change the target version later. For instance, if we
decide to bump Phase II out of 2.9, then we just need to change the target
version of the Phase II umbrella rather than all the subtasks.

On Mon, Nov 2, 2015 at 4:26 PM, Zheng, Kai  wrote:

> Yeah, so for the issues we recently resolved on trunk and are addressing
> as follow-on tasks in Phase I, we would label them with "erasure coding"
> and maybe also set the target version as "2.9" for the convenience?
>
> -Original Message-
> From: Jing Zhao [mailto:ji...@apache.org]
> Sent: Tuesday, November 03, 2015 8:04 AM
> To: hdfs-dev@hadoop.apache.org
> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285
> (erasure coding) branch to trunk]
>
> +1 for the plan about Phase I & II.
>
> BTW, maybe out of the scope of this thread, just want to mention we should
> either move the jira under HDFS-8031 or update the jira component as
> "erasure-coding" when making further improvement or fixing bugs in EC. In
> this way it will be easier for later backporting EC to 2.9.
>
> On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B <
> vinayakumarb.apa...@gmail.com
> > wrote:
>
> > +1 for the idea.
> > On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
> >
> > > Sounds good to me. When it's determined to include EC in 2.9
> > > release, it may be good to have a rough release date as Zhe asked,
> > > so accordingly the scope of EC can be discussed out. We still have
> > > quite a few of things as Phase I follow-on tasks to do before EC can
> > > be deployed in a production system. Phase II to develop non-striping
> > > EC for cold data would possibly
> > be
> > > started after that. We might consider to include only Phase I and
> > > leave Phase II for next release according to the rough release date.
> > >
> > > Regards,
> > > Kai
> > >
> > > -----Original Message-
> > > From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
> > > Sent: Tuesday, November 03, 2015 5:41 AM
> > > To: hdfs-dev@hadoop.apache.org
> > > Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge
> > > HDFS-7285 (erasure coding) branch to trunk]
> > >
> > > +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we
> > > +plan to
> > > have 2.8 and 2.9 releases.
> > >
> > > Regards,
> > > Uma
> > >
> > > On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
> > wrote:
> > >
> > > >Forking the thread. Started looking at the 2.8 list, various
> > > >features¹ status and arrived here.
> > > >
> > > >While I understand the pervasive nature of EC and a need for a
> > > >significant bake-in, moving this to a 3.x release is not a good idea.
> > > >We will surely get a 2.8 out this year and, as needed, I can even
> > > >spend time getting started on a 2.9. OTOH, 3.x is long ways off,
> > > >and given all the incompatibilities there, it would be a while
> > > >before users can get their hands on EC if it were to be only on
> > > >3.x. At best, this may force sites that want EC to backport the
> > > >entire EC feature to older releases, at worst this will be repeat
> > > >the mess of 0.20 security release
> > > forks.
> > > >
> > > >If we think adding this to 2.8 (even if it switched off) is too
> > > >much risk per our original plan, let¹s move this to 2.9, there by
> > > >leaving enough time for stability, integration testing and bake-in,
> > > >and a realistic chance of having it end up on users¹ clusters soonish.
> > > >
> > > >+Vinod
> > > >
> > > >> On Oct 19, 2015, at 1:44 PM, Andrew Wang
> > > >>
> > > >>wrote:
> > > >>
> > > >> I think our plan thus far has been to target this for 3.0. I'm
> > > >>okay with  putting it in branch-2 if we've given a hard look at
> > > >>compatibility, but  I'll note though that 2.8 is already looking
> > > >>like quite a large release,  and our release bandwidth has been
> > > >>focused on the 2.6 and 2.7 maintenance  releases. Adding another
> > > >>multi-hundred JIRAs to 2.8 might make it too  unwieldy to get out
> > > >>the door. If we bump EC past that, 3.0 might very well  be our
> > > >>next release vehicle. I

RE: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Zheng, Kai
Yeah, so for the issues we recently resolved on trunk and are addressing as 
follow-on tasks in Phase I, we would label them with "erasure coding" and maybe 
also set the target version as "2.9" for the convenience?

-Original Message-
From: Jing Zhao [mailto:ji...@apache.org] 
Sent: Tuesday, November 03, 2015 8:04 AM
To: hdfs-dev@hadoop.apache.org
Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 
(erasure coding) branch to trunk]

+1 for the plan about Phase I & II.

BTW, maybe out of the scope of this thread, just want to mention we should 
either move the jira under HDFS-8031 or update the jira component as 
"erasure-coding" when making further improvement or fixing bugs in EC. In this 
way it will be easier for later backporting EC to 2.9.

On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B  wrote:

> +1 for the idea.
> On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
>
> > Sounds good to me. When it's determined to include EC in 2.9 
> > release, it may be good to have a rough release date as Zhe asked, 
> > so accordingly the scope of EC can be discussed out. We still have 
> > quite a few of things as Phase I follow-on tasks to do before EC can 
> > be deployed in a production system. Phase II to develop non-striping 
> > EC for cold data would possibly
> be
> > started after that. We might consider to include only Phase I and 
> > leave Phase II for next release according to the rough release date.
> >
> > Regards,
> > Kai
> >
> > -Original Message-
> > From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
> > Sent: Tuesday, November 03, 2015 5:41 AM
> > To: hdfs-dev@hadoop.apache.org
> > Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge 
> > HDFS-7285 (erasure coding) branch to trunk]
> >
> > +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we 
> > +plan to
> > have 2.8 and 2.9 releases.
> >
> > Regards,
> > Uma
> >
> > On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
> wrote:
> >
> > >Forking the thread. Started looking at the 2.8 list, various 
> > >features¹ status and arrived here.
> > >
> > >While I understand the pervasive nature of EC and a need for a 
> > >significant bake-in, moving this to a 3.x release is not a good idea.
> > >We will surely get a 2.8 out this year and, as needed, I can even 
> > >spend time getting started on a 2.9. OTOH, 3.x is long ways off, 
> > >and given all the incompatibilities there, it would be a while 
> > >before users can get their hands on EC if it were to be only on 
> > >3.x. At best, this may force sites that want EC to backport the 
> > >entire EC feature to older releases, at worst this will be repeat 
> > >the mess of 0.20 security release
> > forks.
> > >
> > >If we think adding this to 2.8 (even if it switched off) is too 
> > >much risk per our original plan, let¹s move this to 2.9, there by 
> > >leaving enough time for stability, integration testing and bake-in, 
> > >and a realistic chance of having it end up on users¹ clusters soonish.
> > >
> > >+Vinod
> > >
> > >> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
> > >>
> > >>wrote:
> > >>
> > >> I think our plan thus far has been to target this for 3.0. I'm 
> > >>okay with  putting it in branch-2 if we've given a hard look at 
> > >>compatibility, but  I'll note though that 2.8 is already looking 
> > >>like quite a large release,  and our release bandwidth has been 
> > >>focused on the 2.6 and 2.7 maintenance  releases. Adding another 
> > >>multi-hundred JIRAs to 2.8 might make it too  unwieldy to get out 
> > >>the door. If we bump EC past that, 3.0 might very well  be our 
> > >>next release vehicle. I do plan to revive the 3.0 schedule some 
> > >>time  next year. With EC and
> > >>JDK8 in a good spot, the only big feature remaining  is classpath 
> > >>isolation.
> > >>
> > >> EC is also a pretty fundamental change to HDFS. Even if it's 
> > >>compatible, in  terms of size and impact it might best belong in a 
> > >>new major release.
> > >>
> > >> Best,
> > >> Andrew
> > >>
> > >> On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B < 
> > >> vinayakumarb.apa...@gmail.com> wrote:
> > >>
> > >>> Is anyone else also thinks that feature is ready to goto 
> > >>>branc

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Jing Zhao
+1 for the plan about Phase I & II.

BTW, maybe out of the scope of this thread, just want to mention we should
either move the jira under HDFS-8031 or update the jira component as
"erasure-coding" when making further improvement or fixing bugs in EC. In
this way it will be easier for later backporting EC to 2.9.

On Mon, Nov 2, 2015 at 3:48 PM, Vinayakumar B  wrote:

> +1 for the idea.
> On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:
>
> > Sounds good to me. When it's determined to include EC in 2.9 release, it
> > may be good to have a rough release date as Zhe asked, so accordingly the
> > scope of EC can be discussed out. We still have quite a few of things as
> > Phase I follow-on tasks to do before EC can be deployed in a production
> > system. Phase II to develop non-striping EC for cold data would possibly
> be
> > started after that. We might consider to include only Phase I and leave
> > Phase II for next release according to the rough release date.
> >
> > Regards,
> > Kai
> >
> > -Original Message-
> > From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
> > Sent: Tuesday, November 03, 2015 5:41 AM
> > To: hdfs-dev@hadoop.apache.org
> > Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285
> > (erasure coding) branch to trunk]
> >
> > +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we plan
> > +to
> > have 2.8 and 2.9 releases.
> >
> > Regards,
> > Uma
> >
> > On 11/2/15, 11:49 AM, "Vinod Vavilapalli" 
> wrote:
> >
> > >Forking the thread. Started looking at the 2.8 list, various features¹
> > >status and arrived here.
> > >
> > >While I understand the pervasive nature of EC and a need for a
> > >significant bake-in, moving this to a 3.x release is not a good idea.
> > >We will surely get a 2.8 out this year and, as needed, I can even spend
> > >time getting started on a 2.9. OTOH, 3.x is long ways off, and given
> > >all the incompatibilities there, it would be a while before users can
> > >get their hands on EC if it were to be only on 3.x. At best, this may
> > >force sites that want EC to backport the entire EC feature to older
> > >releases, at worst this will be repeat the mess of 0.20 security release
> > forks.
> > >
> > >If we think adding this to 2.8 (even if it switched off) is too much
> > >risk per our original plan, let¹s move this to 2.9, there by leaving
> > >enough time for stability, integration testing and bake-in, and a
> > >realistic chance of having it end up on users¹ clusters soonish.
> > >
> > >+Vinod
> > >
> > >> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
> > >>wrote:
> > >>
> > >> I think our plan thus far has been to target this for 3.0. I'm okay
> > >>with  putting it in branch-2 if we've given a hard look at
> > >>compatibility, but  I'll note though that 2.8 is already looking like
> > >>quite a large release,  and our release bandwidth has been focused on
> > >>the 2.6 and 2.7 maintenance  releases. Adding another multi-hundred
> > >>JIRAs to 2.8 might make it too  unwieldy to get out the door. If we
> > >>bump EC past that, 3.0 might very well  be our next release vehicle. I
> > >>do plan to revive the 3.0 schedule some time  next year. With EC and
> > >>JDK8 in a good spot, the only big feature remaining  is classpath
> > >>isolation.
> > >>
> > >> EC is also a pretty fundamental change to HDFS. Even if it's
> > >>compatible, in  terms of size and impact it might best belong in a new
> > >>major release.
> > >>
> > >> Best,
> > >> Andrew
> > >>
> > >> On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
> > >> vinayakumarb.apa...@gmail.com> wrote:
> > >>
> > >>> Is anyone else also thinks that feature is ready to goto branch-2
> > >>>as well?
> > >>>
> > >>> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since
> > >>>then and  ready to go in branch-2.
> > >>>
> > >>> -Vinay
> > >>> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
> > >>>
> > >>>> Thanks Vinay for capturing the issue and Uma for offering the help.
> > >>>>
> > >>>> ---
> > >>>> Zhe Zhang
> > >>>

RE: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Vinayakumar B
+1 for the idea.
On Nov 3, 2015 07:22, "Zheng, Kai"  wrote:

> Sounds good to me. When it's determined to include EC in 2.9 release, it
> may be good to have a rough release date as Zhe asked, so accordingly the
> scope of EC can be discussed out. We still have quite a few of things as
> Phase I follow-on tasks to do before EC can be deployed in a production
> system. Phase II to develop non-striping EC for cold data would possibly be
> started after that. We might consider to include only Phase I and leave
> Phase II for next release according to the rough release date.
>
> Regards,
> Kai
>
> -Original Message-
> From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com]
> Sent: Tuesday, November 03, 2015 5:41 AM
> To: hdfs-dev@hadoop.apache.org
> Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285
> (erasure coding) branch to trunk]
>
> +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we plan
> +to
> have 2.8 and 2.9 releases.
>
> Regards,
> Uma
>
> On 11/2/15, 11:49 AM, "Vinod Vavilapalli"  wrote:
>
> >Forking the thread. Started looking at the 2.8 list, various features¹
> >status and arrived here.
> >
> >While I understand the pervasive nature of EC and a need for a
> >significant bake-in, moving this to a 3.x release is not a good idea.
> >We will surely get a 2.8 out this year and, as needed, I can even spend
> >time getting started on a 2.9. OTOH, 3.x is long ways off, and given
> >all the incompatibilities there, it would be a while before users can
> >get their hands on EC if it were to be only on 3.x. At best, this may
> >force sites that want EC to backport the entire EC feature to older
> >releases, at worst this will be repeat the mess of 0.20 security release
> forks.
> >
> >If we think adding this to 2.8 (even if it switched off) is too much
> >risk per our original plan, let¹s move this to 2.9, there by leaving
> >enough time for stability, integration testing and bake-in, and a
> >realistic chance of having it end up on users¹ clusters soonish.
> >
> >+Vinod
> >
> >> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
> >>wrote:
> >>
> >> I think our plan thus far has been to target this for 3.0. I'm okay
> >>with  putting it in branch-2 if we've given a hard look at
> >>compatibility, but  I'll note though that 2.8 is already looking like
> >>quite a large release,  and our release bandwidth has been focused on
> >>the 2.6 and 2.7 maintenance  releases. Adding another multi-hundred
> >>JIRAs to 2.8 might make it too  unwieldy to get out the door. If we
> >>bump EC past that, 3.0 might very well  be our next release vehicle. I
> >>do plan to revive the 3.0 schedule some time  next year. With EC and
> >>JDK8 in a good spot, the only big feature remaining  is classpath
> >>isolation.
> >>
> >> EC is also a pretty fundamental change to HDFS. Even if it's
> >>compatible, in  terms of size and impact it might best belong in a new
> >>major release.
> >>
> >> Best,
> >> Andrew
> >>
> >> On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
> >> vinayakumarb.apa...@gmail.com> wrote:
> >>
> >>> Is anyone else also thinks that feature is ready to goto branch-2
> >>>as well?
> >>>
> >>> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since
> >>>then and  ready to go in branch-2.
> >>>
> >>> -Vinay
> >>> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
> >>>
> >>>> Thanks Vinay for capturing the issue and Uma for offering the help.
> >>>>
> >>>> ---
> >>>> Zhe Zhang
> >>>>
> >>>> On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
> >>> uma.ganguma...@intel.com
> >>>>>
> >>>> wrote:
> >>>>
> >>>>> Vinay,
> >>>>>
> >>>>>
> >>>>> I would merge them as part of HDFS-9182.
> >>>>>
> >>>>> Thanks,
> >>>>> Uma
> >>>>>
> >>>>>
> >>>>>
> >>>>> On 10/5/15, 12:48 AM, "Vinayakumar B" 
> >>>>>wrote:
> >>>>>
> >>>>>> Hi Andrew,
> >>>>>> I see CHANGES.txt entries not yet merged from
> >>> CHANGES-HDFS-EC-7285.txt.
> >>>>>>
> >>>&

RE: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Zheng, Kai
Sounds good to me. When it's determined to include EC in 2.9 release, it may be 
good to have a rough release date as Zhe asked, so accordingly the scope of EC 
can be discussed out. We still have quite a few of things as Phase I follow-on 
tasks to do before EC can be deployed in a production system. Phase II to 
develop non-striping EC for cold data would possibly be started after that. We 
might consider to include only Phase I and leave Phase II for next release 
according to the rough release date.

Regards,
Kai

-Original Message-
From: Gangumalla, Uma [mailto:uma.ganguma...@intel.com] 
Sent: Tuesday, November 03, 2015 5:41 AM
To: hdfs-dev@hadoop.apache.org
Subject: Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 
(erasure coding) branch to trunk]

+1 for EC to go into 2.9. Yes, 3.x would be long way to go when we plan 
+to
have 2.8 and 2.9 releases.

Regards,
Uma

On 11/2/15, 11:49 AM, "Vinod Vavilapalli"  wrote:

>Forking the thread. Started looking at the 2.8 list, various features¹ 
>status and arrived here.
>
>While I understand the pervasive nature of EC and a need for a 
>significant bake-in, moving this to a 3.x release is not a good idea. 
>We will surely get a 2.8 out this year and, as needed, I can even spend 
>time getting started on a 2.9. OTOH, 3.x is long ways off, and given 
>all the incompatibilities there, it would be a while before users can 
>get their hands on EC if it were to be only on 3.x. At best, this may 
>force sites that want EC to backport the entire EC feature to older 
>releases, at worst this will be repeat the mess of 0.20 security release forks.
>
>If we think adding this to 2.8 (even if it switched off) is too much 
>risk per our original plan, let¹s move this to 2.9, there by leaving 
>enough time for stability, integration testing and bake-in, and a 
>realistic chance of having it end up on users¹ clusters soonish.
>
>+Vinod
>
>> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
>>wrote:
>> 
>> I think our plan thus far has been to target this for 3.0. I'm okay 
>>with  putting it in branch-2 if we've given a hard look at 
>>compatibility, but  I'll note though that 2.8 is already looking like 
>>quite a large release,  and our release bandwidth has been focused on 
>>the 2.6 and 2.7 maintenance  releases. Adding another multi-hundred 
>>JIRAs to 2.8 might make it too  unwieldy to get out the door. If we 
>>bump EC past that, 3.0 might very well  be our next release vehicle. I 
>>do plan to revive the 3.0 schedule some time  next year. With EC and 
>>JDK8 in a good spot, the only big feature remaining  is classpath 
>>isolation.
>> 
>> EC is also a pretty fundamental change to HDFS. Even if it's 
>>compatible, in  terms of size and impact it might best belong in a new 
>>major release.
>> 
>> Best,
>> Andrew
>> 
>> On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B < 
>> vinayakumarb.apa...@gmail.com> wrote:
>> 
>>> Is anyone else also thinks that feature is ready to goto branch-2  
>>>as well?
>>> 
>>> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since 
>>>then and  ready to go in branch-2.
>>> 
>>> -Vinay
>>> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
>>> 
>>>> Thanks Vinay for capturing the issue and Uma for offering the help.
>>>> 
>>>> ---
>>>> Zhe Zhang
>>>> 
>>>> On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
>>> uma.ganguma...@intel.com
>>>>> 
>>>> wrote:
>>>> 
>>>>> Vinay,
>>>>> 
>>>>> 
>>>>> I would merge them as part of HDFS-9182.
>>>>> 
>>>>> Thanks,
>>>>> Uma
>>>>> 
>>>>> 
>>>>> 
>>>>> On 10/5/15, 12:48 AM, "Vinayakumar B" 
>>>>>wrote:
>>>>> 
>>>>>> Hi Andrew,
>>>>>> I see CHANGES.txt entries not yet merged from
>>> CHANGES-HDFS-EC-7285.txt.
>>>>>> 
>>>>>> Was this intentional?
>>>>>> 
>>>>>> Regards,
>>>>>> Vinay
>>>>>> 
>>>>>> On Wed, Sep 30, 2015 at 9:15 PM, Andrew Wang <
>>> andrew.w...@cloudera.com>
>>>>>> wrote:
>>>>>> 
>>>>>>> Branch has been merged to trunk, thanks again to everyone who 
>>>>>>>worked
>>>> on
>>>>>>> the
>>>

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Vinod Vavilapalli
Yes, I’ve already started looking at 2.8.0, that is exactly how I ended up with 
this discussion on the state of EC.

+Vinod


On Nov 2, 2015, at 3:02 PM, Haohui Mai 
mailto:ricet...@gmail.com>> wrote:

Is it a good time to start the discussion on the issues of releasing 2.8?



Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Haohui Mai
+1 on putting EC on 2.9.

Is it a good time to start the discussion on the issues of releasing 2.8?

~Haohui

On Mon, Nov 2, 2015 at 1:40 PM, Gangumalla, Uma
 wrote:
> +1 for EC to go into 2.9. Yes, 3.x would be long way to go when we plan to
> have 2.8 and 2.9 releases.
>
> Regards,
> Uma
>
> On 11/2/15, 11:49 AM, "Vinod Vavilapalli"  wrote:
>
>>Forking the thread. Started looking at the 2.8 list, various features¹
>>status and arrived here.
>>
>>While I understand the pervasive nature of EC and a need for a
>>significant bake-in, moving this to a 3.x release is not a good idea. We
>>will surely get a 2.8 out this year and, as needed, I can even spend time
>>getting started on a 2.9. OTOH, 3.x is long ways off, and given all the
>>incompatibilities there, it would be a while before users can get their
>>hands on EC if it were to be only on 3.x. At best, this may force sites
>>that want EC to backport the entire EC feature to older releases, at
>>worst this will be repeat the mess of 0.20 security release forks.
>>
>>If we think adding this to 2.8 (even if it switched off) is too much risk
>>per our original plan, let¹s move this to 2.9, there by leaving enough
>>time for stability, integration testing and bake-in, and a realistic
>>chance of having it end up on users¹ clusters soonish.
>>
>>+Vinod
>>
>>> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
>>>wrote:
>>>
>>> I think our plan thus far has been to target this for 3.0. I'm okay with
>>> putting it in branch-2 if we've given a hard look at compatibility, but
>>> I'll note though that 2.8 is already looking like quite a large release,
>>> and our release bandwidth has been focused on the 2.6 and 2.7
>>>maintenance
>>> releases. Adding another multi-hundred JIRAs to 2.8 might make it too
>>> unwieldy to get out the door. If we bump EC past that, 3.0 might very
>>>well
>>> be our next release vehicle. I do plan to revive the 3.0 schedule some
>>>time
>>> next year. With EC and JDK8 in a good spot, the only big feature
>>>remaining
>>> is classpath isolation.
>>>
>>> EC is also a pretty fundamental change to HDFS. Even if it's
>>>compatible, in
>>> terms of size and impact it might best belong in a new major release.
>>>
>>> Best,
>>> Andrew
>>>
>>> On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
>>> vinayakumarb.apa...@gmail.com> wrote:
>>>
 Is anyone else also thinks that feature is ready to goto branch-2  as
well?

 Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since
then and
 ready to go in branch-2.

 -Vinay
 On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:

> Thanks Vinay for capturing the issue and Uma for offering the help.
>
> ---
> Zhe Zhang
>
> On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
 uma.ganguma...@intel.com
>>
> wrote:
>
>> Vinay,
>>
>>
>> I would merge them as part of HDFS-9182.
>>
>> Thanks,
>> Uma
>>
>>
>>
>> On 10/5/15, 12:48 AM, "Vinayakumar B" 
>>wrote:
>>
>>> Hi Andrew,
>>> I see CHANGES.txt entries not yet merged from
 CHANGES-HDFS-EC-7285.txt.
>>>
>>> Was this intentional?
>>>
>>> Regards,
>>> Vinay
>>>
>>> On Wed, Sep 30, 2015 at 9:15 PM, Andrew Wang <
 andrew.w...@cloudera.com>
>>> wrote:
>>>
 Branch has been merged to trunk, thanks again to everyone who
worked
> on
 the
 feature!

 On Tue, Sep 29, 2015 at 10:44 PM, Zhe Zhang 
 wrote:

> Thanks everyone who has participated in this discussion.
>
> With 7 +1's (5 binding and 2 non-binding), and no -1, this vote
 has
 passed.
> I will do a final 'git merge' with trunk and work with Andrew to
> merge
 the
> branch to trunk. I'll update on this thread when the merge is
 done.
>
> ---
> Zhe Zhang
>
> On Thu, Sep 24, 2015 at 11:08 PM, Liu, Yi A 
 wrote:
>
>> (Change it to binding.)
>>
>> +1
>> I have been involved in the development and code review on the
 feature
>> branch. It's a great feature and I think it's ready to merge it
> into
> trunk.
>>
>> Thanks all for the contribution.
>>
>> Regards,
>> Yi Liu
>>
>>
>> -Original Message-
>> From: Liu, Yi A
>> Sent: Friday, September 25, 2015 1:51 PM
>> To: hdfs-dev@hadoop.apache.org
>> Subject: RE: [VOTE] Merge HDFS-7285 (erasure coding) branch to
> trunk
>>
>> +1 (non-binding)
>> I have been involved in the development and code review on the
 feature
>> branch. It's a great feature and I think it's ready to merge it
> into
> trunk.
>>
>> Thanks all for the contribution.
>>
>> Regards,
>> Yi Liu

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Gangumalla, Uma
+1 for EC to go into 2.9. Yes, 3.x would be long way to go when we plan to
have 2.8 and 2.9 releases.

Regards,
Uma

On 11/2/15, 11:49 AM, "Vinod Vavilapalli"  wrote:

>Forking the thread. Started looking at the 2.8 list, various features¹
>status and arrived here.
>
>While I understand the pervasive nature of EC and a need for a
>significant bake-in, moving this to a 3.x release is not a good idea. We
>will surely get a 2.8 out this year and, as needed, I can even spend time
>getting started on a 2.9. OTOH, 3.x is long ways off, and given all the
>incompatibilities there, it would be a while before users can get their
>hands on EC if it were to be only on 3.x. At best, this may force sites
>that want EC to backport the entire EC feature to older releases, at
>worst this will be repeat the mess of 0.20 security release forks.
>
>If we think adding this to 2.8 (even if it switched off) is too much risk
>per our original plan, let¹s move this to 2.9, there by leaving enough
>time for stability, integration testing and bake-in, and a realistic
>chance of having it end up on users¹ clusters soonish.
>
>+Vinod
>
>> On Oct 19, 2015, at 1:44 PM, Andrew Wang 
>>wrote:
>> 
>> I think our plan thus far has been to target this for 3.0. I'm okay with
>> putting it in branch-2 if we've given a hard look at compatibility, but
>> I'll note though that 2.8 is already looking like quite a large release,
>> and our release bandwidth has been focused on the 2.6 and 2.7
>>maintenance
>> releases. Adding another multi-hundred JIRAs to 2.8 might make it too
>> unwieldy to get out the door. If we bump EC past that, 3.0 might very
>>well
>> be our next release vehicle. I do plan to revive the 3.0 schedule some
>>time
>> next year. With EC and JDK8 in a good spot, the only big feature
>>remaining
>> is classpath isolation.
>> 
>> EC is also a pretty fundamental change to HDFS. Even if it's
>>compatible, in
>> terms of size and impact it might best belong in a new major release.
>> 
>> Best,
>> Andrew
>> 
>> On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
>> vinayakumarb.apa...@gmail.com> wrote:
>> 
>>> Is anyone else also thinks that feature is ready to goto branch-2  as
>>>well?
>>> 
>>> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since
>>>then and
>>> ready to go in branch-2.
>>> 
>>> -Vinay
>>> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
>>> 
 Thanks Vinay for capturing the issue and Uma for offering the help.
 
 ---
 Zhe Zhang
 
 On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
>>> uma.ganguma...@intel.com
> 
 wrote:
 
> Vinay,
> 
> 
> I would merge them as part of HDFS-9182.
> 
> Thanks,
> Uma
> 
> 
> 
> On 10/5/15, 12:48 AM, "Vinayakumar B" 
>wrote:
> 
>> Hi Andrew,
>> I see CHANGES.txt entries not yet merged from
>>> CHANGES-HDFS-EC-7285.txt.
>> 
>> Was this intentional?
>> 
>> Regards,
>> Vinay
>> 
>> On Wed, Sep 30, 2015 at 9:15 PM, Andrew Wang <
>>> andrew.w...@cloudera.com>
>> wrote:
>> 
>>> Branch has been merged to trunk, thanks again to everyone who
>>>worked
 on
>>> the
>>> feature!
>>> 
>>> On Tue, Sep 29, 2015 at 10:44 PM, Zhe Zhang 
>>> wrote:
>>> 
 Thanks everyone who has participated in this discussion.
 
 With 7 +1's (5 binding and 2 non-binding), and no -1, this vote
>>> has
>>> passed.
 I will do a final 'git merge' with trunk and work with Andrew to
 merge
>>> the
 branch to trunk. I'll update on this thread when the merge is
>>> done.
 
 ---
 Zhe Zhang
 
 On Thu, Sep 24, 2015 at 11:08 PM, Liu, Yi A 
>>> wrote:
 
> (Change it to binding.)
> 
> +1
> I have been involved in the development and code review on the
>>> feature
> branch. It's a great feature and I think it's ready to merge it
 into
 trunk.
> 
> Thanks all for the contribution.
> 
> Regards,
> Yi Liu
> 
> 
> -Original Message-
> From: Liu, Yi A
> Sent: Friday, September 25, 2015 1:51 PM
> To: hdfs-dev@hadoop.apache.org
> Subject: RE: [VOTE] Merge HDFS-7285 (erasure coding) branch to
 trunk
> 
> +1 (non-binding)
> I have been involved in the development and code review on the
>>> feature
> branch. It's a great feature and I think it's ready to merge it
 into
 trunk.
> 
> Thanks all for the contribution.
> 
> Regards,
> Yi Liu
> 
> 
> -Original Message-
> From: Vinayakumar B [mailto:vinayakum...@apache.org]
> Sent: Friday, September 25, 2015 12:21 PM
> To: hdfs-dev@hadoop.apache.org
> Subject: Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to
 

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Zhe Zhang
Thanks Vinod for the proposal and Andrew/Jing for the comments. 2.9 sounds
a good plan. As Andrew pointed out, 2.8 is already quite big. And even when
disabled, EC logic has been baked in NN pretty deeply.

Do we have a tentative date or estimate for 2.9?

---
Zhe Zhang

On Mon, Nov 2, 2015 at 1:22 PM, Jing Zhao  wrote:

> Thanks for the discussion, Vinod and Andrew. Backporting EC to 2.9 sounds
> good to me.
>
> On Mon, Nov 2, 2015 at 12:06 PM, Andrew Wang 
> wrote:
>
> > Thanks for forking the thread Vinod,
> >
> > SGTM, though I really do recommend waiting for 2.9 given the current size
> > of 2.8. I'm not a fan of an "off by default" half-measure, since it
> doesn't
> > change our compatibility requirements, and there's some major NN surgery
> > that can't really be disabled.
> >
> > If we do find a major user who's backported this to their own branch-2
> > fork, I agree that's motivation to get it in an upstream release
> quicker. I
> > haven't heard anything along these lines though.
> >
> > On Mon, Nov 2, 2015 at 11:49 AM, Vinod Vavilapalli <
> > vino...@hortonworks.com>
> > wrote:
> >
> > > Forking the thread. Started looking at the 2.8 list, various features’
> > > status and arrived here.
> > >
> > > While I understand the pervasive nature of EC and a need for a
> > significant
> > > bake-in, moving this to a 3.x release is not a good idea. We will
> surely
> > > get a 2.8 out this year and, as needed, I can even spend time getting
> > > started on a 2.9. OTOH, 3.x is long ways off, and given all the
> > > incompatibilities there, it would be a while before users can get their
> > > hands on EC if it were to be only on 3.x. At best, this may force sites
> > > that want EC to backport the entire EC feature to older releases, at
> > worst
> > > this will be repeat the mess of 0.20 security release forks.
> > >
> > > If we think adding this to 2.8 (even if it switched off) is too much
> risk
> > > per our original plan, let’s move this to 2.9, there by leaving enough
> > time
> > > for stability, integration testing and bake-in, and a realistic chance
> of
> > > having it end up on users’ clusters soonish.
> > >
> > > +Vinod
> > >
> > > > On Oct 19, 2015, at 1:44 PM, Andrew Wang 
> > > wrote:
> > > >
> > > > I think our plan thus far has been to target this for 3.0. I'm okay
> > with
> > > > putting it in branch-2 if we've given a hard look at compatibility,
> but
> > > > I'll note though that 2.8 is already looking like quite a large
> > release,
> > > > and our release bandwidth has been focused on the 2.6 and 2.7
> > maintenance
> > > > releases. Adding another multi-hundred JIRAs to 2.8 might make it too
> > > > unwieldy to get out the door. If we bump EC past that, 3.0 might very
> > > well
> > > > be our next release vehicle. I do plan to revive the 3.0 schedule
> some
> > > time
> > > > next year. With EC and JDK8 in a good spot, the only big feature
> > > remaining
> > > > is classpath isolation.
> > > >
> > > > EC is also a pretty fundamental change to HDFS. Even if it's
> > compatible,
> > > in
> > > > terms of size and impact it might best belong in a new major release.
> > > >
> > > > Best,
> > > > Andrew
> > > >
> > > > On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
> > > > vinayakumarb.apa...@gmail.com> wrote:
> > > >
> > > >> Is anyone else also thinks that feature is ready to goto branch-2
> as
> > > well?
> > > >>
> > > >> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since
> > then
> > > and
> > > >> ready to go in branch-2.
> > > >>
> > > >> -Vinay
> > > >> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
> > > >>
> > > >>> Thanks Vinay for capturing the issue and Uma for offering the help.
> > > >>>
> > > >>> ---
> > > >>> Zhe Zhang
> > > >>>
> > > >>> On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
> > > >> uma.ganguma...@intel.com
> > > 
> > > >>> wrote:
> > > >>>
> > >  Vinay,
> > > 
> > > 
> > >  I would merge them as part of HDFS-9182.
> > > 
> > >  Thanks,
> > >  Uma
> > > 
> > > 
> > > 
> > >  On 10/5/15, 12:48 AM, "Vinayakumar B" 
> > > wrote:
> > > 
> > > > Hi Andrew,
> > > > I see CHANGES.txt entries not yet merged from
> > > >> CHANGES-HDFS-EC-7285.txt.
> > > >
> > > > Was this intentional?
> > > >
> > > > Regards,
> > > > Vinay
> > > >
> > > > On Wed, Sep 30, 2015 at 9:15 PM, Andrew Wang <
> > > >> andrew.w...@cloudera.com>
> > > > wrote:
> > > >
> > > >> Branch has been merged to trunk, thanks again to everyone who
> > worked
> > > >>> on
> > > >> the
> > > >> feature!
> > > >>
> > > >> On Tue, Sep 29, 2015 at 10:44 PM, Zhe Zhang <
> > zhezh...@cloudera.com>
> > > >> wrote:
> > > >>
> > > >>> Thanks everyone who has participated in this discussion.
> > > >>>
> > > >>> With 7 +1's (5 binding and 2 non-binding), and no -1, this vote
> > > >> has
> > > >> passed.
> > > >>> I will do a final 'git merge' with trunk

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Jing Zhao
Thanks for the discussion, Vinod and Andrew. Backporting EC to 2.9 sounds
good to me.

On Mon, Nov 2, 2015 at 12:06 PM, Andrew Wang 
wrote:

> Thanks for forking the thread Vinod,
>
> SGTM, though I really do recommend waiting for 2.9 given the current size
> of 2.8. I'm not a fan of an "off by default" half-measure, since it doesn't
> change our compatibility requirements, and there's some major NN surgery
> that can't really be disabled.
>
> If we do find a major user who's backported this to their own branch-2
> fork, I agree that's motivation to get it in an upstream release quicker. I
> haven't heard anything along these lines though.
>
> On Mon, Nov 2, 2015 at 11:49 AM, Vinod Vavilapalli <
> vino...@hortonworks.com>
> wrote:
>
> > Forking the thread. Started looking at the 2.8 list, various features’
> > status and arrived here.
> >
> > While I understand the pervasive nature of EC and a need for a
> significant
> > bake-in, moving this to a 3.x release is not a good idea. We will surely
> > get a 2.8 out this year and, as needed, I can even spend time getting
> > started on a 2.9. OTOH, 3.x is long ways off, and given all the
> > incompatibilities there, it would be a while before users can get their
> > hands on EC if it were to be only on 3.x. At best, this may force sites
> > that want EC to backport the entire EC feature to older releases, at
> worst
> > this will be repeat the mess of 0.20 security release forks.
> >
> > If we think adding this to 2.8 (even if it switched off) is too much risk
> > per our original plan, let’s move this to 2.9, there by leaving enough
> time
> > for stability, integration testing and bake-in, and a realistic chance of
> > having it end up on users’ clusters soonish.
> >
> > +Vinod
> >
> > > On Oct 19, 2015, at 1:44 PM, Andrew Wang 
> > wrote:
> > >
> > > I think our plan thus far has been to target this for 3.0. I'm okay
> with
> > > putting it in branch-2 if we've given a hard look at compatibility, but
> > > I'll note though that 2.8 is already looking like quite a large
> release,
> > > and our release bandwidth has been focused on the 2.6 and 2.7
> maintenance
> > > releases. Adding another multi-hundred JIRAs to 2.8 might make it too
> > > unwieldy to get out the door. If we bump EC past that, 3.0 might very
> > well
> > > be our next release vehicle. I do plan to revive the 3.0 schedule some
> > time
> > > next year. With EC and JDK8 in a good spot, the only big feature
> > remaining
> > > is classpath isolation.
> > >
> > > EC is also a pretty fundamental change to HDFS. Even if it's
> compatible,
> > in
> > > terms of size and impact it might best belong in a new major release.
> > >
> > > Best,
> > > Andrew
> > >
> > > On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
> > > vinayakumarb.apa...@gmail.com> wrote:
> > >
> > >> Is anyone else also thinks that feature is ready to goto branch-2  as
> > well?
> > >>
> > >> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since
> then
> > and
> > >> ready to go in branch-2.
> > >>
> > >> -Vinay
> > >> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
> > >>
> > >>> Thanks Vinay for capturing the issue and Uma for offering the help.
> > >>>
> > >>> ---
> > >>> Zhe Zhang
> > >>>
> > >>> On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
> > >> uma.ganguma...@intel.com
> > 
> > >>> wrote:
> > >>>
> >  Vinay,
> > 
> > 
> >  I would merge them as part of HDFS-9182.
> > 
> >  Thanks,
> >  Uma
> > 
> > 
> > 
> >  On 10/5/15, 12:48 AM, "Vinayakumar B" 
> > wrote:
> > 
> > > Hi Andrew,
> > > I see CHANGES.txt entries not yet merged from
> > >> CHANGES-HDFS-EC-7285.txt.
> > >
> > > Was this intentional?
> > >
> > > Regards,
> > > Vinay
> > >
> > > On Wed, Sep 30, 2015 at 9:15 PM, Andrew Wang <
> > >> andrew.w...@cloudera.com>
> > > wrote:
> > >
> > >> Branch has been merged to trunk, thanks again to everyone who
> worked
> > >>> on
> > >> the
> > >> feature!
> > >>
> > >> On Tue, Sep 29, 2015 at 10:44 PM, Zhe Zhang <
> zhezh...@cloudera.com>
> > >> wrote:
> > >>
> > >>> Thanks everyone who has participated in this discussion.
> > >>>
> > >>> With 7 +1's (5 binding and 2 non-binding), and no -1, this vote
> > >> has
> > >> passed.
> > >>> I will do a final 'git merge' with trunk and work with Andrew to
> > >>> merge
> > >> the
> > >>> branch to trunk. I'll update on this thread when the merge is
> > >> done.
> > >>>
> > >>> ---
> > >>> Zhe Zhang
> > >>>
> > >>> On Thu, Sep 24, 2015 at 11:08 PM, Liu, Yi A 
> > >> wrote:
> > >>>
> >  (Change it to binding.)
> > 
> >  +1
> >  I have been involved in the development and code review on the
> > >> feature
> >  branch. It's a great feature and I think it's ready to merge it
> > >>> into
> > >>> trunk.
> > 
> >  Thanks all for the con

Re: Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Andrew Wang
Thanks for forking the thread Vinod,

SGTM, though I really do recommend waiting for 2.9 given the current size
of 2.8. I'm not a fan of an "off by default" half-measure, since it doesn't
change our compatibility requirements, and there's some major NN surgery
that can't really be disabled.

If we do find a major user who's backported this to their own branch-2
fork, I agree that's motivation to get it in an upstream release quicker. I
haven't heard anything along these lines though.

On Mon, Nov 2, 2015 at 11:49 AM, Vinod Vavilapalli 
wrote:

> Forking the thread. Started looking at the 2.8 list, various features’
> status and arrived here.
>
> While I understand the pervasive nature of EC and a need for a significant
> bake-in, moving this to a 3.x release is not a good idea. We will surely
> get a 2.8 out this year and, as needed, I can even spend time getting
> started on a 2.9. OTOH, 3.x is long ways off, and given all the
> incompatibilities there, it would be a while before users can get their
> hands on EC if it were to be only on 3.x. At best, this may force sites
> that want EC to backport the entire EC feature to older releases, at worst
> this will be repeat the mess of 0.20 security release forks.
>
> If we think adding this to 2.8 (even if it switched off) is too much risk
> per our original plan, let’s move this to 2.9, there by leaving enough time
> for stability, integration testing and bake-in, and a realistic chance of
> having it end up on users’ clusters soonish.
>
> +Vinod
>
> > On Oct 19, 2015, at 1:44 PM, Andrew Wang 
> wrote:
> >
> > I think our plan thus far has been to target this for 3.0. I'm okay with
> > putting it in branch-2 if we've given a hard look at compatibility, but
> > I'll note though that 2.8 is already looking like quite a large release,
> > and our release bandwidth has been focused on the 2.6 and 2.7 maintenance
> > releases. Adding another multi-hundred JIRAs to 2.8 might make it too
> > unwieldy to get out the door. If we bump EC past that, 3.0 might very
> well
> > be our next release vehicle. I do plan to revive the 3.0 schedule some
> time
> > next year. With EC and JDK8 in a good spot, the only big feature
> remaining
> > is classpath isolation.
> >
> > EC is also a pretty fundamental change to HDFS. Even if it's compatible,
> in
> > terms of size and impact it might best belong in a new major release.
> >
> > Best,
> > Andrew
> >
> > On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
> > vinayakumarb.apa...@gmail.com> wrote:
> >
> >> Is anyone else also thinks that feature is ready to goto branch-2  as
> well?
> >>
> >> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since then
> and
> >> ready to go in branch-2.
> >>
> >> -Vinay
> >> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
> >>
> >>> Thanks Vinay for capturing the issue and Uma for offering the help.
> >>>
> >>> ---
> >>> Zhe Zhang
> >>>
> >>> On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
> >> uma.ganguma...@intel.com
> 
> >>> wrote:
> >>>
>  Vinay,
> 
> 
>  I would merge them as part of HDFS-9182.
> 
>  Thanks,
>  Uma
> 
> 
> 
>  On 10/5/15, 12:48 AM, "Vinayakumar B" 
> wrote:
> 
> > Hi Andrew,
> > I see CHANGES.txt entries not yet merged from
> >> CHANGES-HDFS-EC-7285.txt.
> >
> > Was this intentional?
> >
> > Regards,
> > Vinay
> >
> > On Wed, Sep 30, 2015 at 9:15 PM, Andrew Wang <
> >> andrew.w...@cloudera.com>
> > wrote:
> >
> >> Branch has been merged to trunk, thanks again to everyone who worked
> >>> on
> >> the
> >> feature!
> >>
> >> On Tue, Sep 29, 2015 at 10:44 PM, Zhe Zhang 
> >> wrote:
> >>
> >>> Thanks everyone who has participated in this discussion.
> >>>
> >>> With 7 +1's (5 binding and 2 non-binding), and no -1, this vote
> >> has
> >> passed.
> >>> I will do a final 'git merge' with trunk and work with Andrew to
> >>> merge
> >> the
> >>> branch to trunk. I'll update on this thread when the merge is
> >> done.
> >>>
> >>> ---
> >>> Zhe Zhang
> >>>
> >>> On Thu, Sep 24, 2015 at 11:08 PM, Liu, Yi A 
> >> wrote:
> >>>
>  (Change it to binding.)
> 
>  +1
>  I have been involved in the development and code review on the
> >> feature
>  branch. It's a great feature and I think it's ready to merge it
> >>> into
> >>> trunk.
> 
>  Thanks all for the contribution.
> 
>  Regards,
>  Yi Liu
> 
> 
>  -Original Message-
>  From: Liu, Yi A
>  Sent: Friday, September 25, 2015 1:51 PM
>  To: hdfs-dev@hadoop.apache.org
>  Subject: RE: [VOTE] Merge HDFS-7285 (erasure coding) branch to
> >>> trunk
> 
>  +1 (non-binding)
>  I have been involved in the development and code review on the
> >> feature
>  branch. It's a great 

Erasure coding in branch-2 [Was Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to trunk]

2015-11-02 Thread Vinod Vavilapalli
Forking the thread. Started looking at the 2.8 list, various features’ status 
and arrived here.

While I understand the pervasive nature of EC and a need for a significant 
bake-in, moving this to a 3.x release is not a good idea. We will surely get a 
2.8 out this year and, as needed, I can even spend time getting started on a 
2.9. OTOH, 3.x is long ways off, and given all the incompatibilities there, it 
would be a while before users can get their hands on EC if it were to be only 
on 3.x. At best, this may force sites that want EC to backport the entire EC 
feature to older releases, at worst this will be repeat the mess of 0.20 
security release forks.

If we think adding this to 2.8 (even if it switched off) is too much risk per 
our original plan, let’s move this to 2.9, there by leaving enough time for 
stability, integration testing and bake-in, and a realistic chance of having it 
end up on users’ clusters soonish.

+Vinod

> On Oct 19, 2015, at 1:44 PM, Andrew Wang  wrote:
> 
> I think our plan thus far has been to target this for 3.0. I'm okay with
> putting it in branch-2 if we've given a hard look at compatibility, but
> I'll note though that 2.8 is already looking like quite a large release,
> and our release bandwidth has been focused on the 2.6 and 2.7 maintenance
> releases. Adding another multi-hundred JIRAs to 2.8 might make it too
> unwieldy to get out the door. If we bump EC past that, 3.0 might very well
> be our next release vehicle. I do plan to revive the 3.0 schedule some time
> next year. With EC and JDK8 in a good spot, the only big feature remaining
> is classpath isolation.
> 
> EC is also a pretty fundamental change to HDFS. Even if it's compatible, in
> terms of size and impact it might best belong in a new major release.
> 
> Best,
> Andrew
> 
> On Fri, Oct 16, 2015 at 7:04 PM, Vinayakumar B <
> vinayakumarb.apa...@gmail.com> wrote:
> 
>> Is anyone else also thinks that feature is ready to goto branch-2  as well?
>> 
>> Its > 2 weeks EC landed on trunk. IMo Looks Its quite stable since then and
>> ready to go in branch-2.
>> 
>> -Vinay
>> On Oct 6, 2015 12:51 AM, "Zhe Zhang"  wrote:
>> 
>>> Thanks Vinay for capturing the issue and Uma for offering the help.
>>> 
>>> ---
>>> Zhe Zhang
>>> 
>>> On Mon, Oct 5, 2015 at 12:19 PM, Gangumalla, Uma <
>> uma.ganguma...@intel.com
 
>>> wrote:
>>> 
 Vinay,
 
 
 I would merge them as part of HDFS-9182.
 
 Thanks,
 Uma
 
 
 
 On 10/5/15, 12:48 AM, "Vinayakumar B"  wrote:
 
> Hi Andrew,
> I see CHANGES.txt entries not yet merged from
>> CHANGES-HDFS-EC-7285.txt.
> 
> Was this intentional?
> 
> Regards,
> Vinay
> 
> On Wed, Sep 30, 2015 at 9:15 PM, Andrew Wang <
>> andrew.w...@cloudera.com>
> wrote:
> 
>> Branch has been merged to trunk, thanks again to everyone who worked
>>> on
>> the
>> feature!
>> 
>> On Tue, Sep 29, 2015 at 10:44 PM, Zhe Zhang 
>> wrote:
>> 
>>> Thanks everyone who has participated in this discussion.
>>> 
>>> With 7 +1's (5 binding and 2 non-binding), and no -1, this vote
>> has
>> passed.
>>> I will do a final 'git merge' with trunk and work with Andrew to
>>> merge
>> the
>>> branch to trunk. I'll update on this thread when the merge is
>> done.
>>> 
>>> ---
>>> Zhe Zhang
>>> 
>>> On Thu, Sep 24, 2015 at 11:08 PM, Liu, Yi A 
>> wrote:
>>> 
 (Change it to binding.)
 
 +1
 I have been involved in the development and code review on the
>> feature
 branch. It's a great feature and I think it's ready to merge it
>>> into
>>> trunk.
 
 Thanks all for the contribution.
 
 Regards,
 Yi Liu
 
 
 -Original Message-
 From: Liu, Yi A
 Sent: Friday, September 25, 2015 1:51 PM
 To: hdfs-dev@hadoop.apache.org
 Subject: RE: [VOTE] Merge HDFS-7285 (erasure coding) branch to
>>> trunk
 
 +1 (non-binding)
 I have been involved in the development and code review on the
>> feature
 branch. It's a great feature and I think it's ready to merge it
>>> into
>>> trunk.
 
 Thanks all for the contribution.
 
 Regards,
 Yi Liu
 
 
 -Original Message-
 From: Vinayakumar B [mailto:vinayakum...@apache.org]
 Sent: Friday, September 25, 2015 12:21 PM
 To: hdfs-dev@hadoop.apache.org
 Subject: Re: [VOTE] Merge HDFS-7285 (erasure coding) branch to
>>> trunk
 
 +1,
 
 I've been involved starting from design and development of
>> ErasureCoding.
 I think phase 1 of this development is ready to be merged to
>>> trunk.
 It had come a long way to the current state with significant
>>> effort
>> of
 many Contributors and Revi