Re: Hadoop 2.0 Support for Accumulo 1.4 Branch

2013-10-15 Thread dlmarion
Just to be clear, we are talking about adding profile support to the pom's for 
Hadoop 2.2.0 for a 1.4.5 and 1.5.1 release, correct? We are not talking about 
changing the default build profile for these branches are we? 

- Original Message -

From: Billie Rinaldi billie.rina...@gmail.com 
To: dev@accumulo.apache.org 
Sent: Monday, October 14, 2013 11:57:40 PM 
Subject: Re: Hadoop 2.0 Support for Accumulo 1.4 Branch 

Thanks for the note, Ted. That vote is for 2.2.0, not -beta. 
On Oct 14, 2013 7:30 PM, Ted Yu yuzhih...@gmail.com wrote: 

 w.r.t. hadoop-2 release, see this thread: 
 
 http://search-hadoop.com/m/YSTny19y1Ha1/hadoop+2.2.0 
 
 Looks like 2.2.0-beta would pass votes. 
 
 Cheers 
 
 
 On Mon, Oct 14, 2013 at 7:24 PM, Mike Drob md...@mdrob.com wrote: 
 
  Responses Inline. 
  
  - Mike 
  
  On Mon, Oct 14, 2013 at 12:55 PM, Sean Busbey bus...@cloudera.com 
 wrote: 
  
   Hey All, 
   
   I'd like to restart the conversation from end July / start August about 
   Hadoop 2 support on the 1.4 branch. 
   
   Specifically, I'd like to get some requirements ironed out so I can 
 file 
   one or more jiras. I'd also like to get a plan for application. 
   
   =requirements 
   
   Here's the requirements I have from the last thread: 
   
   1)  Maintain existing 1.4 compatibility 
   
   The only thing I see listed in the pom is Apache release 0.20.203.0. 
  (1.4.4 
   tag)[1] 
   
   I don't see anything in the README[2] nor the user manual[3] on other 
   versions being supported. 
   
   Yep. 
  
  
   2) Gain Hadoop 2 support 
   
   At the moment, I'm presuming this means Apache release 2.0.4-alpha 
 since 
   that's what 1.5.0 builds against for Hadoop 2. 
   
   I haven't been following the Hadoop 2 release schedule that closely, 
 but 
  I 
  think the latest is a 2.1.0-beta? Pretty sure it was released after we 
  finished Accumulo 1.5, so there's no reason not to support it in my mind. 
  Depending on an alpha of something strikes me as either unstable or 
 lazy, 
  although I fully understand that it may be neither. 
  
  
   3) Test for correctness on given versions, with = 5 node cluster 
   
   * Unit Tests 
   * Functional Tests 
   * 24hr continuous + verification 
   * 24hr continuous + verification + agitation 
   * 24hr random walk 
   * 24hr random walk + agitation 
   
   Keith mentioned running these against a CDH4 cluster, but I presume 
 that 
   since Apache Releases are our stated compatibilities it would actually 
 be 
   against whatever versions we list. Based on #1 and #2 above, I would 
  expect 
   that to be Apache Hadoop 0.20.203.0 and Apache Hadoop 2.0.4-alpha. 
   
   Hadoop 2 introduces some neat new things like NN HA, which I think it 
  might be worthwhile to test with. At that level it might be more of a 
  verification of the Hadoop code, but I'd like to be comfortable that our 
  DFS Clients switch correctly. This is in addition to the standard release 
  suite that we run. [1] 
  
  [1]: http://accumulo.apache.org/governance/releasing.html#testing 
  
  
   4) Binary packaging 
   4a) Either source produces a single binary for all accepted versions 
   
   or 
   
   4b) Instructions for building from source for each versions and somehow 
   flag what (if any) convenience binaries are made for the release. 
   
   
  Having run the binary packaging for 1.4.4, I can tell you that it is not 
 in 
  great shape. Christopher cleaned up a lot of the issues in the 1.5 line, 
 so 
  I didn't bother spending a ton of time on them here, but I think RPM and 
  DEB are both broken. It would be nice to be able to specify a Hadoop 2 
  version for compilation, similar to what happens in the newer code base, 
  which could be back ported, I suppose. 4b seems easier. 
  
  =application 
   
   There will be many back-ported patches. Not much active development 
  happens 
   on 1.4.x now, but I presume this should still all go onto a feature 
  branch? 
   
   Is the community preference that eventually all the changes become a 
  single 
   commit (or one-per-subtask if there are multiple jiras) on the active 
 1.4 
   development branch, or that the original patches remain broken out? 
   
   Not sure what you mean by this. 
  
  
   For what it's worth, I'd recommend keeping them broken out. (And that's 
  how 
   the initial development against CDH4 has been done.) 
   
   
   [1] http://bit.ly/1fxucMe 
   [2] http://bit.ly/192zUAJ 
   [3] 
   
  
 http://accumulo.apache.org/1.4/user_manual/Administration.html#Dependencies 
   
   -- 
   Sean 
   
  
 



Re: Hadoop 2.0 Support for Accumulo 1.4 Branch

2013-10-15 Thread Sean Busbey
On Tue, Oct 15, 2013 at 7:16 AM, dlmar...@comcast.net wrote:

 Just to be clear, we are talking about adding profile support to the pom's
 for Hadoop 2.2.0 for a 1.4.5 and 1.5.1 release, correct? We are not talking
 about changing the default build profile for these branches are we?



for 1.4.5-SNAPSHOT I am only talking about adding support Hadoop 2.2.0. I
am not suggesting we change the default from building against Hadoop
0.23.203.


I'm not sure about the change to 1.5.1-SNAPSHOT. I believe we're talking
about changing the hadoop.profile for 2.0 to use the 2.2.0 release. I don't
think it makes sense to change the default off of the version in the
hadoop.profile for 1.0.

Presumably this change would also happen in master. Now that Hadoop 2.x is
going to have a GA release, I think it makes sense to have a discussion
about changing the default to be the hadoop 2.0 profile for master, but
this is not that discussion.


-- 
Sean


Re: Hadoop 2.0 Support for Accumulo 1.4 Branch

2013-10-15 Thread Sean Busbey
On Tue, Oct 15, 2013 at 10:16 AM, Sean Busbey bus...@cloudera.com wrote:


 On Tue, Oct 15, 2013 at 7:16 AM, dlmar...@comcast.net wrote:

 Just to be clear, we are talking about adding profile support to the
 pom's for Hadoop 2.2.0 for a 1.4.5 and 1.5.1 release, correct? We are not
 talking about changing the default build profile for these branches are we?



 for 1.4.5-SNAPSHOT I am only talking about adding support Hadoop 2.2.0. I
 am not suggesting we change the default from building against Hadoop
 0.23.203.



I mean 0.20.203.0. Ugh, Hadoop versions.

-- 
Sean


Re: Hadoop 2.0 Support for Accumulo 1.4 Branch

2013-10-15 Thread Joey Echeverria
I think you meant:

Ugh, Hadoop versions.[1]

[1]
http://blog.cloudera.com/blog/2012/04/apache-hadoop-versions-looking-ahead-3/


On Tue, Oct 15, 2013 at 11:20 AM, Sean Busbey bus...@cloudera.com wrote:

 On Tue, Oct 15, 2013 at 10:16 AM, Sean Busbey bus...@cloudera.com wrote:

 
  On Tue, Oct 15, 2013 at 7:16 AM, dlmar...@comcast.net wrote:
 
  Just to be clear, we are talking about adding profile support to the
  pom's for Hadoop 2.2.0 for a 1.4.5 and 1.5.1 release, correct? We are
 not
  talking about changing the default build profile for these branches are
 we?
 
 
 
  for 1.4.5-SNAPSHOT I am only talking about adding support Hadoop 2.2.0. I
  am not suggesting we change the default from building against Hadoop
  0.23.203.
 
 
 
 I mean 0.20.203.0. Ugh, Hadoop versions.

 --
 Sean




-- 
Joey Echeverria
Director, Federal FTS
Cloudera, Inc.


Functional test broken on 1.4.5-SNAPSHOT and 1.5.1-SNAPSHOT

2013-10-15 Thread Sean Busbey
Could a committer take a look at ACCUMULO-1775[1]?

Right now, functional tests fail when run on a clean machine with the 1.4.4
release and the 1.4.x and 1.5.x development branches.

The fix is small.

-Sean

[1]: https://issues.apache.org/jira/browse/ACCUMULO-1775

-- 
Sean


Re: Functional test broken on 1.4.5-SNAPSHOT and 1.5.1-SNAPSHOT

2013-10-15 Thread Josh Elser

Yeah, I was planning on applying this shortly.

On 10/15/13 12:50 PM, Sean Busbey wrote:

Could a committer take a look at ACCUMULO-1775[1]?

Right now, functional tests fail when run on a clean machine with the 1.4.4
release and the 1.4.x and 1.5.x development branches.

The fix is small.

-Sean

[1]: https://issues.apache.org/jira/browse/ACCUMULO-1775



Re: Functional test broken on 1.4.5-SNAPSHOT and 1.5.1-SNAPSHOT

2013-10-15 Thread Sean Busbey
On Oct 15, 2013 11:58 AM, Josh Elser josh.el...@gmail.com wrote:

 Yeah, I was planning on applying this shortly.



Sweet, thanks Josh!

-- 
Sean


Re: [VOTE] Accumulo Instamo Archetype 1.4.4-RC3

2013-10-15 Thread Josh Elser
Uhhh, Mike -- 
https://repository.apache.org/content/repositories/orgapacheaccumulo-156/archetype-catalog.xml


I'm confused as to what you meant now (still working on adding the 
staging repo locally to test it out properly).


On 10/13/13 12:29 AM, Mike Drob wrote:

The released tarball has a DEPENDENCIES file that is not in source control.
I think we talked about this last time, but wanted to confirm that you were
aware of this. All other files match. I don't think this is an issue, since
iirc that file is auto-gen'd.

Signature on the source release matches the one in the KEYS file.

There is no archetype-catalog.xml file deployed, so I have to mvn install
before I am able to generate projects using the archetype. After I install
however, everything works fine.

+1, but I definitely want to see the archetype-catalog.xml taken care of
for the next release. Not going to be picky about it on this one because
there is an easy workaround. (Maybe it needs to be documented somewhere,
though?)


On Sat, Oct 12, 2013 at 5:56 PM, Josh Elser josh.el...@gmail.com wrote:


Thanks for taking the time to review, Keith.

This is scheduled to close tonight. Can any other devs weigh in, please?


On 10/11/2013 05:35 PM, Keith Turner wrote:


signature and checksum on the soruce tar looks good.
Runs fine


+1


On Wed, Oct 9, 2013 at 7:20 PM, Josh Elser josh.el...@gmail.com wrote:

  Round 3, everyone. Changes over RC2:


* Manually set tarLongFileMode=gnu for maven-assembly-plugin
* Remove maven-source-plugin as Apache parent pom provides this
* Remove unnecessary entries from LICENSE and NOTICE
* Add LICENSE and NOTICE to the archetype such that the project the
archetype generates also has said LICENSE and NOTICE
* Rename artifact from 'instamo-archetype' to
'accumulo-instamo-archetype'
for clarity

Git repo: https://git-wip-us.apache.org/repos/asf/accumulo-instamo-*
*** https://git-wip-us.apache.org/**repos/asf/accumulo-instamo-**
archetype.githttps://git-wip-**us.apache.org/repos/asf/**
accumulo-instamo-archetype.githttps://git-wip-us.apache.org/repos/asf/accumulo-instamo-archetype.git
**
Tag: 1.4.4 (f2b2e49ec8ce9a60c298ccdcfc763a6fd05f8da1)
Staging repo: 
https://repository.apache.org/content/repositories/**https://repository.apache.org/**content/repositories/**
orgapacheaccumulo-156/https:/**/repository.apache.org/**
content/repositories/**orgapacheaccumulo-156/https://repository.apache.org/content/repositories/orgapacheaccumulo-156/



Source release tarball: 
https://repository.apache.org/https://repository.apache.org/**
content/repositories/orgapacheaccumulo-156/org/**
apache/accumulo/accumulo-instamo-archetype/1.4.4/**
accumulo-instamo-archetype-1.4.4-source-release.tar.gzhtt**
ps://repository.apache.org/**content/repositories/**
orgapacheaccumulo-156/org/**apache/accumulo/accumulo-**
instamo-archetype/1.4.4/**accumulo-instamo-archetype-1.**
4.4-source-release.tar.gzhttps://repository.apache.org/content/repositories/orgapacheaccumulo-156/org/apache/accumulo/accumulo-instamo-archetype/1.4.4/accumulo-instamo-archetype-1.4.4-source-release.tar.gz




Pending successful vote after 72hrs, the staging repo will be promoted. I
changed how I was doing things before 1.4.4-RCx tags as it would
require
more manual intervention in the release process. (sidebar: will be
writing
up this information on the website to make 1.6.0 hopefully easier)

- Josh








Re: [VOTE] Accumulo Instamo Archetype 1.4.4-RC3

2013-10-15 Thread Josh Elser

Ok, I apparently shouldn't have taken Mike's word as gospel :)

profile
idinstamo-staging/id
repositories
repository
idinstamo/id
nameASF staging repo/name

urlhttps://repository.apache.org/content/repositories/orgapacheaccumulo-156//url
layoutdefault/layout
/repository
/repositories
/profile

in my settings.xml, and invoking

mvn -Pinstamo-staging archetype:generate 
-DarchetypeArtifactId=accumulo-instamo-archetype 
-DarchetypeVersion=1.4.4 -DarchetypeGroupId=org.apache.accumulo


Worked for me. Can someone else verify please? I technically never 
closed the vote (I guess?), so I change my -1 to a +1. I'd still like to 
get another vote from the community though to make sure it's kosher.


On 10/14/13 11:19 AM, Josh Elser wrote:

Yah, I agree.

If it comes down to having to write something else just to get what we
want to begin with (one-command install), we should just do it correctly.

Self -1

On 10/14/2013 11:15 AM, Michael Berman wrote:

One of the big advantages of an archetype is that it's convenient and
consistent to use...I agree that requiring a download and install in
order
to use the archetype really cuts down on its usefulness.  I don't get a
vote, but if I did, I think I would need a clean invocation path in order
to give my +1.  (And I don't think a custom one-off shell script that
messes with someone's local repo would qualify)


On Mon, Oct 14, 2013 at 7:19 AM, Mike Drob md...@mdrob.com wrote:


On Oct 14, 2013 12:14 AM, Josh Elser josh.el...@gmail.com wrote:

Well, dang.

My intent was definitely to *not* make a user have to download, install

and then invoke the archetype, but provide a single maven command to
be run
to get up and running.

Could wrap this up in a shell script? Installing to the local
repo/catalog
is a pretty nasty side effect though.

I hate doing it, but that might be a non-starter (even as the guy

repeatedly cutting these releases). I haven't decided my opinion on
whether
or not this is super important for a 1.4.x version of the code.
Thanks for
the information either way, Mike.


On 10/13/2013 12:29 AM, Mike Drob wrote:

There is no archetype-catalog.xml file deployed, so I have to mvn

install

before I am able to generate projects using the archetype. After I

install

however, everything works fine.






accumulo pull request: ACCUMULO-1730 improvements

2013-10-15 Thread jstoneham
Github user jstoneham closed the pull request at:

https://github.com/apache/accumulo/pull/1



accumulo pull request: ACCUMULO-1730 improvements

2013-10-15 Thread jstoneham
Github user jstoneham closed the pull request at:

https://github.com/apache/accumulo/pull/2



accumulo pull request: ACCUMULO-1730 improvements

2013-10-15 Thread jstoneham
GitHub user jstoneham opened a pull request:

https://github.com/apache/accumulo/pull/3

ACCUMULO-1730 improvements

Correct start and end indices for AND and OR nodes.

See https://issues.apache.org/jira/browse/ACCUMULO-1730 for discussion.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/jstoneham/accumulo ACCUMULO-1730

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/accumulo/pull/3.patch


commit 9ceb320e1742a7b0fbb3bd753d28453e9313f517
Author: John Stoneham jstone...@texeltek.com
Date:   2013-10-15T17:14:14Z

ACCUMULO-1730 Rename variable for clarity.

commit 33118cb721b9598af5cbe26cd4048ecb7189f0a0
Author: John Stoneham jstone...@texeltek.com
Date:   2013-10-15T17:15:00Z

ACCUMULO-1730 Correct ParseTree start/end markers for logical connector 
nodes.

commit 876c5ce5ee9a039d56c8e4c7ca04d0401a47cdac
Author: John Stoneham jstone...@texeltek.com
Date:   2013-10-15T17:15:19Z

ACCUMULO-1730 Expose ColumnVisibility normalize/stringify methods.





Re: accumulo pull request: ACCUMULO-1730 improvements

2013-10-15 Thread Eric Newton
Committed and pushed.

-Eric


On Tue, Oct 15, 2013 at 1:17 PM, jstoneham g...@git.apache.org wrote:
 GitHub user jstoneham opened a pull request:

 https://github.com/apache/accumulo/pull/3

 ACCUMULO-1730 improvements

 Correct start and end indices for AND and OR nodes.

 See https://issues.apache.org/jira/browse/ACCUMULO-1730 for discussion.

 You can merge this pull request into a Git repository by running:

 $ git pull https://github.com/jstoneham/accumulo ACCUMULO-1730

 Alternatively you can review and apply these changes as the patch at:

 https://github.com/apache/accumulo/pull/3.patch

 
 commit 9ceb320e1742a7b0fbb3bd753d28453e9313f517
 Author: John Stoneham jstone...@texeltek.com
 Date:   2013-10-15T17:14:14Z

 ACCUMULO-1730 Rename variable for clarity.

 commit 33118cb721b9598af5cbe26cd4048ecb7189f0a0
 Author: John Stoneham jstone...@texeltek.com
 Date:   2013-10-15T17:15:00Z

 ACCUMULO-1730 Correct ParseTree start/end markers for logical connector 
 nodes.

 commit 876c5ce5ee9a039d56c8e4c7ca04d0401a47cdac
 Author: John Stoneham jstone...@texeltek.com
 Date:   2013-10-15T17:15:19Z

 ACCUMULO-1730 Expose ColumnVisibility normalize/stringify methods.

 



accumulo pull request: ACCUMULO-1730 improvements

2013-10-15 Thread jstoneham
Github user jstoneham closed the pull request at:

https://github.com/apache/accumulo/pull/3



Contributions from github pull requests and license grant

2013-10-15 Thread Sean Busbey
I don't recall a discussion happening about granting license to the ASF
when the repo moved to git. Looking at the git workflow guide[1], I don't
see any mention of licensing.

In other projects (e.g. Avro[2] and Hive[3]) assigning license to the ASF
is an important part of submitting a contribution. In those projects,
people are instructed to attach their patches to an open jira so that
license can be granted; pull requests from github generally aren't allowed.

Do we have a stance as a project on this? I think those two projects
basically say that the grant is required by the Apache License itself. Are
there ASF rules that can provide guidance on this?

-Sean

[1]: http://accumulo.apache.org/git.html
[2]:
https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
[3]:
https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork

-- 
Sean


Re: [VOTE] Accumulo Instamo Archetype 1.4.4-RC3

2013-10-15 Thread Christopher
If you need another vote, you can have my +1

I verified:

* good GPG sigs, sha1s, md5s
* source tarball and zip match git tag (except generated DEPENDENCIES
file, which is okay)
* HEAD of tag matches specified commit sha1
* tag builds reproducibly (mvn verify -P apache-release)
* can create new project from archetype in staging repo
* created project can build (mvn verify)
* created project can run MapReduce and shell commands in README
* no obvious problems in POM
* contents of jar files look fine

Encountered issues (not serious, can be improved in future releases or
not at all):

* not sure tarLongFileMode=gnu worked (file *.tar.gz still reports it
as gzip compressed data, from FAT filesystem)
* unit test that starts mini should be IT instead


--
Christopher L Tubbs II
http://gravatar.com/ctubbsii


On Tue, Oct 15, 2013 at 1:24 PM, Josh Elser josh.el...@gmail.com wrote:
 Ok, I apparently shouldn't have taken Mike's word as gospel :)

 profile
 idinstamo-staging/id
 repositories
 repository
 idinstamo/id
 nameASF staging repo/name

 urlhttps://repository.apache.org/content/repositories/orgapacheaccumulo-156//url
 layoutdefault/layout
 /repository
 /repositories
 /profile

 in my settings.xml, and invoking

 mvn -Pinstamo-staging archetype:generate
 -DarchetypeArtifactId=accumulo-instamo-archetype -DarchetypeVersion=1.4.4
 -DarchetypeGroupId=org.apache.accumulo

 Worked for me. Can someone else verify please? I technically never closed
 the vote (I guess?), so I change my -1 to a +1. I'd still like to get
 another vote from the community though to make sure it's kosher.


 On 10/14/13 11:19 AM, Josh Elser wrote:

 Yah, I agree.

 If it comes down to having to write something else just to get what we
 want to begin with (one-command install), we should just do it correctly.

 Self -1

 On 10/14/2013 11:15 AM, Michael Berman wrote:

 One of the big advantages of an archetype is that it's convenient and
 consistent to use...I agree that requiring a download and install in
 order
 to use the archetype really cuts down on its usefulness.  I don't get a
 vote, but if I did, I think I would need a clean invocation path in order
 to give my +1.  (And I don't think a custom one-off shell script that
 messes with someone's local repo would qualify)


 On Mon, Oct 14, 2013 at 7:19 AM, Mike Drob md...@mdrob.com wrote:

 On Oct 14, 2013 12:14 AM, Josh Elser josh.el...@gmail.com wrote:

 Well, dang.

 My intent was definitely to *not* make a user have to download, install

 and then invoke the archetype, but provide a single maven command to
 be run
 to get up and running.

 Could wrap this up in a shell script? Installing to the local
 repo/catalog
 is a pretty nasty side effect though.

 I hate doing it, but that might be a non-starter (even as the guy

 repeatedly cutting these releases). I haven't decided my opinion on
 whether
 or not this is super important for a 1.4.x version of the code.
 Thanks for
 the information either way, Mike.


 On 10/13/2013 12:29 AM, Mike Drob wrote:

 There is no archetype-catalog.xml file deployed, so I have to mvn

 install

 before I am able to generate projects using the archetype. After I

 install

 however, everything works fine.






Re: Contributions from github pull requests and license grant

2013-10-15 Thread Josh Elser
We should probably be ensuring that we have something stating that the 
contributor is assigning license to the ASF. I remember talking about 
this, but I don't recall the outcome (will have to search history).


I'm not sure if there are specific steps that need to be formally 
followed or if the contributor stating the above on a Jira issue is 
sufficient.


On 10/15/13 3:50 PM, Sean Busbey wrote:

I don't recall a discussion happening about granting license to the ASF
when the repo moved to git. Looking at the git workflow guide[1], I don't
see any mention of licensing.

In other projects (e.g. Avro[2] and Hive[3]) assigning license to the ASF
is an important part of submitting a contribution. In those projects,
people are instructed to attach their patches to an open jira so that
license can be granted; pull requests from github generally aren't allowed.

Do we have a stance as a project on this? I think those two projects
basically say that the grant is required by the Apache License itself. Are
there ASF rules that can provide guidance on this?

-Sean

[1]: http://accumulo.apache.org/git.html
[2]:
https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
[3]:
https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork



Re: Contributions from github pull requests and license grant

2013-10-15 Thread Billie Rinaldi
Here [1] is another data point.




On Tue, Oct 15, 2013 at 1:03 PM, Josh Elser josh.el...@gmail.com wrote:

 We should probably be ensuring that we have something stating that the
 contributor is assigning license to the ASF. I remember talking about this,
 but I don't recall the outcome (will have to search history).

 I'm not sure if there are specific steps that need to be formally followed
 or if the contributor stating the above on a Jira issue is sufficient.


 On 10/15/13 3:50 PM, Sean Busbey wrote:

 I don't recall a discussion happening about granting license to the ASF
 when the repo moved to git. Looking at the git workflow guide[1], I don't
 see any mention of licensing.

 In other projects (e.g. Avro[2] and Hive[3]) assigning license to the ASF
 is an important part of submitting a contribution. In those projects,
 people are instructed to attach their patches to an open jira so that
 license can be granted; pull requests from github generally aren't
 allowed.

 Do we have a stance as a project on this? I think those two projects
 basically say that the grant is required by the Apache License itself. Are
 there ASF rules that can provide guidance on this?

 -Sean

 [1]: 
 http://accumulo.apache.org/**git.htmlhttp://accumulo.apache.org/git.html
 [2]:
 https://cwiki.apache.org/**confluence/display/AVRO/How+**
 To+Contribute#HowToContribute-**Contributingyourworkhttps://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
 [3]:
 https://cwiki.apache.org/**confluence/display/Hive/**HowToContribute#**
 HowToContribute-**Contributingyourworkhttps://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork




Re: Contributions from github pull requests and license grant

2013-10-15 Thread Billie Rinaldi
Oops, sorry [1].

All of these links still say that you explicitly need to check a box
assigning license to the ASF.  That is not actually true: any code
submitted through ASF infrastructure is assumed to have license assigned to
the ASF, and the checkbox used to exist as a way not to assign license.
The checkbox no longer exists.  Anyway, I haven't seen any guidance yet
that says we can accept pull requests without the contents of the patch
also being attached to a ticket.

[1]:
http://karaf.apache.org/manual/latest-2.3.x/developers-guide/github-contributions.html


On Tue, Oct 15, 2013 at 1:08 PM, Billie Rinaldi billie.rina...@gmail.comwrote:

 Here [1] is another data point.




 On Tue, Oct 15, 2013 at 1:03 PM, Josh Elser josh.el...@gmail.com wrote:

 We should probably be ensuring that we have something stating that the
 contributor is assigning license to the ASF. I remember talking about this,
 but I don't recall the outcome (will have to search history).

 I'm not sure if there are specific steps that need to be formally
 followed or if the contributor stating the above on a Jira issue is
 sufficient.


 On 10/15/13 3:50 PM, Sean Busbey wrote:

 I don't recall a discussion happening about granting license to the ASF
 when the repo moved to git. Looking at the git workflow guide[1], I don't
 see any mention of licensing.

 In other projects (e.g. Avro[2] and Hive[3]) assigning license to the ASF
 is an important part of submitting a contribution. In those projects,
 people are instructed to attach their patches to an open jira so that
 license can be granted; pull requests from github generally aren't
 allowed.

 Do we have a stance as a project on this? I think those two projects
 basically say that the grant is required by the Apache License itself.
 Are
 there ASF rules that can provide guidance on this?

 -Sean

 [1]: 
 http://accumulo.apache.org/**git.htmlhttp://accumulo.apache.org/git.html
 [2]:
 https://cwiki.apache.org/**confluence/display/AVRO/How+**
 To+Contribute#HowToContribute-**Contributingyourworkhttps://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
 [3]:
 https://cwiki.apache.org/**confluence/display/Hive/**HowToContribute#**
 HowToContribute-**Contributingyourworkhttps://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork





Re: Contributions from github pull requests and license grant

2013-10-15 Thread Christopher
It is my understanding that:

The status quo hasn't changed just because we've moved to git.
Committers are still required to perform their due diligence to ensure
that any patch applied has the proper assignment to the ASF. To be
clear, the relevant issue is that patches applied via a pull request
make it easier for committers to overlook that step (because git makes
it so easy!), whereas patches attached to a JIRA are a little more
clear, because by submitting code to JIRA on ASF's infrastructure, one
can assume (unless otherwise stated) that it is owned by the ASF
(although, technically, JIRA doesn't have any Terms of Service stating
this when you sign up for an account... I just checked).

We should have something on our page page that describes a procedure
for explicitly indicating the patch associated with a pull request is
being assigned to the ASF. Perhaps we should just say, pull requests
are fine, but indicate the assignment explicitly on the ticket
itself.

--
Christopher L Tubbs II
http://gravatar.com/ctubbsii


On Tue, Oct 15, 2013 at 3:50 PM, Sean Busbey bus...@cloudera.com wrote:
 I don't recall a discussion happening about granting license to the ASF
 when the repo moved to git. Looking at the git workflow guide[1], I don't
 see any mention of licensing.

 In other projects (e.g. Avro[2] and Hive[3]) assigning license to the ASF
 is an important part of submitting a contribution. In those projects,
 people are instructed to attach their patches to an open jira so that
 license can be granted; pull requests from github generally aren't allowed.

 Do we have a stance as a project on this? I think those two projects
 basically say that the grant is required by the Apache License itself. Are
 there ASF rules that can provide guidance on this?

 -Sean

 [1]: http://accumulo.apache.org/git.html
 [2]:
 https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
 [3]:
 https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork

 --
 Sean


Re: Contributions from github pull requests and license grant

2013-10-15 Thread Billie Rinaldi
On Tue, Oct 15, 2013 at 1:45 PM, Christopher ctubb...@apache.org wrote:

 It is my understanding that:

 The status quo hasn't changed just because we've moved to git.
 Committers are still required to perform their due diligence to ensure
 that any patch applied has the proper assignment to the ASF. To be
 clear, the relevant issue is that patches applied via a pull request
 make it easier for committers to overlook that step (because git makes
 it so easy!), whereas patches attached to a JIRA are a little more
 clear, because by submitting code to JIRA on ASF's infrastructure, one
 can assume (unless otherwise stated) that it is owned by the ASF
 (although, technically, JIRA doesn't have any Terms of Service stating
 this when you sign up for an account... I just checked).


The licensing of a Contribution is explicitly discussed in the Apache
License.  So the licensing isn't assumed because it's ASF JIRA, it's
assumed because our code is licensed under the Apache License, and any
Contribution to it is therefore also Apache licensed.

Judging by how other projects are handling this, I am left to conclude that
a Contribution must consist of a patch itself, and not a reference to a
patch that exists outside of our infrastructure.  I have not yet found
explicit guidance about this.



 We should have something on our page page that describes a procedure
 for explicitly indicating the patch associated with a pull request is
 being assigned to the ASF. Perhaps we should just say, pull requests
 are fine, but indicate the assignment explicitly on the ticket
 itself.

 --
 Christopher L Tubbs II
 http://gravatar.com/ctubbsii


 On Tue, Oct 15, 2013 at 3:50 PM, Sean Busbey bus...@cloudera.com wrote:
  I don't recall a discussion happening about granting license to the ASF
  when the repo moved to git. Looking at the git workflow guide[1], I don't
  see any mention of licensing.
 
  In other projects (e.g. Avro[2] and Hive[3]) assigning license to the ASF
  is an important part of submitting a contribution. In those projects,
  people are instructed to attach their patches to an open jira so that
  license can be granted; pull requests from github generally aren't
 allowed.
 
  Do we have a stance as a project on this? I think those two projects
  basically say that the grant is required by the Apache License itself.
 Are
  there ASF rules that can provide guidance on this?
 
  -Sean
 
  [1]: http://accumulo.apache.org/git.html
  [2]:
 
 https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
  [3]:
 
 https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork
 
  --
  Sean



Re: Contributions from github pull requests and license grant

2013-10-15 Thread Mike Drob
This discussion [1] from legal-discuss seems to be the most recent word
regarding github pull requests, and looks like they are fair game.

[1]: http://markmail.org/thread/3gvsg4qgc2khve27


On Tue, Oct 15, 2013 at 6:34 PM, Billie Rinaldi billie.rina...@gmail.comwrote:

 On Tue, Oct 15, 2013 at 1:45 PM, Christopher ctubb...@apache.org wrote:

  It is my understanding that:
 
  The status quo hasn't changed just because we've moved to git.
  Committers are still required to perform their due diligence to ensure
  that any patch applied has the proper assignment to the ASF. To be
  clear, the relevant issue is that patches applied via a pull request
  make it easier for committers to overlook that step (because git makes
  it so easy!), whereas patches attached to a JIRA are a little more
  clear, because by submitting code to JIRA on ASF's infrastructure, one
  can assume (unless otherwise stated) that it is owned by the ASF
  (although, technically, JIRA doesn't have any Terms of Service stating
  this when you sign up for an account... I just checked).
 

 The licensing of a Contribution is explicitly discussed in the Apache
 License.  So the licensing isn't assumed because it's ASF JIRA, it's
 assumed because our code is licensed under the Apache License, and any
 Contribution to it is therefore also Apache licensed.

 Judging by how other projects are handling this, I am left to conclude that
 a Contribution must consist of a patch itself, and not a reference to a
 patch that exists outside of our infrastructure.  I have not yet found
 explicit guidance about this.


 
  We should have something on our page page that describes a procedure
  for explicitly indicating the patch associated with a pull request is
  being assigned to the ASF. Perhaps we should just say, pull requests
  are fine, but indicate the assignment explicitly on the ticket
  itself.
 
  --
  Christopher L Tubbs II
  http://gravatar.com/ctubbsii
 
 
  On Tue, Oct 15, 2013 at 3:50 PM, Sean Busbey bus...@cloudera.com
 wrote:
   I don't recall a discussion happening about granting license to the ASF
   when the repo moved to git. Looking at the git workflow guide[1], I
 don't
   see any mention of licensing.
  
   In other projects (e.g. Avro[2] and Hive[3]) assigning license to the
 ASF
   is an important part of submitting a contribution. In those projects,
   people are instructed to attach their patches to an open jira so that
   license can be granted; pull requests from github generally aren't
  allowed.
  
   Do we have a stance as a project on this? I think those two projects
   basically say that the grant is required by the Apache License itself.
  Are
   there ASF rules that can provide guidance on this?
  
   -Sean
  
   [1]: http://accumulo.apache.org/git.html
   [2]:
  
 
 https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
   [3]:
  
 
 https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork
  
   --
   Sean
 



Re: Contributions from github pull requests and license grant

2013-10-15 Thread Christopher
On Tue, Oct 15, 2013 at 6:34 PM, Billie Rinaldi
billie.rina...@gmail.com wrote:

 The licensing of a Contribution is explicitly discussed in the Apache
 License.  So the licensing isn't assumed because it's ASF JIRA, it's
 assumed because our code is licensed under the Apache License, and any
 Contribution to it is therefore also Apache licensed.

I think we (specifically, me) are getting mixed up on the terminology.
I didn't think we were talking about what the license is
(contributions are most certainly AL 2.0 licensed), so much as its
assignment (or perhaps, rather, the assignment of ownership or
copyright), which determines things like whether the license can be
changed (relicensed under a future AL 3.0, for instance), and how we
cite it (eg. do we have to put something separate in the NOTICE file).

 Judging by how other projects are handling this, I am left to conclude that
 a Contribution must consist of a patch itself, and not a reference to a
 patch that exists outside of our infrastructure.  I have not yet found
 explicit guidance about this.

If that's true, then I'm not sure how it benefits us to have the ASF
infrastructure forward pull requests to the GitHub mirror to our dev
list.
Instead, we should emphasize how to create patches with git
format-patch, and how to apply them with a quick curl JIRA patch
url | git am or similar command.


Re: Contributions from github pull requests and license grant

2013-10-15 Thread Christopher
This quote from that thread seems particularly relevant:

Again, there is no such requirement [to publish via JIRA] for
commits/pushes at Apache. The person responsible for moving the bits
into our repository is responsible for verifying that they have the
right to do so before the push is made. (I take it that right to do
so means that the patch is free of IP).

The fact that pull requests include the SHA1 seem to provide
sufficient documentation of intent... but it probably means we need to
preserve those SHA1s by not rebase-ing pull requests, and only doing a
merge, when we apply them (so that the mapping of the documentation of
intent on the dev list to the actual bits in the repo is preserved).

Additionally, we need to be careful when applying a pull request that
we don't pull in additional commits from the repo that weren't in the
original pull request. That means, we need to follow the instructions
in the pull request carefully. The instructions in the pull request
show how to do a merge from the originating repo's url... which could
have additional commits (or may not even exist anymore)... and not the
pull request url. It's probably safer to apply the patch from the pull
request with 'curl patchUrl | git am' or the url to the pull request
(if possible) instead of the originator's repo.

--
Christopher L Tubbs II
http://gravatar.com/ctubbsii


On Tue, Oct 15, 2013 at 6:49 PM, Mike Drob md...@mdrob.com wrote:
 This discussion [1] from legal-discuss seems to be the most recent word
 regarding github pull requests, and looks like they are fair game.

 [1]: http://markmail.org/thread/3gvsg4qgc2khve27


 On Tue, Oct 15, 2013 at 6:34 PM, Billie Rinaldi 
 billie.rina...@gmail.comwrote:

 On Tue, Oct 15, 2013 at 1:45 PM, Christopher ctubb...@apache.org wrote:

  It is my understanding that:
 
  The status quo hasn't changed just because we've moved to git.
  Committers are still required to perform their due diligence to ensure
  that any patch applied has the proper assignment to the ASF. To be
  clear, the relevant issue is that patches applied via a pull request
  make it easier for committers to overlook that step (because git makes
  it so easy!), whereas patches attached to a JIRA are a little more
  clear, because by submitting code to JIRA on ASF's infrastructure, one
  can assume (unless otherwise stated) that it is owned by the ASF
  (although, technically, JIRA doesn't have any Terms of Service stating
  this when you sign up for an account... I just checked).
 

 The licensing of a Contribution is explicitly discussed in the Apache
 License.  So the licensing isn't assumed because it's ASF JIRA, it's
 assumed because our code is licensed under the Apache License, and any
 Contribution to it is therefore also Apache licensed.

 Judging by how other projects are handling this, I am left to conclude that
 a Contribution must consist of a patch itself, and not a reference to a
 patch that exists outside of our infrastructure.  I have not yet found
 explicit guidance about this.


 
  We should have something on our page page that describes a procedure
  for explicitly indicating the patch associated with a pull request is
  being assigned to the ASF. Perhaps we should just say, pull requests
  are fine, but indicate the assignment explicitly on the ticket
  itself.
 
  --
  Christopher L Tubbs II
  http://gravatar.com/ctubbsii
 
 
  On Tue, Oct 15, 2013 at 3:50 PM, Sean Busbey bus...@cloudera.com
 wrote:
   I don't recall a discussion happening about granting license to the ASF
   when the repo moved to git. Looking at the git workflow guide[1], I
 don't
   see any mention of licensing.
  
   In other projects (e.g. Avro[2] and Hive[3]) assigning license to the
 ASF
   is an important part of submitting a contribution. In those projects,
   people are instructed to attach their patches to an open jira so that
   license can be granted; pull requests from github generally aren't
  allowed.
  
   Do we have a stance as a project on this? I think those two projects
   basically say that the grant is required by the Apache License itself.
  Are
   there ASF rules that can provide guidance on this?
  
   -Sean
  
   [1]: http://accumulo.apache.org/git.html
   [2]:
  
 
 https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
   [3]:
  
 
 https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork
  
   --
   Sean
 



Re: Contributions from github pull requests and license grant

2013-10-15 Thread Billie Rinaldi
Brilliant! Thanks for finding that thread, Mike.
On Oct 15, 2013 3:50 PM, Mike Drob md...@mdrob.com wrote:

 This discussion [1] from legal-discuss seems to be the most recent word
 regarding github pull requests, and looks like they are fair game.

 [1]: http://markmail.org/thread/3gvsg4qgc2khve27


 On Tue, Oct 15, 2013 at 6:34 PM, Billie Rinaldi billie.rina...@gmail.com
 wrote:

  On Tue, Oct 15, 2013 at 1:45 PM, Christopher ctubb...@apache.org
 wrote:
 
   It is my understanding that:
  
   The status quo hasn't changed just because we've moved to git.
   Committers are still required to perform their due diligence to ensure
   that any patch applied has the proper assignment to the ASF. To be
   clear, the relevant issue is that patches applied via a pull request
   make it easier for committers to overlook that step (because git makes
   it so easy!), whereas patches attached to a JIRA are a little more
   clear, because by submitting code to JIRA on ASF's infrastructure, one
   can assume (unless otherwise stated) that it is owned by the ASF
   (although, technically, JIRA doesn't have any Terms of Service stating
   this when you sign up for an account... I just checked).
  
 
  The licensing of a Contribution is explicitly discussed in the Apache
  License.  So the licensing isn't assumed because it's ASF JIRA, it's
  assumed because our code is licensed under the Apache License, and any
  Contribution to it is therefore also Apache licensed.
 
  Judging by how other projects are handling this, I am left to conclude
 that
  a Contribution must consist of a patch itself, and not a reference to a
  patch that exists outside of our infrastructure.  I have not yet found
  explicit guidance about this.
 
 
  
   We should have something on our page page that describes a procedure
   for explicitly indicating the patch associated with a pull request is
   being assigned to the ASF. Perhaps we should just say, pull requests
   are fine, but indicate the assignment explicitly on the ticket
   itself.
  
   --
   Christopher L Tubbs II
   http://gravatar.com/ctubbsii
  
  
   On Tue, Oct 15, 2013 at 3:50 PM, Sean Busbey bus...@cloudera.com
  wrote:
I don't recall a discussion happening about granting license to the
 ASF
when the repo moved to git. Looking at the git workflow guide[1], I
  don't
see any mention of licensing.
   
In other projects (e.g. Avro[2] and Hive[3]) assigning license to the
  ASF
is an important part of submitting a contribution. In those projects,
people are instructed to attach their patches to an open jira so that
license can be granted; pull requests from github generally aren't
   allowed.
   
Do we have a stance as a project on this? I think those two projects
basically say that the grant is required by the Apache License
 itself.
   Are
there ASF rules that can provide guidance on this?
   
-Sean
   
[1]: http://accumulo.apache.org/git.html
[2]:
   
  
 
 https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute#HowToContribute-Contributingyourwork
[3]:
   
  
 
 https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-Contributingyourwork
   
--
Sean
  
 



Re: linking to the repos for Accumulo contrib projects

2013-10-15 Thread Mike Drob
New page and also linked from the source page makes the most sense to me.
On Oct 15, 2013 1:20 AM, Sean Busbey bus...@cloudera.com wrote:

 Hiya!

 I was just looking for the current wikisearch example and had some
 difficulty. I ended up tracking through the INFRA ticket to switch to
 git[1] followed by looking at the top level ASF repo list to find the
 specific repo name[2].

 (for those searching in the future who find this message, it's
 accumulo-wikisearch [3])

 Could we add a link to each of the contrib project repos on the main site?
 I don't know if it'd be better on the Source  Guide page, Paper and
 Other Links, or on a new page off of the menu.

 If anyone has a preference I'd like to know before I create a doc jira and
 file a patch. If no one else has a preference, I'd say a new page would be
 appropriate.

 -Sean

 [1]: https://issues.apache.org/jira/browse/INFRA-6392
 [2]: https://git-wip-us.apache.org/repos/asf?s=accumulo*
 [3]:
 https://git-wip-us.apache.org/repos/asf?p=accumulo-wikisearch.git;a=summary

 --
 Sean