Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-13 Thread Upayavira
Diversity as has been discussed has generally been about contributors
not mentors, but while we should be able to expect mentors to act
primarily in the best interests of the ASF not their employer, having
some diversity of mentors is going to be a good thing.

The flaws in the initial landgrab over committership at the beginning of
a podling's life have been demonstrated. I believe it is best to start
with a known set of contributors (from one org is fine) and watch them
demonstrate their ability to open up their project quickly by setting a
low bar to committership.

It does seem like this project now has a wealth of mentor options. I've
seen projects with six mentors, five of which were quickly inactive. It
makes sense I'd say to start a discussion amongst those who have
volunteered to see who is best placed to be an active mentor. A project
choosing their own mentors also makes sense, as these are people they
are going to be working with.

Just some reflections.

Upayavira

On Wed, Feb 13, 2013, at 12:27 AM, Josh Wills wrote:
 I thought that the need for diversity referred to the community, not to
 the
 mentors.
 
 I strongly advocate for newly incubating projects choosing their own
 initial members (you need only search for the epic thread that resulted
 from the Crunch Proposal circa May 2012), but it seems like recent data
 suggests that starting out with a small set of committers from a single
 organization working on a large and complex codebase is a recipe for
 diversity issues and a difficult path out of the incubator.
 
 Kevin et al., please choose whomever you want, but do consider getting
 some
 folks from other organizations to seed the community. It does so much
 good
 for the project over the long term.
 
 
 On Tue, Feb 12, 2013 at 3:31 PM, Devaraj Das d...@hortonworks.com
 wrote:
 
  Hi folks,
  Happy to see such a hugely positive response on the proposal. I'll put
  up a vote tomorrow.
 
  Now, on the mentors, I think we have enough at this point and we have
  also addressed the diversity concern. I had got the confirmation from
  Chris Douglas and Chris Mattmann yesterday afternoon that they would
  participate as mentors in the project. On the nominations for getting
  involved in the project, you are welcome to come in as contributors.
  Individuals in the initial list of committers all have a view point
  that the committers list be kept as such and the standard practice
  (based on contributions, after the project is accepted for incubation)
  for accepting new committers be followed here.
 
  Thanks
  Devaraj.
 
  On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
  kevin.min...@hortonworks.com wrote:
   Hi Bertrand,
   Currently all of the mentors are affiliated with Hortonworks as well.  We
   can reach out to a number of other people and organizations that are
   interested in the project to diversify if this is an important
  consideration
   at this stage.
  
   One other question I have is about creating a wiki to host the proposal.
   I
   don't seem to have privs to create http://wiki.apache.org/incubator/knox
  .
   Would that be the correct place to evolve the proposal?  If so, how do I
  go
   about creating in?
   Kevin.
  
  
   On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:
  
   On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
   kevin.min...@hortonworks.com wrote:
   ...
  
   === Nominated Mentors ===
  
   Owen O’Malley (omalley AT apache DOT org)
   Mahadev Konar (mahadev AT apache DOT org)
   Alan Gates (gates AT apache DOT org)
   Devaraj Das (ddas AT apache DOT org)
  
   ...
  
   Considering all initial committers are from hortonworks, I'd like to
   see affiliations for mentors as well - or at least indicate which
   mentors are *not* affiliated with hortonworks.
  
   -Bertrand
  
   -
   To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
   For additional commands, e-mail: general-h...@incubator.apache.org
  
  
  
   -
   To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
   For additional commands, e-mail: general-h...@incubator.apache.org
  
 
 
  On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
  kevin.min...@hortonworks.com wrote:
   Hi Bertrand,
   Currently all of the mentors are affiliated with Hortonworks as well.  We
   can reach out to a number of other people and organizations that are
   interested in the project to diversify if this is an important
  consideration
   at this stage.
  
   One other question I have is about creating a wiki to host the proposal.
   I
   don't seem to have privs to create http://wiki.apache.org/incubator/knox
  .
   Would that be the correct place to evolve the proposal?  If so, how do I
  go
   about creating in?
   Kevin.
  
  
   On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:
  
   On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
   

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-12 Thread Alex Karasulu
I thought about this a bit last night. If y'all are interested I too could
also mentor the project. That should add some diversity to the mentors
list. I see value in it and would like to see this community succeed.

I'm not affiliated with any company.


On Mon, Feb 11, 2013 at 9:23 PM, Eric Sammer esam...@cloudera.com wrote:

 Kevin:

 Makes complete sense.

 I'd like to offer to join the project, if it's accepted for incubation. I'm
 a committer on MRUnit and Flume, and on the PMC for both. I've helped both
 projects through the incubation phase, and I also know a little bit about
 this Hadoop thing. ;)

 Thanks!


 On Mon, Feb 11, 2013 at 9:28 AM, Kevin Minder
 kevin.min...@hortonworks.comwrote:

  Hi Eric,
  Let me answer your second question first.
 
  Q: Is it your intention to provide job submissions and data ingestion
 APIs
  for MR and HDFS, respectively?
  A: Yes we plan to progress the project to cover all existing ecosystem
  projects.  In addition the project is based on a modular framework that
  allows for each extension to cover services that are either new or
  proprietary.  Certainly there exist very high volume data ingest use
 cases
  for which using a gateway may be impractical but in general the idea is
 to
  support all required client interaction with Hadoop via the gateway.
 
  Now for your first question...
 
  Q: Can you explain a bit more about what the target use case is?
  A: One typical use case will be that the gateway will run in a DMW.  It
  will as you say be integrations with various directory services and is
  extensible to cover those not included.  The gateway will then propagate
  the identity into the Hadoop cluster using Hadoop specific mechanisms.
  The
  key point is that there will typically be a single port open on the
 client
  side to the gateway.  The Hadoop cluster is firewalled, only providing
  access to the Hadoop services to the gateway instances.
  A: Another use case is that an organization is already using some SSO
  solution and the gateway would be integrated with that to verify any SSO
  token and then propagate the identity to the Hadoop services.
 
  I will collect this and add it to the proposal wiki once I have privs to
  create the page.
 
  Thanks!
  Kevin.
 
 
  On 2/11/13 12:03 PM, Eric Sammer wrote:
 
  Kevin:
 
  Interesting proposal. Can you explain a bit more about what the target
 use
  case is? It sounds like there's SSO-ish functionality (presumably a
 doAs()
  machine) with integration with directory services, but the proposal also
  mentions a single point for data and jobs. Is it your intention to
  provide job submissions and data ingestion APIs for MR and HDFS,
  respectively? Do you plan to target other ecosystem projects such as
  HBase?
  Sorry if I missed this in the proposal.
 
  Thanks!
 
 
  On Mon, Feb 11, 2013 at 6:55 AM, Kevin Minder
  kevin.min...@hortonworks.com**wrote:
 
   Knox Gateway Proposal
 
  == Abstract ==
 
  Knox Gateway is a system that provides a single point of secure access
  for
  Apache Hadoop clusters.
 
  == Proposal ==
 
  The Knox Gateway (“Gateway” or “Knox”) is a system that provides a
 single
  point of authentication and access for Apache Hadoop services in a
  cluster.
  The goal is to simplify Hadoop security for both users (i.e. who access
  the
  cluster data and execute jobs) and operators (i.e. who control access
 and
  manage the cluster). The Gateway runs as a server (or cluster of
 servers)
  that serve one or more Hadoop clusters.
 
  Provide perimeter security to make Hadoop security setup easier
  Support authentication and token verification security scenarios
  Deliver users a single cluster end-point that aggregates capabilities
 for
  data and jobs
  Enable integration with enterprise and cloud identity management
  environments
 
  == Background ==
 
  An Apache Hadoop cluster is presented to consumers as a loose
 collection
  of independent services. This makes it difficult for users to interact
  with
  Hadoop since each service maintains it’s own method of access and
  security.
  As well, for operators, configuration and administration of a secure
  Hadoop
  cluster is a complex and many Hadoop clusters are insecure as a result.
 
  == Rationale ==
 
  Organizations that are struggling with Hadoop cluster security result
 in
  a) running Hadoop without security or b) slowing adoption of Hadoop.
 The
  Gateway aims to provide perimeter security that integrates more easily
  into
  existing organizations’ security infrastructure. Doing so will simplify
  security for these organizations and benefit all Hadoop stakeholders
  (i.e.
  users and operators). Additionally, making a dedicated perimeter
 security
  project part of the Apache Hadoop ecosystem will prevent fragmentation
 in
  this area and further increase the value of Hadoop as a data platform.
 
  == Current Status ==
 
  Prototype available, developed by the list of initial committers.
 
  === Meritocracy ===
 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-12 Thread Tom White
This looks like an interesting and useful project. I'd like to volunteer as
a mentor on it.

Thanks,
Tom
On 11 Feb 2013 14:56, Kevin Minder kevin.min...@hortonworks.com wrote:

 Knox Gateway Proposal

 == Abstract ==

 Knox Gateway is a system that provides a single point of secure access for
 Apache Hadoop clusters.

 == Proposal ==

 The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
 point of authentication and access for Apache Hadoop services in a cluster.
 The goal is to simplify Hadoop security for both users (i.e. who access the
 cluster data and execute jobs) and operators (i.e. who control access and
 manage the cluster). The Gateway runs as a server (or cluster of servers)
 that serve one or more Hadoop clusters.

 Provide perimeter security to make Hadoop security setup easier
 Support authentication and token verification security scenarios
 Deliver users a single cluster end-point that aggregates capabilities for
 data and jobs
 Enable integration with enterprise and cloud identity management
 environments

 == Background ==

 An Apache Hadoop cluster is presented to consumers as a loose collection
 of independent services. This makes it difficult for users to interact with
 Hadoop since each service maintains it’s own method of access and security.
 As well, for operators, configuration and administration of a secure Hadoop
 cluster is a complex and many Hadoop clusters are insecure as a result.

 == Rationale ==

 Organizations that are struggling with Hadoop cluster security result in
 a) running Hadoop without security or b) slowing adoption of Hadoop. The
 Gateway aims to provide perimeter security that integrates more easily into
 existing organizations’ security infrastructure. Doing so will simplify
 security for these organizations and benefit all Hadoop stakeholders (i.e.
 users and operators). Additionally, making a dedicated perimeter security
 project part of the Apache Hadoop ecosystem will prevent fragmentation in
 this area and further increase the value of Hadoop as a data platform.

 == Current Status ==

 Prototype available, developed by the list of initial committers.

 === Meritocracy ===

 We desire to build a diverse developer community around Gateway following
 the Apache Way. We want to make the project open source and will encourage
 contributors from multiple organizations following the Apache meritocracy
 model.

 === Community ===

 We hope to extend the user and developer base in the future and build a
 solid open source community around Gateway. Apache Hadoop has a large
 ecosystem of open source projects, each with a strong community of
 contributors. All project communities in this ecosystem have an opportunity
 to participate in the advancement of the Gateway project because
 ultimately, Gateway will enable the security capabilities of their project
 to be more enterprise friendly.

 === Core Developers ===

 Gateway is currently being developed by several engineers from Hortonworks
 - Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit Mohanty.
 All the engineers have deep expertise in middleware, security  identity
 systems and are quite familiar with the Hadoop ecosystem.

 === Alignment ===

 The ASF is a natural host for Gateway given that it is already the home of
 Hadoop, Hive, Pig, HBase, Oozie and other emerging big data software
 projects. Gateway is designed to solve the security challenges familiar to
 the Hadoop ecosystem family of projects.

 == Known Risks ==

 === Orphaned products  Reliance on Salaried Developers ===

 The core developers plan to work full time on the project. We believe that
 this project will be of general interest to many Hadoop users and will
 attract a diverse set of contributors. We intend to demonstrate this by
 having contributors from several organizations recognized as committers by
 the time Knox graduates from incubation.

 === Inexperience with Open Source ===

 All of the core developers are active users and followers of open source.
 As well, Hortonworks has a strong heritage of success with contributions to
 Apache Hadoop Projects.

 === Homogeneous Developers ===

 The current core developers are from Hortonworks, however, we hope to
 establish a developer community that includes contributors from several
 corporations.

 === Reliance on Salaried Developers ===

 Currently, the developers are paid to do work on Gateway. However, once
 the project has a community built around it, we expect to get committers
 and developers from outside the current core developers.

 === Relationships with Other Apache Products ===

 Gateway is going to be used by the users and operators of Hadoop, and the
 Hadoop ecosystem in general.

 === A Excessive Fascination with the Apache Brand ===

 Our interest in developing Gateway in Apache project is to follow an
 established development model, as well since many of the Hadoop ecosystem
 projects also are part of Apache, Gateway will 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-12 Thread Kevin Minder

Hi Hadrian,

Currently I would like to keep the focus on Hadoop services to make sure 
that we can produce a useful product in a reasonable time frame.  We 
have been careful with the initial code however to treat all Hadoop 
integrations as modular plugins.  So we have and will consider 
increasing the scope to other apache projects at the right time.


Also, the code is currently available at https://github.com/hortonworks/knox

Kevin.

On 2/11/13 10:54 PM, Hadrian Zbarcea wrote:

+1 on the idea.
+1 (strong) on the need to increase diversity.

I would suggest to open a bit the scope of the project to target other 
apache projects that act as services. I know it was mentioned, but it 
sounded to me more like a possibility than a focus.


I would also volunteer as a committer and, if you prefer, mentor. I 
assume it would be possible to take a peek at the code grant in the 
coming days.


Cheers,
Hadrian

On 02/11/2013 10:10 AM, Bertrand Delacretaz wrote:

On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
kevin.min...@hortonworks.com wrote:
...

=== Nominated Mentors ===

Owen O’Malley (omalley AT apache DOT org)
Mahadev Konar (mahadev AT apache DOT org)
Alan Gates (gates AT apache DOT org)
Devaraj Das (ddas AT apache DOT org)

...

Considering all initial committers are from hortonworks, I'd like to
see affiliations for mentors as well - or at least indicate which
mentors are *not* affiliated with hortonworks.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org




-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-12 Thread Alejandro Abdelnur
Hi Kevin,

I'd like to offer joining the project if accepted for incubation.

I'm actively working in Hadoop (PMC member) and Oozie (PMC member), helped
driving Oozie through incubation and graduation. I'm also the initial
developer of hadoop-auth and httpfs (standalone WebHDFS gateway). In, Oozie
provides REST APIs for jobs submission (workflows, coordinators, bundles,
mapreduce, streaming, pig). In addition I've been involved in efforts for
providing HTTP access to Hadoop stack.

Thanks.



On Mon, Feb 11, 2013 at 11:54 AM, Kevin Minder kevin.min...@hortonworks.com
 wrote:

 The proposal is now hosted on the wiki.
 http://wiki.apache.org/**incubator/knoxhttp://wiki.apache.org/incubator/knox
 Currently just the initial proposal.  I'll be incorporating
 feedback/clarifications shortly.


 On 2/11/13 11:35 AM, Bertrand Delacretaz wrote:

 Hi,

 On Mon, Feb 11, 2013 at 5:07 PM, Kevin Minder
 kevin.min...@hortonworks.com wrote:

 ...Currently all of the mentors are affiliated with Hortonworks as well.
  We
 can reach out to a number of other people and organizations that are
 interested in the project to diversify if this is an important
 consideration
 at this stage

 I think that would be good - just ask for more mentors on this list.
 Without such diversity this is basically an all-hortonworks podling,
 which is not good IMO.

  I
 don't seem to have privs to create http://wiki.apache.org/**
 incubator/knox http://wiki.apache.org/incubator/knox.
 Would that be the correct place to evolve the proposal?  If so, how do I
 go
 about creating in?...

 Yes, 
 http://wiki.apache.org/**incubator/ishttp://wiki.apache.org/incubator/isthe
  right place - just ask
 for write access on this list, a [wiki] marker in the subject line
 will probably help. I don't think I have the rights to grant that
 access myself.

 -Bertrand

 --**--**-
 To unsubscribe, e-mail: 
 general-unsubscribe@incubator.**apache.orggeneral-unsubscr...@incubator.apache.org
 For additional commands, e-mail: 
 general-help@incubator.apache.**orggeneral-h...@incubator.apache.org



 --**--**-
 To unsubscribe, e-mail: 
 general-unsubscribe@incubator.**apache.orggeneral-unsubscr...@incubator.apache.org
 For additional commands, e-mail: 
 general-help@incubator.apache.**orggeneral-h...@incubator.apache.org




-- 
Alejandro


Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-12 Thread Devaraj Das
Hi folks,
Happy to see such a hugely positive response on the proposal. I'll put
up a vote tomorrow.

Now, on the mentors, I think we have enough at this point and we have
also addressed the diversity concern. I had got the confirmation from
Chris Douglas and Chris Mattmann yesterday afternoon that they would
participate as mentors in the project. On the nominations for getting
involved in the project, you are welcome to come in as contributors.
Individuals in the initial list of committers all have a view point
that the committers list be kept as such and the standard practice
(based on contributions, after the project is accepted for incubation)
for accepting new committers be followed here.

Thanks
Devaraj.

On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
kevin.min...@hortonworks.com wrote:
 Hi Bertrand,
 Currently all of the mentors are affiliated with Hortonworks as well.  We
 can reach out to a number of other people and organizations that are
 interested in the project to diversify if this is an important consideration
 at this stage.

 One other question I have is about creating a wiki to host the proposal.  I
 don't seem to have privs to create http://wiki.apache.org/incubator/knox.
 Would that be the correct place to evolve the proposal?  If so, how do I go
 about creating in?
 Kevin.


 On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:

 On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
 kevin.min...@hortonworks.com wrote:
 ...

 === Nominated Mentors ===

 Owen O’Malley (omalley AT apache DOT org)
 Mahadev Konar (mahadev AT apache DOT org)
 Alan Gates (gates AT apache DOT org)
 Devaraj Das (ddas AT apache DOT org)

 ...

 Considering all initial committers are from hortonworks, I'd like to
 see affiliations for mentors as well - or at least indicate which
 mentors are *not* affiliated with hortonworks.

 -Bertrand

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
kevin.min...@hortonworks.com wrote:
 Hi Bertrand,
 Currently all of the mentors are affiliated with Hortonworks as well.  We
 can reach out to a number of other people and organizations that are
 interested in the project to diversify if this is an important consideration
 at this stage.

 One other question I have is about creating a wiki to host the proposal.  I
 don't seem to have privs to create http://wiki.apache.org/incubator/knox.
 Would that be the correct place to evolve the proposal?  If so, how do I go
 about creating in?
 Kevin.


 On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:

 On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
 kevin.min...@hortonworks.com wrote:
 ...

 === Nominated Mentors ===

 Owen O’Malley (omalley AT apache DOT org)
 Mahadev Konar (mahadev AT apache DOT org)
 Alan Gates (gates AT apache DOT org)
 Devaraj Das (ddas AT apache DOT org)

 ...

 Considering all initial committers are from hortonworks, I'd like to
 see affiliations for mentors as well - or at least indicate which
 mentors are *not* affiliated with hortonworks.

 -Bertrand

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
kevin.min...@hortonworks.com wrote:
 Hi Bertrand,
 Currently all of the mentors are affiliated with Hortonworks as well.  We
 can reach out to a number of other people and organizations that are
 interested in the project to diversify if this is an important consideration
 at this stage.

 One other question I have is about creating a wiki to host the proposal.  I
 don't seem to have privs to create http://wiki.apache.org/incubator/knox.
 Would that be the correct place to evolve the proposal?  If so, how do I go
 about creating in?
 Kevin.


 On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:

 On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
 kevin.min...@hortonworks.com wrote:
 ...

 === Nominated Mentors ===

 Owen O’Malley (omalley AT apache DOT org)
 Mahadev Konar (mahadev AT apache DOT org)
 Alan Gates (gates AT apache DOT org)
 Devaraj Das (ddas AT apache DOT org)

 ...

 Considering all initial committers are from hortonworks, I'd like to
 see affiliations for mentors as well - or at least indicate which
 mentors are *not* affiliated with hortonworks.

 -Bertrand

 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-12 Thread Josh Wills
I thought that the need for diversity referred to the community, not to the
mentors.

I strongly advocate for newly incubating projects choosing their own
initial members (you need only search for the epic thread that resulted
from the Crunch Proposal circa May 2012), but it seems like recent data
suggests that starting out with a small set of committers from a single
organization working on a large and complex codebase is a recipe for
diversity issues and a difficult path out of the incubator.

Kevin et al., please choose whomever you want, but do consider getting some
folks from other organizations to seed the community. It does so much good
for the project over the long term.


On Tue, Feb 12, 2013 at 3:31 PM, Devaraj Das d...@hortonworks.com wrote:

 Hi folks,
 Happy to see such a hugely positive response on the proposal. I'll put
 up a vote tomorrow.

 Now, on the mentors, I think we have enough at this point and we have
 also addressed the diversity concern. I had got the confirmation from
 Chris Douglas and Chris Mattmann yesterday afternoon that they would
 participate as mentors in the project. On the nominations for getting
 involved in the project, you are welcome to come in as contributors.
 Individuals in the initial list of committers all have a view point
 that the committers list be kept as such and the standard practice
 (based on contributions, after the project is accepted for incubation)
 for accepting new committers be followed here.

 Thanks
 Devaraj.

 On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
 kevin.min...@hortonworks.com wrote:
  Hi Bertrand,
  Currently all of the mentors are affiliated with Hortonworks as well.  We
  can reach out to a number of other people and organizations that are
  interested in the project to diversify if this is an important
 consideration
  at this stage.
 
  One other question I have is about creating a wiki to host the proposal.
  I
  don't seem to have privs to create http://wiki.apache.org/incubator/knox
 .
  Would that be the correct place to evolve the proposal?  If so, how do I
 go
  about creating in?
  Kevin.
 
 
  On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:
 
  On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
  kevin.min...@hortonworks.com wrote:
  ...
 
  === Nominated Mentors ===
 
  Owen O’Malley (omalley AT apache DOT org)
  Mahadev Konar (mahadev AT apache DOT org)
  Alan Gates (gates AT apache DOT org)
  Devaraj Das (ddas AT apache DOT org)
 
  ...
 
  Considering all initial committers are from hortonworks, I'd like to
  see affiliations for mentors as well - or at least indicate which
  mentors are *not* affiliated with hortonworks.
 
  -Bertrand
 
  -
  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
  For additional commands, e-mail: general-h...@incubator.apache.org
 
 
 
  -
  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
  For additional commands, e-mail: general-h...@incubator.apache.org
 


 On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
 kevin.min...@hortonworks.com wrote:
  Hi Bertrand,
  Currently all of the mentors are affiliated with Hortonworks as well.  We
  can reach out to a number of other people and organizations that are
  interested in the project to diversify if this is an important
 consideration
  at this stage.
 
  One other question I have is about creating a wiki to host the proposal.
  I
  don't seem to have privs to create http://wiki.apache.org/incubator/knox
 .
  Would that be the correct place to evolve the proposal?  If so, how do I
 go
  about creating in?
  Kevin.
 
 
  On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:
 
  On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
  kevin.min...@hortonworks.com wrote:
  ...
 
  === Nominated Mentors ===
 
  Owen O’Malley (omalley AT apache DOT org)
  Mahadev Konar (mahadev AT apache DOT org)
  Alan Gates (gates AT apache DOT org)
  Devaraj Das (ddas AT apache DOT org)
 
  ...
 
  Considering all initial committers are from hortonworks, I'd like to
  see affiliations for mentors as well - or at least indicate which
  mentors are *not* affiliated with hortonworks.
 
  -Bertrand
 
  -
  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
  For additional commands, e-mail: general-h...@incubator.apache.org
 
 
 
  -
  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
  For additional commands, e-mail: general-h...@incubator.apache.org
 


 On Mon, Feb 11, 2013 at 8:07 AM, Kevin Minder
 kevin.min...@hortonworks.com wrote:
  Hi Bertrand,
  Currently all of the mentors are affiliated with Hortonworks as well.  We
  can reach out to a number of other people and organizations that are
  interested in the project to diversify if this is an 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Kevin Minder

Hi Bertrand,
Currently all of the mentors are affiliated with Hortonworks as well.  
We can reach out to a number of other people and organizations that are 
interested in the project to diversify if this is an important 
consideration at this stage.


One other question I have is about creating a wiki to host the 
proposal.  I don't seem to have privs to create 
http://wiki.apache.org/incubator/knox.  Would that be the correct place 
to evolve the proposal?  If so, how do I go about creating in?

Kevin.

On 2/11/13 10:10 AM, Bertrand Delacretaz wrote:

On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
kevin.min...@hortonworks.com wrote:
...

=== Nominated Mentors ===

Owen O’Malley (omalley AT apache DOT org)
Mahadev Konar (mahadev AT apache DOT org)
Alan Gates (gates AT apache DOT org)
Devaraj Das (ddas AT apache DOT org)

...

Considering all initial committers are from hortonworks, I'd like to
see affiliations for mentors as well - or at least indicate which
mentors are *not* affiliated with hortonworks.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org




-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Bertrand Delacretaz
Hi,

On Mon, Feb 11, 2013 at 5:07 PM, Kevin Minder
kevin.min...@hortonworks.com wrote:
 ...Currently all of the mentors are affiliated with Hortonworks as well.  We
 can reach out to a number of other people and organizations that are
 interested in the project to diversify if this is an important consideration
 at this stage

I think that would be good - just ask for more mentors on this list.
Without such diversity this is basically an all-hortonworks podling,
which is not good IMO.


 I
 don't seem to have privs to create http://wiki.apache.org/incubator/knox.
 Would that be the correct place to evolve the proposal?  If so, how do I go
 about creating in?...

Yes, http://wiki.apache.org/incubator/is the right place - just ask
for write access on this list, a [wiki] marker in the subject line
will probably help. I don't think I have the rights to grant that
access myself.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Eric Sammer
Kevin:

Interesting proposal. Can you explain a bit more about what the target use
case is? It sounds like there's SSO-ish functionality (presumably a doAs()
machine) with integration with directory services, but the proposal also
mentions a single point for data and jobs. Is it your intention to
provide job submissions and data ingestion APIs for MR and HDFS,
respectively? Do you plan to target other ecosystem projects such as HBase?
Sorry if I missed this in the proposal.

Thanks!


On Mon, Feb 11, 2013 at 6:55 AM, Kevin Minder
kevin.min...@hortonworks.comwrote:

 Knox Gateway Proposal

 == Abstract ==

 Knox Gateway is a system that provides a single point of secure access for
 Apache Hadoop clusters.

 == Proposal ==

 The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
 point of authentication and access for Apache Hadoop services in a cluster.
 The goal is to simplify Hadoop security for both users (i.e. who access the
 cluster data and execute jobs) and operators (i.e. who control access and
 manage the cluster). The Gateway runs as a server (or cluster of servers)
 that serve one or more Hadoop clusters.

 Provide perimeter security to make Hadoop security setup easier
 Support authentication and token verification security scenarios
 Deliver users a single cluster end-point that aggregates capabilities for
 data and jobs
 Enable integration with enterprise and cloud identity management
 environments

 == Background ==

 An Apache Hadoop cluster is presented to consumers as a loose collection
 of independent services. This makes it difficult for users to interact with
 Hadoop since each service maintains it’s own method of access and security.
 As well, for operators, configuration and administration of a secure Hadoop
 cluster is a complex and many Hadoop clusters are insecure as a result.

 == Rationale ==

 Organizations that are struggling with Hadoop cluster security result in
 a) running Hadoop without security or b) slowing adoption of Hadoop. The
 Gateway aims to provide perimeter security that integrates more easily into
 existing organizations’ security infrastructure. Doing so will simplify
 security for these organizations and benefit all Hadoop stakeholders (i.e.
 users and operators). Additionally, making a dedicated perimeter security
 project part of the Apache Hadoop ecosystem will prevent fragmentation in
 this area and further increase the value of Hadoop as a data platform.

 == Current Status ==

 Prototype available, developed by the list of initial committers.

 === Meritocracy ===

 We desire to build a diverse developer community around Gateway following
 the Apache Way. We want to make the project open source and will encourage
 contributors from multiple organizations following the Apache meritocracy
 model.

 === Community ===

 We hope to extend the user and developer base in the future and build a
 solid open source community around Gateway. Apache Hadoop has a large
 ecosystem of open source projects, each with a strong community of
 contributors. All project communities in this ecosystem have an opportunity
 to participate in the advancement of the Gateway project because
 ultimately, Gateway will enable the security capabilities of their project
 to be more enterprise friendly.

 === Core Developers ===

 Gateway is currently being developed by several engineers from Hortonworks
 - Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit Mohanty.
 All the engineers have deep expertise in middleware, security  identity
 systems and are quite familiar with the Hadoop ecosystem.

 === Alignment ===

 The ASF is a natural host for Gateway given that it is already the home of
 Hadoop, Hive, Pig, HBase, Oozie and other emerging big data software
 projects. Gateway is designed to solve the security challenges familiar to
 the Hadoop ecosystem family of projects.

 == Known Risks ==

 === Orphaned products  Reliance on Salaried Developers ===

 The core developers plan to work full time on the project. We believe that
 this project will be of general interest to many Hadoop users and will
 attract a diverse set of contributors. We intend to demonstrate this by
 having contributors from several organizations recognized as committers by
 the time Knox graduates from incubation.

 === Inexperience with Open Source ===

 All of the core developers are active users and followers of open source.
 As well, Hortonworks has a strong heritage of success with contributions to
 Apache Hadoop Projects.

 === Homogeneous Developers ===

 The current core developers are from Hortonworks, however, we hope to
 establish a developer community that includes contributors from several
 corporations.

 === Reliance on Salaried Developers ===

 Currently, the developers are paid to do work on Gateway. However, once
 the project has a community built around it, we expect to get committers
 and developers from outside the current core developers.

 === Relationships 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Kevin Minder

Hi Eric,
Let me answer your second question first.

Q: Is it your intention to provide job submissions and data ingestion APIs for 
MR and HDFS, respectively?
A: Yes we plan to progress the project to cover all existing ecosystem 
projects.  In addition the project is based on a modular framework that allows 
for each extension to cover services that are either new or proprietary.  
Certainly there exist very high volume data ingest use cases for which using a 
gateway may be impractical but in general the idea is to support all required 
client interaction with Hadoop via the gateway.

Now for your first question...

Q: Can you explain a bit more about what the target use case is?
A: One typical use case will be that the gateway will run in a DMW.  It will as 
you say be integrations with various directory services and is extensible to 
cover those not included.  The gateway will then propagate the identity into 
the Hadoop cluster using Hadoop specific mechanisms.  The key point is that 
there will typically be a single port open on the client side to the gateway.  
The Hadoop cluster is firewalled, only providing access to the Hadoop services 
to the gateway instances.
A: Another use case is that an organization is already using some SSO solution 
and the gateway would be integrated with that to verify any SSO token and then 
propagate the identity to the Hadoop services.

I will collect this and add it to the proposal wiki once I have privs to 
create the page.


Thanks!
Kevin.

On 2/11/13 12:03 PM, Eric Sammer wrote:

Kevin:

Interesting proposal. Can you explain a bit more about what the target use
case is? It sounds like there's SSO-ish functionality (presumably a doAs()
machine) with integration with directory services, but the proposal also
mentions a single point for data and jobs. Is it your intention to
provide job submissions and data ingestion APIs for MR and HDFS,
respectively? Do you plan to target other ecosystem projects such as HBase?
Sorry if I missed this in the proposal.

Thanks!


On Mon, Feb 11, 2013 at 6:55 AM, Kevin Minder
kevin.min...@hortonworks.comwrote:


Knox Gateway Proposal

== Abstract ==

Knox Gateway is a system that provides a single point of secure access for
Apache Hadoop clusters.

== Proposal ==

The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
point of authentication and access for Apache Hadoop services in a cluster.
The goal is to simplify Hadoop security for both users (i.e. who access the
cluster data and execute jobs) and operators (i.e. who control access and
manage the cluster). The Gateway runs as a server (or cluster of servers)
that serve one or more Hadoop clusters.

Provide perimeter security to make Hadoop security setup easier
Support authentication and token verification security scenarios
Deliver users a single cluster end-point that aggregates capabilities for
data and jobs
Enable integration with enterprise and cloud identity management
environments

== Background ==

An Apache Hadoop cluster is presented to consumers as a loose collection
of independent services. This makes it difficult for users to interact with
Hadoop since each service maintains it’s own method of access and security.
As well, for operators, configuration and administration of a secure Hadoop
cluster is a complex and many Hadoop clusters are insecure as a result.

== Rationale ==

Organizations that are struggling with Hadoop cluster security result in
a) running Hadoop without security or b) slowing adoption of Hadoop. The
Gateway aims to provide perimeter security that integrates more easily into
existing organizations’ security infrastructure. Doing so will simplify
security for these organizations and benefit all Hadoop stakeholders (i.e.
users and operators). Additionally, making a dedicated perimeter security
project part of the Apache Hadoop ecosystem will prevent fragmentation in
this area and further increase the value of Hadoop as a data platform.

== Current Status ==

Prototype available, developed by the list of initial committers.

=== Meritocracy ===

We desire to build a diverse developer community around Gateway following
the Apache Way. We want to make the project open source and will encourage
contributors from multiple organizations following the Apache meritocracy
model.

=== Community ===

We hope to extend the user and developer base in the future and build a
solid open source community around Gateway. Apache Hadoop has a large
ecosystem of open source projects, each with a strong community of
contributors. All project communities in this ecosystem have an opportunity
to participate in the advancement of the Gateway project because
ultimately, Gateway will enable the security capabilities of their project
to be more enterprise friendly.

=== Core Developers ===

Gateway is currently being developed by several engineers from Hortonworks
- Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit Mohanty.
All the engineers 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Alex Karasulu
Hi Kevin,

This sounds like a much needed project. I endorse the concept but as
Bertrand pointed out you need a bit more diversity. Otherwise I see no
problem with moving forward.

Good luck!


On Mon, Feb 11, 2013 at 4:55 PM, Kevin Minder
kevin.min...@hortonworks.comwrote:

 Knox Gateway Proposal

 == Abstract ==

 Knox Gateway is a system that provides a single point of secure access for
 Apache Hadoop clusters.

 == Proposal ==

 The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
 point of authentication and access for Apache Hadoop services in a cluster.
 The goal is to simplify Hadoop security for both users (i.e. who access the
 cluster data and execute jobs) and operators (i.e. who control access and
 manage the cluster). The Gateway runs as a server (or cluster of servers)
 that serve one or more Hadoop clusters.

 Provide perimeter security to make Hadoop security setup easier
 Support authentication and token verification security scenarios
 Deliver users a single cluster end-point that aggregates capabilities for
 data and jobs
 Enable integration with enterprise and cloud identity management
 environments

 == Background ==

 An Apache Hadoop cluster is presented to consumers as a loose collection
 of independent services. This makes it difficult for users to interact with
 Hadoop since each service maintains it’s own method of access and security.
 As well, for operators, configuration and administration of a secure Hadoop
 cluster is a complex and many Hadoop clusters are insecure as a result.

 == Rationale ==

 Organizations that are struggling with Hadoop cluster security result in
 a) running Hadoop without security or b) slowing adoption of Hadoop. The
 Gateway aims to provide perimeter security that integrates more easily into
 existing organizations’ security infrastructure. Doing so will simplify
 security for these organizations and benefit all Hadoop stakeholders (i.e.
 users and operators). Additionally, making a dedicated perimeter security
 project part of the Apache Hadoop ecosystem will prevent fragmentation in
 this area and further increase the value of Hadoop as a data platform.

 == Current Status ==

 Prototype available, developed by the list of initial committers.

 === Meritocracy ===

 We desire to build a diverse developer community around Gateway following
 the Apache Way. We want to make the project open source and will encourage
 contributors from multiple organizations following the Apache meritocracy
 model.

 === Community ===

 We hope to extend the user and developer base in the future and build a
 solid open source community around Gateway. Apache Hadoop has a large
 ecosystem of open source projects, each with a strong community of
 contributors. All project communities in this ecosystem have an opportunity
 to participate in the advancement of the Gateway project because
 ultimately, Gateway will enable the security capabilities of their project
 to be more enterprise friendly.

 === Core Developers ===

 Gateway is currently being developed by several engineers from Hortonworks
 - Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit Mohanty.
 All the engineers have deep expertise in middleware, security  identity
 systems and are quite familiar with the Hadoop ecosystem.

 === Alignment ===

 The ASF is a natural host for Gateway given that it is already the home of
 Hadoop, Hive, Pig, HBase, Oozie and other emerging big data software
 projects. Gateway is designed to solve the security challenges familiar to
 the Hadoop ecosystem family of projects.

 == Known Risks ==

 === Orphaned products  Reliance on Salaried Developers ===

 The core developers plan to work full time on the project. We believe that
 this project will be of general interest to many Hadoop users and will
 attract a diverse set of contributors. We intend to demonstrate this by
 having contributors from several organizations recognized as committers by
 the time Knox graduates from incubation.

 === Inexperience with Open Source ===

 All of the core developers are active users and followers of open source.
 As well, Hortonworks has a strong heritage of success with contributions to
 Apache Hadoop Projects.

 === Homogeneous Developers ===

 The current core developers are from Hortonworks, however, we hope to
 establish a developer community that includes contributors from several
 corporations.

 === Reliance on Salaried Developers ===

 Currently, the developers are paid to do work on Gateway. However, once
 the project has a community built around it, we expect to get committers
 and developers from outside the current core developers.

 === Relationships with Other Apache Products ===

 Gateway is going to be used by the users and operators of Hadoop, and the
 Hadoop ecosystem in general.

 === A Excessive Fascination with the Apache Brand ===

 Our interest in developing Gateway in Apache project is to follow an
 established development model, 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Jakob Homan
When do you expect the github to be made available?
-Jakob



On Mon, Feb 11, 2013 at 10:21 AM, Alex Karasulu akaras...@apache.orgwrote:

 Hi Kevin,

 This sounds like a much needed project. I endorse the concept but as
 Bertrand pointed out you need a bit more diversity. Otherwise I see no
 problem with moving forward.

 Good luck!


 On Mon, Feb 11, 2013 at 4:55 PM, Kevin Minder
 kevin.min...@hortonworks.comwrote:

  Knox Gateway Proposal
 
  == Abstract ==
 
  Knox Gateway is a system that provides a single point of secure access
 for
  Apache Hadoop clusters.
 
  == Proposal ==
 
  The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
  point of authentication and access for Apache Hadoop services in a
 cluster.
  The goal is to simplify Hadoop security for both users (i.e. who access
 the
  cluster data and execute jobs) and operators (i.e. who control access and
  manage the cluster). The Gateway runs as a server (or cluster of servers)
  that serve one or more Hadoop clusters.
 
  Provide perimeter security to make Hadoop security setup easier
  Support authentication and token verification security scenarios
  Deliver users a single cluster end-point that aggregates capabilities for
  data and jobs
  Enable integration with enterprise and cloud identity management
  environments
 
  == Background ==
 
  An Apache Hadoop cluster is presented to consumers as a loose collection
  of independent services. This makes it difficult for users to interact
 with
  Hadoop since each service maintains it’s own method of access and
 security.
  As well, for operators, configuration and administration of a secure
 Hadoop
  cluster is a complex and many Hadoop clusters are insecure as a result.
 
  == Rationale ==
 
  Organizations that are struggling with Hadoop cluster security result in
  a) running Hadoop without security or b) slowing adoption of Hadoop. The
  Gateway aims to provide perimeter security that integrates more easily
 into
  existing organizations’ security infrastructure. Doing so will simplify
  security for these organizations and benefit all Hadoop stakeholders
 (i.e.
  users and operators). Additionally, making a dedicated perimeter security
  project part of the Apache Hadoop ecosystem will prevent fragmentation in
  this area and further increase the value of Hadoop as a data platform.
 
  == Current Status ==
 
  Prototype available, developed by the list of initial committers.
 
  === Meritocracy ===
 
  We desire to build a diverse developer community around Gateway following
  the Apache Way. We want to make the project open source and will
 encourage
  contributors from multiple organizations following the Apache meritocracy
  model.
 
  === Community ===
 
  We hope to extend the user and developer base in the future and build a
  solid open source community around Gateway. Apache Hadoop has a large
  ecosystem of open source projects, each with a strong community of
  contributors. All project communities in this ecosystem have an
 opportunity
  to participate in the advancement of the Gateway project because
  ultimately, Gateway will enable the security capabilities of their
 project
  to be more enterprise friendly.
 
  === Core Developers ===
 
  Gateway is currently being developed by several engineers from
 Hortonworks
  - Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit
 Mohanty.
  All the engineers have deep expertise in middleware, security  identity
  systems and are quite familiar with the Hadoop ecosystem.
 
  === Alignment ===
 
  The ASF is a natural host for Gateway given that it is already the home
 of
  Hadoop, Hive, Pig, HBase, Oozie and other emerging big data software
  projects. Gateway is designed to solve the security challenges familiar
 to
  the Hadoop ecosystem family of projects.
 
  == Known Risks ==
 
  === Orphaned products  Reliance on Salaried Developers ===
 
  The core developers plan to work full time on the project. We believe
 that
  this project will be of general interest to many Hadoop users and will
  attract a diverse set of contributors. We intend to demonstrate this by
  having contributors from several organizations recognized as committers
 by
  the time Knox graduates from incubation.
 
  === Inexperience with Open Source ===
 
  All of the core developers are active users and followers of open source.
  As well, Hortonworks has a strong heritage of success with contributions
 to
  Apache Hadoop Projects.
 
  === Homogeneous Developers ===
 
  The current core developers are from Hortonworks, however, we hope to
  establish a developer community that includes contributors from several
  corporations.
 
  === Reliance on Salaried Developers ===
 
  Currently, the developers are paid to do work on Gateway. However, once
  the project has a community built around it, we expect to get committers
  and developers from outside the current core developers.
 
  === Relationships with Other Apache Products 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Eric Sammer
Kevin:

Makes complete sense.

I'd like to offer to join the project, if it's accepted for incubation. I'm
a committer on MRUnit and Flume, and on the PMC for both. I've helped both
projects through the incubation phase, and I also know a little bit about
this Hadoop thing. ;)

Thanks!


On Mon, Feb 11, 2013 at 9:28 AM, Kevin Minder
kevin.min...@hortonworks.comwrote:

 Hi Eric,
 Let me answer your second question first.

 Q: Is it your intention to provide job submissions and data ingestion APIs
 for MR and HDFS, respectively?
 A: Yes we plan to progress the project to cover all existing ecosystem
 projects.  In addition the project is based on a modular framework that
 allows for each extension to cover services that are either new or
 proprietary.  Certainly there exist very high volume data ingest use cases
 for which using a gateway may be impractical but in general the idea is to
 support all required client interaction with Hadoop via the gateway.

 Now for your first question...

 Q: Can you explain a bit more about what the target use case is?
 A: One typical use case will be that the gateway will run in a DMW.  It
 will as you say be integrations with various directory services and is
 extensible to cover those not included.  The gateway will then propagate
 the identity into the Hadoop cluster using Hadoop specific mechanisms.  The
 key point is that there will typically be a single port open on the client
 side to the gateway.  The Hadoop cluster is firewalled, only providing
 access to the Hadoop services to the gateway instances.
 A: Another use case is that an organization is already using some SSO
 solution and the gateway would be integrated with that to verify any SSO
 token and then propagate the identity to the Hadoop services.

 I will collect this and add it to the proposal wiki once I have privs to
 create the page.

 Thanks!
 Kevin.


 On 2/11/13 12:03 PM, Eric Sammer wrote:

 Kevin:

 Interesting proposal. Can you explain a bit more about what the target use
 case is? It sounds like there's SSO-ish functionality (presumably a doAs()
 machine) with integration with directory services, but the proposal also
 mentions a single point for data and jobs. Is it your intention to
 provide job submissions and data ingestion APIs for MR and HDFS,
 respectively? Do you plan to target other ecosystem projects such as
 HBase?
 Sorry if I missed this in the proposal.

 Thanks!


 On Mon, Feb 11, 2013 at 6:55 AM, Kevin Minder
 kevin.min...@hortonworks.com**wrote:

  Knox Gateway Proposal

 == Abstract ==

 Knox Gateway is a system that provides a single point of secure access
 for
 Apache Hadoop clusters.

 == Proposal ==

 The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
 point of authentication and access for Apache Hadoop services in a
 cluster.
 The goal is to simplify Hadoop security for both users (i.e. who access
 the
 cluster data and execute jobs) and operators (i.e. who control access and
 manage the cluster). The Gateway runs as a server (or cluster of servers)
 that serve one or more Hadoop clusters.

 Provide perimeter security to make Hadoop security setup easier
 Support authentication and token verification security scenarios
 Deliver users a single cluster end-point that aggregates capabilities for
 data and jobs
 Enable integration with enterprise and cloud identity management
 environments

 == Background ==

 An Apache Hadoop cluster is presented to consumers as a loose collection
 of independent services. This makes it difficult for users to interact
 with
 Hadoop since each service maintains it’s own method of access and
 security.
 As well, for operators, configuration and administration of a secure
 Hadoop
 cluster is a complex and many Hadoop clusters are insecure as a result.

 == Rationale ==

 Organizations that are struggling with Hadoop cluster security result in
 a) running Hadoop without security or b) slowing adoption of Hadoop. The
 Gateway aims to provide perimeter security that integrates more easily
 into
 existing organizations’ security infrastructure. Doing so will simplify
 security for these organizations and benefit all Hadoop stakeholders
 (i.e.
 users and operators). Additionally, making a dedicated perimeter security
 project part of the Apache Hadoop ecosystem will prevent fragmentation in
 this area and further increase the value of Hadoop as a data platform.

 == Current Status ==

 Prototype available, developed by the list of initial committers.

 === Meritocracy ===

 We desire to build a diverse developer community around Gateway following
 the Apache Way. We want to make the project open source and will
 encourage
 contributors from multiple organizations following the Apache meritocracy
 model.

 === Community ===

 We hope to extend the user and developer base in the future and build a
 solid open source community around Gateway. Apache Hadoop has a large
 ecosystem of open source projects, each with a strong 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Kevin Minder

I hope to have the repo converted from private to public today (2/11).

On 2/11/13 1:44 PM, Jakob Homan wrote:

When do you expect the github to be made available?
-Jakob



On Mon, Feb 11, 2013 at 10:21 AM, Alex Karasulu akaras...@apache.orgwrote:


Hi Kevin,

This sounds like a much needed project. I endorse the concept but as
Bertrand pointed out you need a bit more diversity. Otherwise I see no
problem with moving forward.

Good luck!


On Mon, Feb 11, 2013 at 4:55 PM, Kevin Minder
kevin.min...@hortonworks.comwrote:


Knox Gateway Proposal

== Abstract ==

Knox Gateway is a system that provides a single point of secure access

for

Apache Hadoop clusters.

== Proposal ==

The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
point of authentication and access for Apache Hadoop services in a

cluster.

The goal is to simplify Hadoop security for both users (i.e. who access

the

cluster data and execute jobs) and operators (i.e. who control access and
manage the cluster). The Gateway runs as a server (or cluster of servers)
that serve one or more Hadoop clusters.

Provide perimeter security to make Hadoop security setup easier
Support authentication and token verification security scenarios
Deliver users a single cluster end-point that aggregates capabilities for
data and jobs
Enable integration with enterprise and cloud identity management
environments

== Background ==

An Apache Hadoop cluster is presented to consumers as a loose collection
of independent services. This makes it difficult for users to interact

with

Hadoop since each service maintains it’s own method of access and

security.

As well, for operators, configuration and administration of a secure

Hadoop

cluster is a complex and many Hadoop clusters are insecure as a result.

== Rationale ==

Organizations that are struggling with Hadoop cluster security result in
a) running Hadoop without security or b) slowing adoption of Hadoop. The
Gateway aims to provide perimeter security that integrates more easily

into

existing organizations’ security infrastructure. Doing so will simplify
security for these organizations and benefit all Hadoop stakeholders

(i.e.

users and operators). Additionally, making a dedicated perimeter security
project part of the Apache Hadoop ecosystem will prevent fragmentation in
this area and further increase the value of Hadoop as a data platform.

== Current Status ==

Prototype available, developed by the list of initial committers.

=== Meritocracy ===

We desire to build a diverse developer community around Gateway following
the Apache Way. We want to make the project open source and will

encourage

contributors from multiple organizations following the Apache meritocracy
model.

=== Community ===

We hope to extend the user and developer base in the future and build a
solid open source community around Gateway. Apache Hadoop has a large
ecosystem of open source projects, each with a strong community of
contributors. All project communities in this ecosystem have an

opportunity

to participate in the advancement of the Gateway project because
ultimately, Gateway will enable the security capabilities of their

project

to be more enterprise friendly.

=== Core Developers ===

Gateway is currently being developed by several engineers from

Hortonworks

- Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit

Mohanty.

All the engineers have deep expertise in middleware, security  identity
systems and are quite familiar with the Hadoop ecosystem.

=== Alignment ===

The ASF is a natural host for Gateway given that it is already the home

of

Hadoop, Hive, Pig, HBase, Oozie and other emerging big data software
projects. Gateway is designed to solve the security challenges familiar

to

the Hadoop ecosystem family of projects.

== Known Risks ==

=== Orphaned products  Reliance on Salaried Developers ===

The core developers plan to work full time on the project. We believe

that

this project will be of general interest to many Hadoop users and will
attract a diverse set of contributors. We intend to demonstrate this by
having contributors from several organizations recognized as committers

by

the time Knox graduates from incubation.

=== Inexperience with Open Source ===

All of the core developers are active users and followers of open source.
As well, Hortonworks has a strong heritage of success with contributions

to

Apache Hadoop Projects.

=== Homogeneous Developers ===

The current core developers are from Hortonworks, however, we hope to
establish a developer community that includes contributors from several
corporations.

=== Reliance on Salaried Developers ===

Currently, the developers are paid to do work on Gateway. However, once
the project has a community built around it, we expect to get committers
and developers from outside the current core developers.

=== Relationships with Other Apache Products ===

Gateway is going to be used by the users and 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Kevin Minder

The proposal is now hosted on the wiki.
http://wiki.apache.org/incubator/knox
Currently just the initial proposal.  I'll be incorporating 
feedback/clarifications shortly.


On 2/11/13 11:35 AM, Bertrand Delacretaz wrote:

Hi,

On Mon, Feb 11, 2013 at 5:07 PM, Kevin Minder
kevin.min...@hortonworks.com wrote:

...Currently all of the mentors are affiliated with Hortonworks as well.  We
can reach out to a number of other people and organizations that are
interested in the project to diversify if this is an important consideration
at this stage

I think that would be good - just ask for more mentors on this list.
Without such diversity this is basically an all-hortonworks podling,
which is not good IMO.


I
don't seem to have privs to create http://wiki.apache.org/incubator/knox.
Would that be the correct place to evolve the proposal?  If so, how do I go
about creating in?...

Yes, http://wiki.apache.org/incubator/is the right place - just ask
for write access on this list, a [wiki] marker in the subject line
will probably help. I don't think I have the rights to grant that
access myself.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org




-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Kevin Minder

The current Knox source is now publicly readable.
https://github.com/hortonworks/knox.git

On 2/11/13 1:44 PM, Jakob Homan wrote:

When do you expect the github to be made available?
-Jakob



On Mon, Feb 11, 2013 at 10:21 AM, Alex Karasulu akaras...@apache.orgwrote:


Hi Kevin,

This sounds like a much needed project. I endorse the concept but as
Bertrand pointed out you need a bit more diversity. Otherwise I see no
problem with moving forward.

Good luck!


On Mon, Feb 11, 2013 at 4:55 PM, Kevin Minder
kevin.min...@hortonworks.comwrote:


Knox Gateway Proposal

== Abstract ==

Knox Gateway is a system that provides a single point of secure access

for

Apache Hadoop clusters.

== Proposal ==

The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
point of authentication and access for Apache Hadoop services in a

cluster.

The goal is to simplify Hadoop security for both users (i.e. who access

the

cluster data and execute jobs) and operators (i.e. who control access and
manage the cluster). The Gateway runs as a server (or cluster of servers)
that serve one or more Hadoop clusters.

Provide perimeter security to make Hadoop security setup easier
Support authentication and token verification security scenarios
Deliver users a single cluster end-point that aggregates capabilities for
data and jobs
Enable integration with enterprise and cloud identity management
environments

== Background ==

An Apache Hadoop cluster is presented to consumers as a loose collection
of independent services. This makes it difficult for users to interact

with

Hadoop since each service maintains it’s own method of access and

security.

As well, for operators, configuration and administration of a secure

Hadoop

cluster is a complex and many Hadoop clusters are insecure as a result.

== Rationale ==

Organizations that are struggling with Hadoop cluster security result in
a) running Hadoop without security or b) slowing adoption of Hadoop. The
Gateway aims to provide perimeter security that integrates more easily

into

existing organizations’ security infrastructure. Doing so will simplify
security for these organizations and benefit all Hadoop stakeholders

(i.e.

users and operators). Additionally, making a dedicated perimeter security
project part of the Apache Hadoop ecosystem will prevent fragmentation in
this area and further increase the value of Hadoop as a data platform.

== Current Status ==

Prototype available, developed by the list of initial committers.

=== Meritocracy ===

We desire to build a diverse developer community around Gateway following
the Apache Way. We want to make the project open source and will

encourage

contributors from multiple organizations following the Apache meritocracy
model.

=== Community ===

We hope to extend the user and developer base in the future and build a
solid open source community around Gateway. Apache Hadoop has a large
ecosystem of open source projects, each with a strong community of
contributors. All project communities in this ecosystem have an

opportunity

to participate in the advancement of the Gateway project because
ultimately, Gateway will enable the security capabilities of their

project

to be more enterprise friendly.

=== Core Developers ===

Gateway is currently being developed by several engineers from

Hortonworks

- Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit

Mohanty.

All the engineers have deep expertise in middleware, security  identity
systems and are quite familiar with the Hadoop ecosystem.

=== Alignment ===

The ASF is a natural host for Gateway given that it is already the home

of

Hadoop, Hive, Pig, HBase, Oozie and other emerging big data software
projects. Gateway is designed to solve the security challenges familiar

to

the Hadoop ecosystem family of projects.

== Known Risks ==

=== Orphaned products  Reliance on Salaried Developers ===

The core developers plan to work full time on the project. We believe

that

this project will be of general interest to many Hadoop users and will
attract a diverse set of contributors. We intend to demonstrate this by
having contributors from several organizations recognized as committers

by

the time Knox graduates from incubation.

=== Inexperience with Open Source ===

All of the core developers are active users and followers of open source.
As well, Hortonworks has a strong heritage of success with contributions

to

Apache Hadoop Projects.

=== Homogeneous Developers ===

The current core developers are from Hortonworks, however, we hope to
establish a developer community that includes contributors from several
corporations.

=== Reliance on Salaried Developers ===

Currently, the developers are paid to do work on Gateway. However, once
the project has a community built around it, we expect to get committers
and developers from outside the current core developers.

=== Relationships with Other Apache Products ===

Gateway is going to be used by the 

Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-11 Thread Hadrian Zbarcea

+1 on the idea.
+1 (strong) on the need to increase diversity.

I would suggest to open a bit the scope of the project to target other 
apache projects that act as services. I know it was mentioned, but it 
sounded to me more like a possibility than a focus.


I would also volunteer as a committer and, if you prefer, mentor. I 
assume it would be possible to take a peek at the code grant in the 
coming days.


Cheers,
Hadrian

On 02/11/2013 10:10 AM, Bertrand Delacretaz wrote:

On Mon, Feb 11, 2013 at 3:55 PM, Kevin Minder
kevin.min...@hortonworks.com wrote:
...

=== Nominated Mentors ===

Owen O’Malley (omalley AT apache DOT org)
Mahadev Konar (mahadev AT apache DOT org)
Alan Gates (gates AT apache DOT org)
Devaraj Das (ddas AT apache DOT org)

...

Considering all initial committers are from hortonworks, I'd like to
see affiliations for mentors as well - or at least indicate which
mentors are *not* affiliated with hortonworks.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org