Re: [VOTE] Resolution to graduate Apache Impala to TLP

2017-11-10 Thread Tom White
+1

Tom

On Thu, Nov 9, 2017 at 4:25 AM, Jim Apple  wrote:
> The graduation of Impala to a TLP has been discussed[0] on dev@impala,
> voted on[1] on dev@impala, and discussed[2] on general@incubator. All
> threads were open 72 hours or more, and all seem to have quiesced.
>
> This is a call for a VOTE to graduate Impala to a TLP. The draft resolution
> is below. Please select from:
>
> [ ] +1: Graduate Impala to a TLP
> [ ] +-0: Neither graduate nor do not graduate Impala to a TLP
> [ ] -1: Do NOT graduate Impala to a TLP, because ...
>
> 
>
> [0]: <
> https://lists.apache.org/thread.html/2f5db4788aff9b0557354b9106c0328a29c1f90c1a74a228163949d2@%3Cdev.impala.apache.org%3E
>>
>
> [1]: <
> https://lists.apache.org/thread.html/a5a7c6895b3e019347d6e4e4cf49d67d094d31b8f2c7b4d59200f3e4@%3Cdev.impala.apache.org%3E
>>
>
> [2]: <
> https://lists.apache.org/thread.html/6b8598408f76a472532923c5a7fc510470b21671677ba3486568c57e@%3Cgeneral.incubator.apache.org%3E
>>
>
> 
>
> Establish the Apache Impala Project
>
> WHEREAS, the Board of Directors deems it to be in the best interests of
> the Foundation and consistent with the Foundation's purpose to establish
> a Project Management Committee charged with the creation and maintenance
> of open-source software, for distribution at no charge to the public,
> related to a high-performance distributed SQL engine.
>
> NOW, THEREFORE, BE IT RESOLVED, that a Project Management Committee
> (PMC), to be known as the "Apache Impala Project", be and hereby is
> established pursuant to Bylaws of the Foundation; and be it further
>
> RESOLVED, that the Apache Impala Project be and hereby is responsible
> for the creation and maintenance of software related to a
> high-performance distributed SQL engine; and be it further
>
> RESOLVED, that the office of "Vice President, Apache Impala" be and
> hereby is created, the person holding such office to serve at the
> direction of the Board of Directors as the chair of the Apache Impala
> Project, and to have primary responsibility for management of the
> projects within the scope of responsibility of the Apache Impala
> Project; and be it further
>
> RESOLVED, that the persons listed immediately below be and hereby are
> appointed to serve as the initial members of the Apache Impala Project:
>
> * Alex Behm 
> * Bharath Vissapragada 
> * Brock Noland 
> * Carl Steinbach 
> * Casey Ching 
> * Daniel Hecht 
> * Dimitris Tsirogiannis 
> * Henry Robinson 
> * Ishaan Joshi 
> * Jim Apple 
> * John Russell 
> * Juan Yu 
> * Lars Volker 
> * Lenni Kuff 
> * Marcel Kornacker 
> * Martin Grund 
> * Matthew Jacobs 
> * Michael Brown 
> * Michael Ho 
> * Sailesh Mukil 
> * Skye Wanderman-Milne 
> * Taras Bobrovytsky 
> * Tim Armstrong 
> * Todd Lipcon 
>
> NOW, THEREFORE, BE IT FURTHER RESOLVED, that Jim Apple be appointed to
> the office of Vice President, Apache Impala, to serve in accordance with
> and subject to the direction of the Board of Directors and the Bylaws of
> the Foundation until death, resignation, retirement, removal or
> disqualification, or until a successor is appointed; and be it further
>
> RESOLVED, that the initial Apache Impala PMC be and hereby is tasked
> with the creation of a set of bylaws intended to encourage open
> development and increased participation in the Apache Impala Project;
> and be it further
>
> RESOLVED, that the Apache Impala Project be and hereby is tasked with
> the migration and rationalization of the Apache Incubator Impala
> podling; and be it further
>
> RESOLVED, that all responsibilities pertaining to the Apache Incubator
> Impala podling encumbered upon the Apache Incubator PMC are hereafter
> discharged.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [RESTART] [VOTE] Graduate Apache Beam

2016-12-06 Thread Tom White
+1

Tom

On Mon, Dec 5, 2016 at 11:30 PM, Davor Bonaci  wrote:
> Hi everyone,
> Please vote on the draft resolution proposed by the Apache Beam PPMC below,
> which establishes Apache Beam as a new top-level project at the Apache
> Software Foundation, as follows:
>
> [ ] +1, Graduate Apache Beam from the Incubator.
> [ ] +0, Don't care.
> [ ] -1, Don't graduate Apache Beam from the Incubator because...
>
> Please note that this is a restarted vote, per John's request, to clarify
> the alternatives. The old voting thread is archived [1].
>
> Before voting, please see the full text of the draft resolution below and
> the corresponding discussion thread [2], and vote only after you feel ready
> to do so. The vote will be open for at least 72 hours. This is a procedural
> vote [3]; it is adopted by a simple majority of qualified votes (with no
> minimum).
>
> If approved by the Apache Incubator, the proposed resolution will be
> submitted to the Board of Directors for their consideration.
>
> Thank you!
>
> Davor
>
> [1]
> https://lists.apache.org/thread.html/a8e9cecfe93f0e464cc7c1774d2761ca14326df1101b7670ca8b1dc3@%3Cgeneral.incubator.apache.org%3E
> [2]
> https://lists.apache.org/thread.html/b9c1071b35558846836814575ada3cdca61c72dc1e672ab994a9c936@%3Cgeneral.incubator.apache.org%3E
> [3] http://apache.org/foundation/voting.html
>
> The full-text of the draft resolution proposed by the Apache Beam PPMC:
>
> X. Establish the Apache Beam Project
>
>WHEREAS, the Board of Directors deems it to be in the best
>interests of the Foundation and consistent with the
>Foundation's purpose to establish a Project Management
>Committee charged with the creation and maintenance of
>open-source software, for distribution at no charge to
>the public, related to a unified programming model for both
>batch and streaming data processing, enabling efficient
>execution across diverse distributed execution engines
>and providing extensibility points for connecting to different
>technologies and user communities.
>
>NOW, THEREFORE, BE IT RESOLVED, that a Project Management
>Committee (PMC), to be known as the "Apache Beam Project",
>be and hereby is established pursuant to Bylaws of the
>Foundation; and be it further
>
>RESOLVED, that the Apache Beam Project be and hereby is
>responsible for the creation and maintenance of software
>related to a unified programming model for both batch and
>streaming data processing, enabling efficient execution across
>diverse distributed execution engines and providing extensibility
>points for connecting to different technologies and user
>communities; and be it further
>
>RESOLVED, that the office of "Vice President, Apache Beam" be
>and hereby is created, the person holding such office to
>serve at the direction of the Board of Directors as the chair
>of the Apache Beam Project, and to have primary responsibility
>for management of the projects within the scope of
>responsibility of the Apache Beam Project; and be it further
>
>RESOLVED, that the persons listed immediately below be and
>hereby are appointed to serve as the initial members of the
>Apache Beam Project:
>
>  * Tyler Akidau 
>  * Davor Bonaci 
>  * Robert Bradshaw 
>  * Ben Chambers 
>  * Luke Cwik 
>  * Stephan Ewen 
>  * Dan Halperin 
>  * Kenneth Knowles 
>  * Aljoscha Krettek 
>  * Maximilian Michels 
>  * Jean-Baptiste Onofré 
>  * Frances Perry 
>  * Amit Sela 
>  * Josh Wills 
>
>NOW, THEREFORE, BE IT FURTHER RESOLVED, that Davor Bonaci
>be appointed to the office of Vice President, Apache Beam, to
>serve in accordance with and subject to the direction of the
>Board of Directors and the Bylaws of the Foundation until
>death, resignation, retirement, removal or disqualification,
>or until a successor is appointed; and be it further
>
>RESOLVED, that the initial Apache Beam PMC be and hereby is
>tasked with the creation of a set of bylaws intended to
>encourage open development and increased participation in the
>Apache Beam Project; and be it further
>
>RESOLVED, that the Apache Beam Project be and hereby
>is tasked with the migration and rationalization of the Apache
>Incubator Beam podling; and be it further
>
>RESOLVED, that all responsibilities pertaining to the Apache
>Incubator Beam podling encumbered upon the Apache Incubator
>Project are hereafter discharged.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incub

Re: [VOTE] Accept Spot into the Apache Incubator

2016-09-22 Thread Tom White
+1

Tom

On Tue, Sep 20, 2016 at 7:15 PM, Doug Cutting  wrote:
> Following the discussion thread, I would like to call a vote on
> accepting Spot into the Apache Incubator.
>
> [] +1 Accept Spot into the Apache Incubator
> [] +0 Abstain.
> [] -1 Do not accept Spot into the Apache Incubator because ...
>
> This vote will run for the usual 72 hours.
>
> The proposal is attached, but you can also access it on the wiki:
>https://wiki.apache.org/incubator/SpotProposal
>
> Thanks,
>
> Doug
>
> = SpotProposal =
>
> == Abstract ==
>
> Spot is an open source platform for network telemetry (packet, flow,
> and proxy at the moment) built on an open data model and Apache
> Hadoop.
>
> == Proposal ==
>
> Spot (formerly Open Network Insight, or ONI) is an open source
> solution for network telemetry (packet, flow, and proxy at the moment)
> built on an open data model and Apache Hadoop. It provides ingestion
> and transformation of binary data, scalable machine learning, and
> interactive visualization for identifying threats in network flows and
> DNS packets.
>
> Spot has a pluggable architecture that can accommodate multiple open
> data models. Although cybersecurity/network-intrusion analysis is the
> initial use case for Spot, we are actively encouraging the
> contribution of new models that will enable other adjacent
> applications, such as fraud detection or IT-operational analytics such
> as performance and health monitoring. Because these models are open,
> users maintain control of their own data.
>
> More information on Spot can be found at the existing project website
> at http://open-network-insight.org/.
>
> == Background ==
>
> It almost goes without saying that cybersecurity is an acute and
> paramount concern globally, for organizations of all types and
> sizes. Fortunately, thanks to the availability of massively scalable
> (in the PBs) data infrastructure, security professionals can now make
> authentically data-driven decisions about how they protect their
> assets. For example, records of network traffic, captured as network
> flows, are often stored and analyzed for use in network management,
> and this same information can provide valuable insights into network
> vulnerabilities.
>
> Cybersecurity is just one example, however: There are other examples
> of adjacent use cases, such as user fraud detection or IT-operations
> analytics, that would benefit from the combination of Spot
> functionality and PB-scale data sets for analysis.
>
> == Rationale ==
>
> Although cybersecurity is its initial use case/data model, Spot is
> intended to more generally tackle the dual challenges of facilitating
> the development of big data-driven analytic solutions, while helping
> vendors avoid having to create one/off infrastructure for each use
> case. Spot will eliminate issues related to vendor data models that
> create silos between solutions, and that make it difficult for users
> to consume these innovations from multiple vendors. In summary, Spot
> will accelerate the development of new massively scalable analytic
> applications that give users more flexibility, and more choices.
>
> As an initial effort, we are now seeking to build an ecosystem of
> developers, data scientists, and security professionals to make Spot
> the open, community-driven, cybersecurity platform standard it needs
> to become. By bringing Spot to Apache, we hope to galvanize these
> groups to cooperate in this highly matrixed effort, and to build a
> global, and diverse, Spot community.
>
> == Initial Goals ==
>
> Move the existing codebase, website, documentation, and mailing lists
> to Apache-hosted infrastructure Work with the infrastructure team to
> implement and approve our build and testing workflows in the context
> of the ASF Incremental development and releases per Apache guidelines
>
> == Current Status ==
>
> === Releases ===
>
> Spot has undergone one public release (1.0). This initial release was
> not performed in the typical ASF fashion; we will adopt the ASF source
> release process upon joining the incubator.
>
> === Source ===
>
> Spot’s source, including core platform and associated submodules, is
> currently hosted in several GitHub repositories under the indicated
> licenses:
>
>  * Core (Apache License 2.0)
>  * Oni-ingest (Apache License 2.0)
>  * Oni-ml (Apache License 2.0
>  * Oni-oa (BSD & MIT)
>  * Oni-setup (Apache License 2.0)
>  * Oni-nfdump (BSD)
>  * Oni-lda-c (GNU General Public License version 2)
>
> The repositories will be transitioned to Apache’s git hosting during
> incubation.  Issues related to GPL code will be resolved during
> incubation.
>
>
> === Issue Tracking ===
>
> Spot’s bug and feature tracking is hosted on Github at:
>
>  * https://github.com/Open-Network-Insight/open-network-insight/issues
>
> Issue tracking will be transitioned to Apache’s JIRA instance during 
> incubation.
>
> === Code review ===
>
> Spot maintainers currently use “LGTM” (Looks Good to Me) in comments
> on 

Re: [VOTE] Graduate Sentry

2016-02-26 Thread Tom White
+1 (binding)

Tom

On Wed, Feb 24, 2016 at 7:20 PM, Sravya Tirukkovalur  wrote:
> Hi all,
>
> Following the positive discussion[1] and vote[2] in the Sentry
> community and a discussion[3] on the incubator list to graduate
> Sentry, I am calling a VOTE to graduate the project from the Incubator
> to a TLP. Please vote on the resolution pasted below.
>
> [ ] +1 Graduate Sentry from the Incubator
> [ ] +0 Don't care
> [ ] -1 Don't graduate Sentry from the Incubator (please specify reason)
>
> This vote will be open for at least 72 hours.
>
> References:
>
> [1] https://s.apache.org/dev_discuss
> [2] https://s.apache.org/dev_vote_result
> [3] https://s.apache.org/general_discuss
> Other:
> https://s.apache.org/general_notify
> https://cwiki.apache.org/confluence/display/SENTRY/Sentry+maturity+assessment
>
> Resolution to create a TLP from graduating Incubator podling:
>
> ==
>
>
> X. Establish the Apache Sentry Project
>
>   WHEREAS, the Board of Directors deems it to be in the best
>   interests of the Foundation and consistent with the
>   Foundation's purpose to establish a Project Management
>   Committee charged with the creation and maintenance of
>   open-source software, for distribution at no charge to
>   the public, related to Fine grained authorization to data and
> metadata in Hadoop.
>
>   NOW, THEREFORE, BE IT RESOLVED, that a Project Management
>   Committee (PMC), to be known as the "Apache Sentry Project",
>   be and hereby is established pursuant to Bylaws of the
>   Foundation; and be it further
>
>   RESOLVED, that the Apache Sentry Project be and hereby is
>   responsible for the creation and maintenance of software
>   related to Fine grained authorization to data and metadata in Hadoop;
>   and be it further
>
>   RESOLVED, that the office of "Vice President, Apache Sentry" be
>   and hereby is created, the person holding such office to
>   serve at the direction of the Board of Directors as the chair
>   of the Apache Sentry Project, and to have primary responsibility
>   for management of the projects within the scope of
>   responsibility of the Apache Sentry Project; and be it further
>
>   RESOLVED, that the persons listed immediately below be and
>   hereby are appointed to serve as the initial members of the
>   Apache Sentry Project:
>
> * Ali Rizvi 
>
>* Anne Yu 
>
>* Arun Suresh 
>
>* Brock Noland 
>
>* Chaoyu Tang 
>
>* Colin Ma 
>
>* Daisy Zhou 
>
>* Dapeng Sun 
>
>* David Nalley 
>
>* Erick Tryzelaar 
>
>* Gregory Chanan 
>
>* Guoquan Shen 
>
>* Hadi Nahari 
>
>* Hao Hao 
>
>* Jarek Jarcec Cecho 
>
>* Johnny Zhang 
>
>* Karthik Ramachandran 
>
>* Mark Grover 
>
>* Milo Polte 
>
>* Lenni Kuff 
>
>* Patrick Daly 
>
>* Patrick Hunt 
>
>* Prasad Mujumdar 
>
>* Raghu Mani 
>
>* Sean Mackrory 
>
>* Shreepadma Venugopalan 
>
>* Sravya Tirukkovalur 
>
>* Tuong Truong 
>
>* Vamsee Yarlagadda 
>
>* Xiaomeng Huang 
>
>* Xuefu Zhang 
>
>   NOW, THEREFORE, BE IT FURTHER RESOLVED, that Sravya Tirukkovalur
>   be appointed to the office of Vice President, Apache Sentry, to
>   serve in accordance with and subject to the direction of the
>   Board of Directors and the Bylaws of the Foundation until
>   death, resignation, retirement, removal or disqualification,
>   or until a successor is appointed; and be it further
>
>   RESOLVED, that the initial Apache Sentry PMC be and hereby is
>   tasked with the creation of a set of bylaws intended to
>   encourage open development and increased participation in the
>   Apache Sentry Project; and be it further
>
>   RESOLVED, that the Apache Sentry Project be and hereby
>   is tasked with the migration and rationalization of the Apache
>   Incubator Sentry podling; and be it further
>
>   RESOLVED, that all responsibilities pertaining to the Apache
>   Incubator Sentry podling encumbered upon the Apache Incubator
>   Project are hereafter discharged.
>
> ==
>
>
> Regards,
> Sravya Tirukkovalur
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Beam into the Apache Incubator

2016-01-29 Thread Tom White
DK),
> on Stack Overflow
> (http://stackoverflow.com/questions/tagged/google-cloud-dataflow) and has
> had contributions from a number of organizations and indivuduals.
>
> Everyday, Cloud Dataflow is actively used by a number of organizations and
> institutions for batch and stream processing of data. We believe acceptance
> will allow us to consolidate existing Dataflow-related work, grow the
> Dataflow community, and deepen connections between Dataflow and other open
> source projects.
>
> === Core Developers ===
>
> The core developers for Dataflow and the Dataflow runners are:
>
>  * Frances Perry
>  * Tyler Akidau
>  * Davor Bonaci
>  * Luke Cwik
>  * Ben Chambers
>  * Kenn Knowles
>  * Dan Halperin
>  * Daniel Mills
>  * Mark Shields
>  * Craig Chambers
>  * Maximilian Michels
>  * Tom White
>  * Josh Wills
>  * Robert Bradshaw
>
> === Alignment ===
>
> The Beam SDK can be used to create Beam pipelines which can be executed on
> Apache Spark or Apache Flink. Beam is also related to other Apache projects,
> such as Apache Crunch. We plan on expanding functionality for Beam runners,
> support for additional domain specific languages, and increased portability
> so Beam is a powerful abstraction layer for data processing.
>
> == Known Risks ==
>
> === Orphaned Products ===
>
> The Dataflow SDK is presently used by several organizations, from small
> startups to Fortune 100 companies, to construct production pipelines which
> are executed in Google Cloud Dataflow. Google has a long-term commitment to
> advance the Dataflow SDK; moreover, Dataflow is seeing increasing interest,
> development, and adoption from organizations outside of Google.
>
> === Inexperience with Open Source ===
>
> Google believes strongly in open source and the exchange of information to
> advance new ideas and work. Examples of this commitment are active OSS
> projects such as Chromium (https://www.chromium.org) and Kubernetes
> (http://kubernetes.io/). With Dataflow, we have tried to be increasingly
> open and forward-looking; we have published a paper in the VLDB conference
> describing the Dataflow model
> (http://www.vldb.org/pvldb/vol8/p1792-Akidau.pdf) and were quick to release
> the Dataflow SDK as open source software with the launch of Cloud Dataflow.
> Our submission to the Apache Software Foundation is a logical extension of
> our commitment to open source software.
>
> === Homogeneous Developers ===
>
> The majority of committers in this proposal belong to Google due to the fact
> that Dataflow has emerged from several internal Google projects. This
> proposal also includes committers outside of Google who are actively
> involved with other Apache projects, such as Hadoop, Flink, and Spark.  We
> expect our entry into incubation will allow us to expand the number of
> individuals and organizations participating in Dataflow development.
> Additionally, separation of the Dataflow SDK from Google Cloud Dataflow
> allows us to focus on the open source SDK and model and do what is best for
> this project.
>
> === Reliance on Salaried Developers ===
>
> The Dataflow SDK and Dataflow runners have been developed primarily by
> salaried developers supporting the Google Cloud Dataflow project. While the
> Dataflow SDK and Cloud Dataflow have been developed by different teams (and
> this proposal would reinforce that separation) we expect our initial set of
> developers will still primarily be salaried. Contribution has not been
> exclusively from salaried developers, however. For example, the contrib
> directory of the Dataflow SDK
> (https://github.com/GoogleCloudPlatform/DataflowJavaSDK/tree/master/contrib)
> contains items from free-time contributors. Moreover, seperate projects,
> such as ScalaFlow (https://github.com/darkjh/scalaflow) have been created
> around the Dataflow model and SDK. We expect our reliance on salaried
> developers will decrease over time during incubation.
>
> === Relationship with other Apache products ===
>
> Dataflow directly interoperates with or utilizes several existing Apache
> projects.
>
>  * Build
>   * Apache Maven
>  * Data I/O, Libraries
>   * Apache Avro
>   * Apache Commons
>  * Dataflow runners
>   * Apache Flink
>   * Apache Spark
>
> Beam when used in batch mode shares similarities with Apache Crunch;
> however, Beam is focused on a model, SDK, and abstraction layer beyond Spark
> and Hadoop (MapReduce.) One key goal of Beam is to provide an intermediate
> abstraction layer which can easily be implemented and utilized across
> several different processing frameworks.
>
> === An excessive fascination with the Apache brand ===
>
> With this proposal we are not seeking 

Impala commit policy

2015-12-02 Thread Tom White
The vote to accept Impala into the incubator has passed
(http://s.apache.org/u6r), however there are still some concerns about
CTR/RTC. My main takeaways from the CTR/RTC thread are that it's not a
binary choice, and that it's entirely reasonable that different
communities have different commit policies at the ASF.

I think Julian Hyde's suggestion that the Impala podling start with no
explicit commit policy is a good one. Incubation should be used as a
time to work out what works best for a project. The initial Impala
community should discuss the commit policy as they go through the
process of setting up ASF infra and start growing the podling. In
particular this will include how Gerrit can be used as a tool to
facilitate reviews, and how that fits with ASF culture, which is
something that other projects are looking at too.

Cheers,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Kudu into the Apache Incubator

2015-11-25 Thread Tom White
+1 (binding)

Tom

On Tue, Nov 24, 2015 at 7:32 PM, Todd Lipcon  wrote:
> Hi all,
>
> Discussion on the [DISCUSS] thread seems to have wound down, so I'd like to
> call a VOTE on acceptance of Kudu into the ASF Incubator. The proposal is
> pasted below and also available on the wiki at:
> https://wiki.apache.org/incubator/KuduProposal
>
> The proposal is unchanged since the original version, except for the
> addition of Carl Steinbach as a Mentor.
>
> Please cast your votes:
>
> [] +1, accept Kudu into the Incubator
> [] +/-0, positive/negative non-counted expression of feelings
> [] -1, do not accept Kudu into the incubator (please state reasoning)
>
> Given the US holiday this week, I imagine many folks are traveling or
> otherwise offline. So, let's run the vote for a full week rather than the
> traditional 72 hours. Unless the IPMC objects to the extended voting
> period, the vote will close on Tues, Dec 1st at noon PST.
>
> Thanks
> -Todd
> -
>
> = Kudu Proposal =
>
> == Abstract ==
>
> Kudu is a distributed columnar storage engine built for the Apache Hadoop
> ecosystem.
>
> == Proposal ==
>
> Kudu is an open source storage engine for structured data which supports
> low-latency random access together with efficient analytical access
> patterns. Kudu distributes data using horizontal partitioning and
> replicates each partition using Raft consensus, providing low
> mean-time-to-recovery and low tail latencies. Kudu is designed within the
> context of the Apache Hadoop ecosystem and supports many integrations with
> other data analytics projects both inside and outside of the Apache
> Software Foundation.
>
>
>
> We propose to incubate Kudu as a project of the Apache Software Foundation.
>
> == Background ==
>
> In recent years, explosive growth in the amount of data being generated and
> captured by enterprises has resulted in the rapid adoption of open source
> technology which is able to store massive data sets at scale and at low
> cost. In particular, the Apache Hadoop ecosystem has become a focal point
> for such “big data” workloads, because many traditional open source
> database systems have lagged in offering a scalable alternative.
>
>
>
> Structured storage in the Hadoop ecosystem has typically been achieved in
> two ways: for static data sets, data is typically stored on Apache HDFS
> using binary data formats such as Apache Avro or Apache Parquet. However,
> neither HDFS nor these formats has any provision for updating individual
> records, or for efficient random access. Mutable data sets are typically
> stored in semi-structured stores such as Apache HBase or Apache Cassandra.
> These systems allow for low-latency record-level reads and writes, but lag
> far behind the static file formats in terms of sequential read throughput
> for applications such as SQL-based analytics or machine learning.
>
>
>
> Kudu is a new storage system designed and implemented from the ground up to
> fill this gap between high-throughput sequential-access storage systems
> such as HDFS and low-latency random-access systems such as HBase or
> Cassandra. While these existing systems continue to hold advantages in some
> situations, Kudu offers a “happy medium” alternative that can dramatically
> simplify the architecture of many common workloads. In particular, Kudu
> offers a simple API for row-level inserts, updates, and deletes, while
> providing table scans at throughputs similar to Parquet, a commonly-used
> columnar format for static data.
>
>
>
> More information on Kudu can be found at the existing open source project
> website: http://getkudu.io and in particular in the Kudu white-paper PDF:
> http://getkudu.io/kudu.pdf from which the above was excerpted.
>
> == Rationale ==
>
> As described above, Kudu fills an important gap in the open source storage
> ecosystem. After our initial open source project release in September 2015,
> we have seen a great amount of interest across a diverse set of users and
> companies. We believe that, as a storage system, it is critical to build an
> equally diverse set of contributors in the development community. Our
> experiences as committers and PMC members on other Apache projects have
> taught us the value of diverse communities in ensuring both longevity and
> high quality for such foundational systems.
>
> == Initial Goals ==
>
>  * Move the existing codebase, website, documentation, and mailing lists to
> Apache-hosted infrastructure
>  * Work with the infrastructure team to implement and approve our code
> review, build, and testing workflows in the context of the ASF
>  * Incremental development and releases per Apache guidelines
>
> == Current Status ==
>
>  Releases 
>
> Kudu has undergone one public release, tagged here
> https://github.com/cloudera/kudu/tree/kudu0.5.0-release
>
> This initial release was not performed in the typical ASF fashion -- no
> source tarball was released, but rather only convenience binaries made
> available in Cloud

Re: [VOTE] Accept Impala into the Apache Incubator

2015-11-25 Thread Tom White
//github.com/cloudera/Impala/wiki)
>
> Impala’s auto-generated API documentation (
> http://impala.io/doc/html/index.html)
>
> = Initial Source =
> Impala’s initial source contribution will come from
> http://github.com/cloudera/Impala/.
>
> = External Dependencies =
>
> Impala depends upon a number of third-party libraries, which we list below.
> We intend to compile a LICENSE.txt file in the very short term (see
> https://issues.cloudera.org/browse/IMPALA-2670).
>
>  * Google gflags (BSD)
>  * Google glog (BSD)
>  * Apache Thrift (Apache Software License v2.0)
>  * Apache Commons (Apache Software License v2.0)
>  * Apache Hadoop (Apache Software License v2.0)
>  * Apache HBase (Apache Software License v2.0)
>  * Apache Hive (Apache Software License v2.0)
>  * Boost (Boost Software License)
>  * OpenLdap (OpenLDAP Software License)
>  * rapidjson (MIT)
>  * Google RE2 (BSD-style)
>  * lz4 (BSD)
>  * snappy (BSD)
>  * cyrus-sasl (CMU License)
>  * Apache Avro (Apache Software License v2.0)
>  * Cloudera squeasel (Apache Software License v2.0)
>  * Apache htrace (Incubating) (Apache Software License v2.0)
>  * Apache Sentry (Incubating) (Apache Software License v2.0)
>  * Apache Shiro (Apache Software License v2.0)
>  * Twitter Bootstrap (Apache Software License v2.0)
>  * d3 (BSD)
>  * LLVM (BSD-like)
>
> Build and test dependencies:
>
>  * ant (Apache Software License v2.0)
>  * Apache Maven (Apache Software License v2.0)
>  * cmake (BSD)
>  * clang (BSD)
>  * Google gtest (Apache Software License v2.0)
>
> = Required Resources =
>
> We request that following resources be created for the project to use:
>
> == Mailing lists ==
>
>  * priv...@impala.incubator.apache.org (moderated subscriptions)
>  * comm...@impala.incubator.apache.org
>  * d...@impala.incubator.apache.org
>  * iss...@impala.incubator.apache.org
>  * u...@impala.incubator.apache.org
>
> == Git repository ==
> https://git.apache.org/impala.git
>
> == JIRA instance ==
> JIRA project IMPALA (IMPALA or IMP)
>
> == Other Resources ==
> We hope to continue using Gerrit for our code review and commit workflow.
> We are involved with discussions that the Kudu team at Cloudera have been
> having with Jake Farrell to start discussions on how Gerrit can fit into
> the ASF. We know that several other ASF projects or podlings are also
> interested in Gerrit.
>
> If the Infrastructure team does not have the bandwidth to support gerrit,
> we will continue to support our own instance of gerrit for Impala, and make
> the necessary integrations such that commits are properly authenticated and
> maintain sufficient provenance to uphold the ASF standards (e.g. via the
> solution adopted by the AsterixDB podling).
>
> = Initial Committers =
>
>  * Tim Armstrong
>  * Alex Behm
>  * Taras Bobrovytsky
>  * Casey Ching
>  * Martin Grund
>  * Daniel Hecht
>  * Michael Ho
>  * Matthew Jacobs
>  * Ishaan Joshi
>  * Lenni Kuff
>  * Marcel Kornacker
>  * Sailesh Mukil
>  * Henry Robinson
>  * John Russell
>  * Dimitris Tsirogiannis
>  * Skye Wanderman-Milne
>  * Juan Yu
>
> == Affiliations ==
> All: Cloudera Inc.
>
> = Sponsors =
>
> == Champion ==
> Tom White
>
> == Nominated Mentors ==
>  * Tom White (Cloudera)
>  * Todd Lipcon (Cloudera)
>  * Carl Steinbach (LinkedIn)
>  * Brock Noland (StreamSets)
>
>
> = Sponsoring Entity =
> We ask that the Incubator PMC sponsor this proposal.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Apache Parquet from the Incubator

2015-04-13 Thread Tom White
+1

Tom

On Mon, Apr 13, 2015 at 4:46 PM, Jake Farrell  wrote:
> This thread is to start a vote on the graduation resolution Apache Parquet
> has approved on the dev list with 14 positive votes (11 PPMC (4 from IPMC
> members), 3 community)
>
> Status page: http://incubator.apache.org/projects/parquet.html
> Graduation discussion: http://s.apache.org/8qU
> Graduation resolution: http://s.apache.org/w7
>
> Apache Parquet has been apart of the Incubator since May 2014 and in that
> time has
>
> * Had successful IPMC approved releases
> * Expanded the PPMC with new members
> * Worked and submitted fixes in other Apache projects to improve Parquet
> support
> * A successful community vote to graduate with 14 positive votes
>
> I would like to get the voting started with my own +1
>
> Please vote
>
> [ ] +1 Graduate Apache Parquet as a TLP
> [ ] +0
> [ ] -1 Don't graduate Apache Parquet as a TLP because…
>
> Voting will end in 72 hours, i.e. 12:00 UTC on 2015-04-16
> http://www.timeanddate.com/countdown/to?iso=20150416T12
>
> -Jake
>
>
>
> === Board Resolution ==
>
> Establish the Apache Parquet Project
>
> WHEREAS, the Board of Directors deems it to be in the best
> interests of the Foundation and consistent with the
> Foundation's purpose to establish a Project Management
> Committee charged with the creation and maintenance of
> open-source software, for distribution at no charge to the
> public, related to a columnar storage format for Hadoop.
>
> NOW, THEREFORE, BE IT RESOLVED, that a Project Management
> Committee (PMC), to be known as the "Apache Parquet Project",
> be and hereby is established pursuant to Bylaws of the
> Foundation; and be it further
>
> RESOLVED, that the Apache Parquet Project be and hereby is
> responsible for the creation and maintenance of software
> related to a columnar storage format for Hadoop; and be it further
>
> RESOLVED, that the office of "Vice President, Apache Parquet" be
> and hereby is created, the person holding such office to
> serve at the direction of the Board of Directors as the chair
> of the Apache Parquet Project, and to have primary responsibility
> for management of the projects within the scope of
> responsibility of the Apache Parquet Project; and be it further
>
> RESOLVED, that the persons listed immediately below be and
> hereby are appointed to serve as the initial members of the
> Apache Parquet Project:
>
>* Chris Aniszczyk 
>* Ryan Blue 
>* Jonathan Coveney 
>* Tianshuo Deng 
>* Jake Farrell 
>* Marcel Kornacker 
>* Mickael Lacour 
>* Julien Le Dem 
>* Alex Levenson 
>    * Nong Li 
>* Todd Lipcon 
>* Chris Mattmann 
>* Aniket Mokashi 
>* Lukas Nalezenec 
>* Brock Noland 
>* Wesley Graham Peck 
>* Remy Pecqueur 
>* Dmitriy Ryaboy 
>* Roman Shaposhnik 
>* Daniel Weeks 
>* Tom White 
>
> NOW, THEREFORE, BE IT FURTHER RESOLVED, that Julien Le Dem
> be appointed to the office of Vice President, Apache Parquet, to
> serve in accordance with and subject to the direction of the
> Board of Directors and the Bylaws of the Foundation until
> death, resignation, retirement, removal or disqualification,
> or until a successor is appointed; and be it further
>
> RESOLVED, that the initial Apache Parquet PMC be and hereby is
> tasked with the creation of a set of bylaws intended to
> encourage open development and increased participation in the
> Apache Parquet Project; and be it further
>
> RESOLVED, that the Apache Parquet Project be and hereby
> is tasked with the migration and rationalization of the Apache
> Incubator Parquet podling; and be it further
>
> RESOLVED, that all responsibilities pertaining to the Apache
> Incubator Parquet podling encumbered upon the Apache Incubator
> Project are hereafter discharged.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Parquet into the incubator

2014-05-19 Thread Tom White
t project being abandoned is minimal. There are many
> organizations using Parquet in production, including Twitter, Cloudera,
> Stripe, and Salesforce (
> http://blog.cloudera.com/blog/2013/10/parquet-at-salesforce-com/).
>
> === Inexperience with Open Source ===
>
> Parquet has existed as a healthy open source for one year. During that
> time, we have curated an open-source community successfully, attracting
> over 40 contributors (see
> https://github.com/Parquet/parquet-mr/graphs/contributors) from a diverse
> group of companies.
> Several of the core contributors to the project are deeply familiar with
> OSS and Apache specifically: Julien Le Dem was until recently the PMC Chair
> for Apache Pig, and Dmitriy Ryaboy, Aniket Mokashi, and Jonathan Coveney
> are also Apache Pig committers with contributions to several other Apache
> projects. Todd Lipcon and Tom White are committers to Apache Hadoop and
> multiple other related projects. Brock Noland is a Hive committer.
>
> === Homogenous Developers ===
>
> The initial committers come from a number of companies and countries.
> Parquet has an active community of developers, and we are committed to
> recruiting additional committers based on their contributions to the
> project. The java library component alone has contributions from 31
> individual github accounts, 14 of which contributed over 1000 lines of code.
>
> === Reliance on Salaried Developers ===
>
> It is expected that Parquet development will occur on both salaried time
> and on volunteer time, after hours. The majority of initial committers are
> paid by their employers to contribute to this project. However, they are
> all passionate about the project, and we are confident that the project
> will continue even if no salaried developers contribute to the project. As
> evidence of this statement, we present the GitHub punchcard (see
> https://github.com/Parquet/parquet-mr/graphs/punch-card) showing that a lot
> of activity happens on weekends. We are committed to recruiting additional
> committers including non-salaried developers.
>
> === Relationships with Other Apache Products ===
>
> As mentioned in the Alignment section, Parquet is closely related to
> Hadoop. It provides an API that allowed it to be easily integrated with
> many other apache projects: Pig, Hive, Avro, Thrift, Spark, Drill, Crunch,
> Tajo. Some of the features it provides are similar to the ORC file format
> which is part of the Hive project. However Parquet focused on being
> framework agnostic and language independent and has been really successful
> to that end. On top of the Apache projects mentioned above, Parquet is also
> integrated with other open source projects, including Protocol Buffers,
> Cloudera Impala or Scrooge. We look forward to continue collaborating with
> those communities, as well as other Apache communities.
>
> === An Excessive Fascination with the Apache Brand ===
>
> Parquet is an already healthy and well known open source project. This
> proposal is not for the purpose of generating publicity. Rather, the
> primary benefits to joining Apache are those outlined in the Rationale
> section.
>
> == Documentation ==
>
> Documentation is currently located as README markdown files:
>
>  * https://github.com/Parquet/parquet-format
>  * https://github.com/Parquet/parquet-mr
>
> == Source and Intellectual Property Submission Plan ==
>
> The Parquet codebase is currently hosted on Github:
> https://github.com/Parquet.
>
> These are the codebases that we would migrate to the Apache foundation.
>
> == External Dependencies ==
>
>
>  * Junit: EPL
>  * Apache Commons: ALv2
>  * Apache Thrift: ALv2
>  * Apache Maven: ALv2
>  * Apache Avro: ALv2
>  * Apache Hadoop: ALv2
>  * Google Guava: ALv2
>  * Google Protobuf: New BSD License
>
> == Cryptography ==
>
> We do not expect Parquet to be a controlled export item due to the use of
> encryption.
>
> == Required Resources ==
>
> === Mailing lists ===
>
>  * priv...@parquet.incubator.apache.org
>  * comm...@parquet.incubator.apache.org
>  * d...@parquet.incubator.apache.org
>
> == Subversion Directory ==
>
> Git is the preferred source control system:
>
>  * git://git.apache.org/parquet-format
>  * git://git.apache.org/parquet-mr
>
> == Issue Tracking ==
>
> We'd like to keep using the Git review and issue tracking tools.
> Controlling Pull requests closing through git commit messages in
> git.apache.org
>
> == Initial Committers ==
>
>  * Aniket Mokashi 
>  * Brock Noland 
>  * Chris Aniszczyk 
>  * Dmitriy Ryaboy 
>  * Jake Farrell 
>  * Jonathan Coveney 
>  * Julien Le Dem 
>  * Lukas Nalezen

Re: [VOTE] Release of Apache Twill 0.1.0-incubating [rc1]

2014-02-04 Thread Tom White
+1

I noticed a couple of things to fix for the next release (not blockers):
* Missing license header in twill-0.1.0-incubating/checkstyle.xml
* There's a file called DEPENDENCIES which looks like it is
auto-generated, and is not in the tagged source, but is in the source
archive.

Cheers,
Tom

On Sat, Feb 1, 2014 at 12:42 AM, Terence Yim  wrote:
> Hi all,
>
> This is to call for a vote for release of Apache Twill
> v0.1.0-incubating. This will be the first incubator release for Apache
> Twill.
>
> Vote on twill-dev:
> http://s.apache.org/Rsy
>
> Result on vote on twill-dev:
> http://s.apache.org/KMR
>
> The tag to be voted upon is v0.1.0-incubating:
> https://git-wip-us.apache.org/repos/asf?p=incubator-twill.git;a=tag;h=refs/tags/v0.1.0-incubating
>
> The source tarball, including signatures, digests, etc can be found at:
> https://dist.apache.org/repos/dist/dev/incubator/twill/0.1.0-incubating-rc1/src
>
> The binary artifacts can be found at:
> https://dist.apache.org/repos/dist/dev/incubator/twill/0.1.0-incubating-rc1/bin
>
> Release artifacts are signed with the following key:
> https://people.apache.org/keys/committer/chtyim.asc
>
> KEYS file available here:
> https://dist.apache.org/repos/dist/dev/incubator/twill/KEYS
>
> For information about the contents of this release see:
> https://git-wip-us.apache.org/repos/asf?p=incubator-twill.git;a=log;h=refs/tags/v0.1.0-incubating
>
> Please vote on releasing this package as Apache Twill 0.1.0-incubating
>
> The vote will be open for 72 hours.
>
> [ ] +1 Release this package as Apache Twill 0.1.0-incubating
> [ ] -1 Do not release this package because ...
>
> Thanks,
> Apache Twill Team
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Twill for Incubation

2013-11-08 Thread Tom White
 for HBase. Poorna Chandra and Albert Shau have
> made many contributions to Twill.
>
> == Alignment ==
>
> The ASF is the natural choice to host the Twill project as its goal of
> encouraging community-driven open source projects fits with our vision for
> Twill.
>
> Additionally, many other projects with which we are familiar and expect
> Twill to integrate with, such as ZooKeeper, YARN, HDFS, log4j, and others
> mentioned in the External Dependencies section are Apache projects, and
> Twill will benefit by close proximity to them.
>
> = Known Risks =
>
> == Orphaned Products ==
>
> There is very little risk of Twill being orphaned, as it is a key part of
> Continuuity’s products. The core Twill developers plan to continue to work
> on Twill, and Continuuity has funding in place to support their efforts
> going forward.
> Many other Big Data companies can benefit from Twill, and we have already
> received interest from various entities that would like to use and
> contribute to Twill.
>
> == Inexperience with Open Source ==
>
> Several of the core developers have experience with open source
> development. Terence Yim, Andreas Neumann and Gary Helmling are currently
> Apache committers for Helix, Oozie and HBase respectively
> Homogeneous Developers
>
> The current core developers are all Continuuity employees. However, we
> intend to establish a developer community that includes independent and
> corporate contributors. We are encouraging new contributors via our mailing
> lists, public presentations, and personal contacts, and we will continue to
> do so. Various entities have already expressed interest in becoming
> involved with Twill.
>
> == Reliance on Salaried Developers ==
>
> Currently, these developers are paid to work on Twill. Once the project has
> built a community, we expect to attract committers, developers and
> community
> other than the current core developers. However, because Continuuity
> products use Twill internally, the reliance on salaried developers is
> unlikely to change, at least in the near term.
>
> == Relationships with Other Apache Products ==
>
> Twill is deeply integrated with Apache projects. Twill uses Apache YARN as
> its underlying resource management and task scheduling system and Apache
> Zookeeper for coordination. In addition, Twill uses Apache HDFS and Apache
> Kafka. A number of other Apache projects are Twill dependencies and are
> listed in the External Dependencies section.
>
> == An Excessive Fascination with the Apache Brand ==
>
> While we respect the reputation of the Apache brand and have no doubt that
> it will attract contributors and users, our interest is primarily to give
> Twill a solid home as an open source project following an established
> development model. We have also given additional reasons in the Rationale
> and Alignment sections.
>
> = Documentation =
>
> The current documentation for Weave is at
> https://github.com/continuuity/weave.
> This will be adapted to the new name Twill.
>
> = Initial Source =
>
> The Weave codebase is currently hosted at
> https://github.com/continuuity/weave. Weave will be renamed to Twill to
> seed the Apache git repository.
>
> = External Dependencies =
>
> The dependencies all have Apache-compatible licenses:
>  * avro (Apache 2.0)
>  * hadoop (Apache 2.0)
>  * gson (Apache 2.0)
>  * guava-libraries (Apache 2.0)
>  * hbase (Apache 2.0)
>  * hdfs (Apache 2.0)
>  * kafka (Apache 2.0)
>  * netty (Apache 2.0)
>  * snappy-java (Apache 2.0)
>  * yarn (Apache 2.0)
>  * zookeeper (Apache 2.0)
>  * asm (BSD)
>  * junit (EPL v1.0)
>  * logback (EPL v1.0 )
>  * slf4j (MIT)
>
> = Cryptography =
>
> Twill will depend on secure Hadoop, which can optionally use Kerberos.
>
> = Required Resources =
>
> == Mailing Lists ==
>
>  * twill-private for private PMC discussions (with moderated subscriptions)
>  * twill-dev for technical discussions among contributors
>  * twill-commits for notification about commits
>
> == Subversion Directory ==
>
> Git is the preferred source control system: git://git.apache.org/twill
>
> == Issue Tracking ==
>
> JIRA Twill (TWILL)
>
> == Other Resources ==
>
> The existing code already has unit tests, so we would like a Hudson
> instance to run them whenever a new patch is submitted. This can be added
> after project creation.
>
> = Initial Committers =
>
>  * Terence Yim
>  * Andreas Neumann
>  * Gary Helmling
>  * Poorna Chandra
>  * Albert Shau
>
> = Affiliations =
>
>  * Terence Yim (Continuuity)
>  * Andreas Neumann (Continuuity)
>  * Gary Helmling (Continuuity)
>  * Poorna Chandra (Continuuity)
>  * Albert Shau (Continuuity)
>
> = Sponsors =
>
> == Champion ==
>
> Vinod K  (Apache Member)
>
> == Nominated Mentors ==
>
>  * Arun C Murthy 
>  * Tom White 
>  * Patrick Hunt 
>  * Andrei Savu 
>
> == Sponsoring Entity ==
>
> We are requesting that the Incubator sponsor this project.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Storm into the Incubator

2013-09-13 Thread Tom White
+1

Tom

On Thu, Sep 12, 2013 at 8:19 PM, Doug Cutting  wrote:
> Discussion about the Storm proposal has subsided, issues raised now
> seemingly resolved.
>
> I'd like to call a vote to accept Storm as a new Incubator podling.
>
> The proposal is included below and is also at:
>
>   https://wiki.apache.org/incubator/StormProposal
>
> Let's keep the vote open for four working days, until 18 September.
>
> [ ] +1 Accept Storm into the Incubator
> [ ] +0 Don't care.
> [ ] -1 Don't accept Storm because...
>
> Doug
>
>
> = Storm Proposal =
>
> == Abstract ==
>
> Storm is a distributed, fault-tolerant, and high-performance realtime
> computation system that provides strong guarantees on the processing
> of data.
>
> == Proposal ==
>
> Storm is a distributed real-time computation system. Similar to how
> Hadoop provides a set of general primitives for doing batch
> processing, Storm provides a set of general primitives for doing
> real-time computation. Its use cases span stream processing,
> distributed RPC, continuous computation, and more. Storm has become a
> preferred technology for near-realtime big-data processing by many
> organizations worldwide (see a partial list at
> https://github.com/nathanmarz/storm/wiki/Powered-By). As an open
> source project, Storm’s developer community has grown rapidly to 46
> members.
>
> == Background ==
>
> The past decade has seen a revolution in data processing. MapReduce,
> Hadoop, and related technologies have made it possible to store and
> process data at scales previously unthinkable. Unfortunately, these
> data processing technologies are not realtime systems, nor are they
> meant to be. The lack of a "Hadoop of realtime" has become the biggest
> hole in the data processing ecosystem. Storm fills that hole.
>
> Storm was initially developed and deployed at BackType in 2011. After
> 7 months of development BackType was acquired by Twitter in July 2011.
> Storm was open sourced in September 2011.
>
> Storm has been under continuous development on its Github repository
> since being open-sourced. It has undergone four major releases (0.5,
> 0.6, 0.7, 0.8) and many minor ones.
>
>
> == Rationale ==
>
> Storm is a general platform for low-latency big-data processing. It is
> complementary to the existing Apache projects, such as Hadoop. Many
> applications are actually exploring using both Hadoop and Storm for
> big-data processing. Bringing Storm into Apache is very beneficial to
> both Apache community and Storm community.
>
> The rapid growth of Storm community is empowered by open source. We
> believe the Apache foundation is a great fit as the long-term home for
> Storm, as it provides an established process for community-driven
> development and decision making by consensus. This is exactly the
> model we want for future Storm development.
>
> == Initial Goals ==
>
>* Move the existing codebase to Apache
>* Integrate with the Apache development process
>* Ensure all dependencies are compliant with Apache License version 2.0
>* Incremental development and releases per Apache guidelines
>
> == Current Status ==
>
> Storm has undergone four major releases (0.5, 0.6, 0.7, 0.8) and many
> minor ones. Storm 0.9 is about to be released. Storm is being used in
> production by over 50 organizations. Storm codebase is currently
> hosted at github.com, which will seed the Apache git repository.
>
> === Meritocracy ===
>
> We plan to invest in supporting a meritocracy. We will discuss the
> requirements in an open forum. Several companies have already
> expressed interest in this project, and we intend to invite additional
> developers to participate. We will encourage and monitor community
> participation so that privileges can be extended to those that
> contribute.
>
> === Community ===
>
> The need for a low-latency big-data processing platform in the open
> source is tremendous. Storm is currently being used by at least 50
> organizations worldwide (see
> https://github.com/nathanmarz/storm/wiki/Powered-By), and is the most
> starred Java project on Github. By bringing Storm into Apache, we
> believe that the community will grow even bigger.
>
> === Core Developers ===
>
> Storm was started by Nathan Marz at BackType, and now has developers
> from Yahoo!, Microsoft, Alibaba, Infochimps, and many other companies.
>
> === Alignment ===
>
> In the big-data processing ecosystem, Storm is a very popular
> low-latency platform, while Hadoop is the primary platform for batch
> processing. We believe that it will help the further growth of
> big-data community by having Hadoop and Storm aligned within Apache
> foundation. The alignment is also beneficial to other Apache
> communities (such as Zookeeper, Thrift, Mesos). We could include
> additional sub-projects, Storm-on-YARN and Storm-on-Mesos, in the near
> future.
>
> == Known Risks ==
>
> === Orphaned Products ===
>
> The risk of the Storm project being abandoned is minimal. There are at
> least 50 organizatio

Re: [VOTE]: Accept Sentry in Apache Incubator

2013-08-06 Thread Tom White
ist]].
>
> === Core Developers ===
>
> The core developers for the Sentry project are Brock Noland, Shreepadma
> Venugopalan, Prasad Mujumdar and  Jarek Jarcec Cecho. Other contributors
> include Arvind Prabhakar and Xuefu Zhang. All engineers have deep expertise
> in Hadoop and various other ecosystem components.
>
> === Alignment ===
>
> Sentry complements the access control feature of some projects in the
> Apache Hadoop ecosystem, such as HDFS file permissions, by providing finer
> grained access control to data and metadata. It supersedes the access
> control capabilities of some other projects such as Apache Hive by
> providing stronger guarantees against malicious access.  Currently, Sentry
> integrates with Apache Hive, however we are planning to provide support for
> other components such as Apache Pig.
>
> While projects such as Apache Knox aim to provide perimeter security, the
> goal of Sentry is to implement a fine-grained role-based access control
> policy. Thus Sentry complements Apache Knox.
>
> == Known Risks ==
>
> === Orphaned Products ===
>
> Sentry is already deployed in production at a few well established
> companies and they are actively sharing feature requests. The risks of it
> being orphaned is negligible.
>
> === Inexperience with Open Source ===
>
> All committers of  the Sentry project are intimately familiar with the
> Apache model for open-source development and are experienced with working
> with various Apache open -source communities.
>
> === Homogeneous Developers ===
>
> The initial set of committers includes developers from several
> organizations - Cloudera, Oracle, Lab41, Nvidia and Wibidata.  We expect
> that once approved for incubation, the project will further attract new
> contributors.
>
> === Reliance on Salaried Developers ===
>
> It is expected that Sentry will be developed on both salaried and volunteer
> time, although all of the initial developers will work on it mainly on
> salaried time.
>
> === Relationships with Other Apache Products ===
>
> Sentry depends on other Apache Projects: Apache Hadoop, Apache Log4J,
> Apache Hive, Apache Shiro, multiple Apache Commons components. Build is
> orchestrated by Apache Maven. Sentry complements Apache Knox.
>
> === An Excessive Fascination with the Apache Brand ===
>
> We would like Sentry to become an Apache project to further foster a
> healthy community of users and developers around it. Since Sentry solves an
> important problem faced by Apache Hadoop users and interacts with other
> components of the Apache Hadoop ecosystem, we believe that Apache is the
> right home for Sentry.
>
> == Documentation ==
>
>   * Cloudera provides documentation specific to its distribution of Sentry
> at:
> http://www.cloudera.com/content/cloudera-content/cloudera-docs/Sentry/Sentry.pdf
>   * Sentry jira at Cloudera: https://issues.cloudera.org/browse/access
>
> == Initial Source ==
>
> https://github.com/cloudera/access
>
> == Source and Intellectual Property Submission Plan ==
>
> All of Sentry’s code is under Apache 2 license already.
>
> == External Dependencies ==
>
> All dependencies have licenses compatible with ASL. Dependencies that are
> not directly using ASL are,
>
>   * Junit - Eclipse Public License
>
> == Cryptography ==
>
> Sentry currently doesn’t directly use any cryptographic libraries. However,
> Sentry uses Apache Shiro, which provides support for cryptography features
> such as hash, cipher etc.
>
> == Required Resources ==
>
> === Mailing Lists ===
>
>   * priv...@sentry.incubator.apache.org for private PMC discussions (with
> moderated subscriptions)
>   * secur...@sentry.incubator.apache.org for private security related
> discussions
>   * d...@sentry.incubator.apache.org
>   * comm...@sentry.incubator.apache.org
>
> === Source code repository ===
>
> Git repository running at http://git-wip-us.apache.org/.
>
> === Issue Tracking ===
>
> JIRA Sentry (SENTRY)
>
> === Other Resources ===
>
> The existing code already has unit and integration tests so we would like a
> Jenkins CI instance that would run the tests on reference environment. We
> would also like to use Jenkins to run tests for every newly submitted patch
> (so called pre-commit hook), however this can be added after project
> creation.
>
> == Initial Committers ==
>
>   * Ali Rizvi (ali.rizvi at oracle.com)
>   * Arvind Prabhakar (arvind at apache.org)
>   * Brock Noland  (brock at apache.org)
>   * Chaoyu Tang (ctang at cloudera.com)
>   * Daisy Zhou (daisy at wibidata.com)
>   * David Nalley (ke4qqq at apache.org)
>   * Erick Tryzelaar(etryzelaar at iqt.org)
>   *

Re: [VOTE] Release Apache Provisionr version 0.4.0-incubating, RC0

2013-06-27 Thread Tom White
+1 verified signature and checksum, ran RAT, checked LICENSE, NOTICE
and DISCLAIMER, built from source.

Tom

On Thu, Jun 27, 2013 at 1:33 AM, Andrei Savu  wrote:
> This is the first release candidate for Apache Provisionr, version
> 0.4.0-incubating.
>
> The PPMC vote thread:
>
> http://mail-archives.apache.org/mod_mbox/incubator-provisionr-dev/201306.mbox/%3CCAHZL8y-PzC6AVavK-cs5d8Sv%2BV2bfq0REHPjAaWn1Fs-y3PSqQ%40mail.gmail.com%3E
>
> It fixes the following issues:
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12314423&version=12324250
>
> *** Please download, test and vote by July 2, 2013.
>
> Note that we are voting upon the source (tag), binaries are provided for
> convenience.
>
> Source and binary files:
> http://people.apache.org/~asavu/provisionr-0.4.0-incubating-candidate-0/
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapacheprovisionr-035/
>
> The tag to be voted upon:
> https://git-wip-us.apache.org/repos/asf?p=incubator-provisionr.git;a=tag;h=62abf302b47460abff904e2e721606255561757d
>
> Provisionr's KEYS file containing PGP keys we use to sign the release:
> http://www.apache.org/dist/incubator/provisionr/KEYS
>
> Check the following email to understand how this releases candidate was
> created and tested so far (we are still lacking documentation):
>
> http://mail-archives.apache.org/mod_mbox/incubator-provisionr-dev/201306.mbox/%3CCAHZL8y8dHL%3DU2kFVGySX2aNuF%3DRhNg9n4%2Bcpd1yktvacCZwbPA%40mail.gmail.com%3E
>
> [ ] +1
> [ ] 0
> [ ] -1 (explain why)
>
> Thanks!

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Apache Mesos 0.12.0-incubating (RC1)

2013-06-17 Thread Tom White
+1 (verified checksum and signature, checked LICENSE, NOTICE, DISCLAIMER).

Tom

On Tue, Jun 11, 2013 at 1:05 AM, Benjamin Mahler
 wrote:
> Please vote on releasing the following candidate as Apache Mesos
> (incubating) version 0.12.0. This will be the fourth incubator release for
> Mesos in Apache.
>
> The candidate for Mesos 0.12.0-incubating release is available at:
> http://people.apache.org/~bmahler/mesos-0.12.0-incubating-RC1/mesos-0.12.0-incubating.tar.gz
>
> The tag to be voted on is 0.12.0-rc1:
> https://git-wip-us.apache.org/repos/asf?p=incubator-mesos.git;a=tag;h=57d7b9719dce662881b162eba10b5765a807d53c
>
> The MD5 checksum of the tarball can be found at:
> http://people.apache.org/~bmahler/mesos-0.12.0-incubating-RC1/mesos-0.12.0-incubating.tar.gz.md5
>
> The signature of the tarball can be found at:
> http://people.apache.org/~bmahler/mesos-0.12.0-incubating-RC1/mesos-0.12.0-incubating.tar.gz.asc
>
> PGP key used to sign the release:
> http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xD0BEBB95D141A5B6
>
> Please vote on releasing this package as Apache Mesos 0.12.0-incubating!
>
> The vote is open until Thursday, June 13th at 00:00 UTC and passes if
> a majority
> of at least 3 +1 IPMC votes are cast.
>
> [ ] +1 Release this package as Apache Mesos 0.12.0-incubating
> [ ] -1 Do not release this package because ...
>
> To learn more about Apache Mesos, please see
> http://incubator.apache.org/mesos.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduation of Apache Mesos

2013-06-12 Thread Tom White
+1

Tom

On Wed, Jun 12, 2013 at 9:03 PM, Mattmann, Chris A (398J)
 wrote:
> Hi All,
>
> The Apache Mesos community is ready to graduate. They have added
> committers and PPMC members while in the Incubator; have made a
> few releases; are discussing their issues on list and in the Apache
> way, and are inclusive and representative of Apache's goals as a
> Foundation.
>
> I'm extremely happy to put them up for Incubator graduation.
> We've VOTEd as a community to move forward with this:
>
> DISCUSS thread here: http://s.apache.org/XAu
> VOTE thread here: http://s.apache.org/K8C
> VOTE RESULT: Message-ID: 
>
> Project Incubator status page here:
> http://incubator.apache.org/projects/mesos.html
>
> Board resolution pasted at bottom of email.
>
> Existing tallies from the community VOTE:
>
> +1
> Chris Mattmann*
> Vinod Kone
> Benjamin Hindman
> Benjamin Mahler
> Yan Xiu
> Deepal Jayasinghe
> Brenden Matthews
> Matei Zaharia
> Ant Elder*
> Konstantin Boudnik
>
> * - indicates IPMC
>
> Please VOTE to graduate Apache Mesos from the Incubator. Though
> only Incubator PMC member VOTEs are binding, all are welcome to
> voice your opinion. I'll leave the VOTE open for at least 72 hours,
> and hopefully can get enough VOTEs in time to close it by Saturday
> or Sunday in time for the board meeting on 6/19.
>
> [ ] +1 Graduate Apache Mesos from the Incubator.
> [ ] +0 Don't care.
> [ ] -1 Don't graduate Apache Mesos from the Incubator because..
>
> Thanks everyone!
>
> Cheers,
> Chris
>
>
> ---board resolution
> WHEREAS, the Board of Directors deems it to be in the best
> interests of the Foundation and consistent with the
> Foundation's purpose to establish a Project Management
> Committee charged with the creation and maintenance of
> open-source software, for distribution at no charge to the
> public, related to efficient cluster management, resource
> isolation and sharing across distributed applications.
>
> NOW, THEREFORE, BE IT RESOLVED, that a Project Management
> Committee (PMC), to be known as the "Apache Mesos Project", be
> and hereby is established pursuant to Bylaws of the Foundation;
> and be it further
>
> RESOLVED, that the Apache Mesos Project be and hereby is
> responsible for the creation and maintenance of software
> related to efficient cluster management, resource isolation
> and sharing across distributed applications; and be it further
> RESOLVED, that the office of "Vice President, Apache Mesos" be
> and hereby is created, the person holding such office to serve
> at the direction of the Board of Directors as the chair of the
> Apache Mesos Project, and to have primary responsibility for
> management of the projects within the scope of responsibility
> of the Apache Mesos Project; and be it further
> RESOLVED, that the persons listed immediately below be and
> hereby are appointed to serve as the initial members of the
> Apache Mesos Project:
>
>  * Ali Ghodsi 
> * Andy Konwinski 
> * Benjamin Hindhman 
> * Benjamin Mahler 
> * Brian McCalister 
> * Ian Holsman 
> * Matei Alexandru Zahari 
> * Chris Mattmann 
> * Tom White 
> * Vinod Kone 
> * Brenden Matthews 
> * Thomas Marshall 
> * Charles Reiss 
>
>
> NOW, THEREFORE, BE IT FURTHER RESOLVED, that Benjamin Hindman be
> appointed to the office of Vice President, Apache Mesos, to
> serve in accordance with and subject to the direction of the
> Board of Directors and the Bylaws of the Foundation until
> death, resignation, retirement, removal or disqualification, or
> until a successor is appointed; and be it further
>
> RESOLVED, that the Apache Mesos Project be and hereby is
> tasked with the migration and rationalization of the Apache
> Incubator Mesos podling; and be it further
>
> RESOLVED, that all responsibilities pertaining to the Apache
> Incubator Mesos podling encumbered upon the Apache Incubator
> Project are hereafter discharged.
>
>
> ++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept jclouds into the Apache Incubator

2013-04-23 Thread Tom White
ute code and documentation on their own time and
> have done so for a lengthy period. Given the current stream of development
> requests and the committers' sense of ownership of the jclouds code, this
> arrangement is expected to continue with jclouds' induction into the ASF.
>
> === Relationships with Other Apache Products ===
>
> jclouds and Apache Libcloud address similiar use cases. However, jclouds
> supplies these services for the Java and Clojure communities whereas
> Libcloud provides them for the Python ecosystem.
>
> While jclouds does not directly rely upon any Apache project, it does
> support several Apache projects and has options to collaborate with several
> others. More specifically, jclouds currently supports Apache Whirr, Apache
> ACE, Apache Karaf, and Apache Camel, and options exist to use Apache Maven
> as a build tool with the jclouds API.
>
> jclouds includes support for the Apache CloudStack API and is used as a
> compatibility test tool for its EC2 interface. jclouds can also be used to
> test Apache Deltacloud EC2 portability.
>
> === An Excessive Fascination with the Apache Brand ===
>
> jclouds recognizes the fortitude of the Apache brand, but the motivation
> for becoming an Apache project is to strengthen and expand the jclouds
> community and its user base. While the jclouds community has seen steady
> growth over the past several years, association with the ASF is expected to
> expedite this pattern of growth. Development is expected to continue on
> jclouds under the Apache license whether or not it is supported by the ASF.
>
> == Documentation ==
>
> The [[http://www.jclouds.org/|jclouds]] project documentation is publicly
> available at the following sites:
>
>   * http://jclouds.org: installation guide, user guides, development
> resources, news, resources to get started
>   * https://github.com/jclouds/jclouds: current source, source code
> issues log
>   * https://github.com/jclouds/jclouds.github.com: static content for
> jclouds.org, documentation issues log
>   * https://twitter.com/jclouds: jclouds on Twitter
>   * https://groups.google.com/forum/?fromgroups#!forum/jclouds-dev: the
> jclouds development forum on Google Groups
>   * https://groups.google.com/forum/?fromgroups#!forum/jclouds: the
> jclouds community forum on Google Groups
>
> == Initial Source ==
>
> The initial source is located on GitHub in the following repositories:
>
>  * git://github.com/jclouds/jclouds.git
>  * git://github.com/jclouds/jclouds-labs.git
>  * git://github.com/jclouds/jclouds.github.com.git
>  * git://github.com/jclouds/jclouds-chef.git
>  * git://github.com/jclouds/jclouds-cli.git
>  * git://github.com/jclouds/jclouds-karaf.git
>  * git://github.com/jclouds/jclouds-examples.git
>
> == Source and Intellectual Property Submission Plan ==
>
> jclouds's initial source is licensed under the Apache License, Version
> 2.0. https://github.com/jclouds/jclouds/blob/master/resources/LICENSE.txt
>
> == External Dependencies ==
>
> This is a listing of Maven coordinates for all of the external
> dependencies jclouds uses. All of the dependencies are in Sonatype and
> their licenses should be accessible.
>
>  * aopalliance:aopalliance:jar:1.0:compile
>  * com.google.code.gson:gson:jar:2.2.2:compile
>  * com.google.guava:guava:jar:14.0.1:compile
>  * com.google.inject.extensions:guice-assistedinject:jar:3.0:compile
>  * com.google.inject:guice:jar:3.0:compile
>  * javax.annotation:jsr250-api:jar:1.0:compile
>  * javax.inject:javax.inject:jar:1:compile
>  * javax.ws.rs:jsr311-api:jar:1.1.1:compile
>  * org.99soft.guice:rocoto:jar:6.2:compile
>
> == Cryptography ==
>
> jclouds contains no cryptographic algorithms, but it does provide the
> ability for people to plug in various cryptographic libraries.
>
> == Required Resources ==
>
> === Mailing lists ===
>
>  * jclouds-dev: for development discussions
>  * jclouds-user: for community discussions
>  * jclouds-private: for PPMC discussions
>  * jclouds-commits: for code changes
>
> === Apache git repository ===
>
> The jclouds team is experienced in git and requests the following
> allocation on the Apache git server:
>
> git://git.apache.org/incubator-jclouds.git
>
> === Issue Tracking ===
>
> jclouds currently uses GitHub for issue tracking. The intent is to request
> an allocation for Jira upon acceptance into the Incubator. Proposed project
> name: jclouds
>
> == Initial Committers ==
>
>  * Ignasi Barrera, ignasi dot barrera at gmail dot com
>  * Andrew Bayer, abayer at apache dot org
>  * Ioannis Canellos, iocanel at gmail dot com
>  * Adrian Cole, adrianc at netflix dot com
>  * Andrew G

Re: [VOTE] Accept Provisionr into the Apache Incubator

2013-03-04 Thread Tom White
tters are
> paid by their employer to contribute to this project. However, they are all
> passionate about the project, and we are confident that the project will
> continue even if no salaried developers contribute to the project. We are
> committed to recruiting additional committers including non-salaried
> developers.
>
> === Relationships with Other Apache Products ===
>
> Provisionr is closely integrated with CloudStack, Karaf, CXF, BigTop in a
> numerous ways. We look forward to collaborating with those communities, as
> well as other Apache communities (like Apache Helix).
>
> === A Excessive Fascination with the Apache Brand ===
>
> We think that Provisionr will benefit from the community sharing ideas and
> best practices for running cloud services. The ASF does a great job at
> building communities, which is why we want to build Provisionr at Apache.
>
> == Documentation ==
>
>  * https://github.com/axemblr/axemblr-provisionr/wiki
>  *
> https://github.com/axemblr/axemblr-provisionr/wiki/Axemblr-Provisionr-in-5-Minutes
>  *
> https://github.com/axemblr/axemblr-provisionr/wiki/Starting-an-HDFS-cluster-on-EC2-with-Provisionr-and-Rundeck
>  *
> http://www.slideshare.net/savu.andrei/creating-pools-of-virtual-machines-apachecon-na-2013
>
> == Initial Source ==
>
> Available on GitHub: https://github.com/axemblr/axemblr-provisionr
>
> == Source and Intellectual Property Submission Plan ==
>
> We are going to prepare a software grant as needed. The source code is
> already available under the Apache 2.0 license.
>
> == External Dependencies ==
>
> The existing external dependencies all have Apache compatible licenses:
> Activiti (Apache 2.0), AWS SDK (Apache 2.0), jclouds (Apache 2.0), Google
> Guava (Apache 2.0).
>
> == Cryptography ==
>
> Provisionr uses standard APIs and tools for SSH and SSL.
>
> == Required Resources ==
>
> === Mailing lists ===
>
>  * provisionr-private
>  * provisionr-dev
>
> === Version Control ===
>
> Git repository
>
> === Issue Tracking ===
>
> JIRA Provisionr (PROVISIONR)
>
> === Other Resources ===
>
> The existing code already has unit and integration tests so we would like a
> Jenkins instance to run them whenever a new patch is submitted. This can be
> added after project creation.
>
> == Initial Committers ==
>
>  * Andrei Savu - asavu at apache dot org
>  * Ioan Eugen Stan - ieugen at apache dot org
>  * Alex Ciminian - alex.ciminian at gmail dot org
>  * Amandeep Khurana - amansk at gmail dot com
>  * Roman Shaposhnik - rvs at apache dot org
>  * Eric Sammer - esammer at apache dot org
>
> == Sponsors ==
>
> === Champion ===
>
> Tom White - tomwhite at apache dot org
>
> === Nominated Mentors ===
>
>  * Roman Shaposhnik - rvs at apache dot org
>  * Tom White - tomwhite at apache dot org
>  * Mohammad Nour - mnour at apache dot org
>
> === Sponsoring ===
>
> Incubator PMC

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Provisionr join the Apache Incubator

2013-02-27 Thread Tom White
On Tue, Feb 26, 2013 at 6:34 AM, Andrei Savu  wrote:

> Tom do you want to be a champion for this proposal?

Yes, I'd be happy to.

One thing I think is worth discussing more here is whether Provisionr
should be a TLP on graduation, or whether it should be a module in
Whirr. Provisionr shares no code with Whirr, but it would be natural
for Whirr to use Provisionr for reliable cloud provisioning. Indeed,
as the proposal states, "This project started as a re-write of the
core of Apache Whirr", so Provisionr is a sort of "Next Generation"
Whirr. More importantly, in terms of community, Andrei is on the Whirr
PMC and I think that the communities would have considerable overlap.
It's worth deciding now, since there is an ongoing discussion about
whether existing projects are suitable homes for newly graduated
projects [1].

BTW Andrei, please put the proposal on the Incubator wiki so others
can edit it, and so it's easy to find in the future.

Thanks,
Tom

[1] 
http://mail-archives.apache.org/mod_mbox/incubator-general/201302.mbox/%3ccabd8fluoxt71foldya60faebd4ys1-ovxapkkrufjn9hndd...@mail.gmail.com%3E

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Apache Crunch 0.5.0 (incubating) RC0

2013-02-18 Thread Tom White
+1 checked signatures and checksums, RAT report is clean, license and
notice look good, builds from source.

Tom

On Sat, Feb 16, 2013 at 1:08 AM, Josh Wills  wrote:
> Hello,
>
> This is a call for a vote on releasing the following candidate as Apache
> Crunch 0.5.0 (incubating). This is our third release at Apache, and it
> fixes the following issues:
>
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313526&version=12323476
>
> The vote will be open for at least 72 hours. We received 1 IPMC member vote
> from Patrick Hunt on the vote thread on crunch-dev, and will need two more
> IPMC votes in order to make the release.
>
> Release artifacts:
> http://people.apache.org/~jwills/crunch-0.5.0-incubating-RC0/
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapachecrunch-228/
>
> The tag to be voted upon:
> https://git-wip-us.apache.org/repos/asf?p=incubator-crunch
> .git;a=tag;h=e60ace8424109dc941b13262d43dab659ffaca8a
>
> Crunch's KEYS file:
> http://www.apache.org/dist/incubator/crunch/KEYS
>
> Thanks,
> Josh

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Apache Knox Hadoop Gateway Project into the Incubator

2013-02-17 Thread Tom White
t#1.14
>
> Jerico - EPLv1.0
>
> net.htmlparser.jericho:jericho-html#3.2
>
> Servlet - CDDLv1.0 or GPLv2
>
> javax.servlet:javax.servlet-api#3.0.1
>
> JUnit - CPLv1.0
>
> junit:junit#4.11
>
> Cryptography
>
> The Gateway uses cryptographic software indirectly as a result of
> having two dependencies: ApacheDS and Apache Shiro. Gateway does not
> include any special or custom cryptographic technologies.
>
> ApacheDS is an ASF project and has been classified Export Commodity
> Control Number (ECCN) 5D002.C.1 due to it’s dependency on Bouncy
> Castle. More information on the ApacheDS classification can be found
> at http://svn.apache.org/repos/asf/directory/apacheds/trunk/installers/README
>
> Apache Shiro is an ASF project and has been classified Export
> Commodity Control Number (ECCN) 5D002.C.1. More information on the
> Apache Shiro classification can be found at
> http://svn.apache.org/repos/asf/shiro/trunk/README
>
> Required Resources
>
> Mailing lists
>
> knox-dev AT incubator DOT apache DOT org knox-commits AT incubator DOT
> apache DOT org knox-user AT hms incubator apache DOT org knox-private
> AT incubator DOT apache DOT org
>
> Subversion Directory
>
> https://svn.apache.org/repos/asf/incubator/knox
>
> Issue Tracking
>
> JIRA Knox (KNOX)
>
> Initial Committers
>
> Kevin Minder (kevin DOT minder AT hortonworks DOT com)
>
> Larry McCay (lmccay AT hortonworks DOT com)
>
> John Speidel (jspeidel AT hortonworks DOT com)
> Tom Beerbower (tbeerbower AT hortonworks DOT com)
> Sumit Mohanty (smohanty AT hortonworks DOT com)
> Venkatesh Seetharam (venkatesh AT hortonworks DOT com)
>
> Affiliations
>
> Kevin Minder (Hortonworks)
>
> Larry McCay (Hortonworks)
>
> John Speidel (Hortonworks)
> Tom Beerbower (Hortonworks)
> Sumit Mohanty (Hortonworks)
> Venkatesh Seetharm (Hortonworks)
> Owen O'Malley (Hortonworks)
> Mahadev Konar (Hortonworks)
> Alan Gates (Hortonworks)
> Devaraj Das (Hortwonrks)
> Chris Douglas (Microsoft)
> Chris Mattmann (NASA)
> Tom White (Cloudera)
>
> Sponsors
>
> Champion
>
> Devaraj Das (ddas AT apache DOT org)
>
> Nominated Mentors
>
> Owen O’Malley (omalley AT apache DOT org)
> Mahadev Konar (mahadev AT apache DOT org)
> Alan Gates (gates AT apache DOT org)
> Devaraj Das (ddas AT apache DOT org)
> Chris Douglas (cdouglas AT apache DOT org)
> Chris Mattmann (chris DOT a DOT mattmann AT jpl DOT nasa DOT gov)
> Tom White (tom DOT e DOT white AT gmail DOT com)
>
> Sponsoring Entity
>
> Incubator PMC
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Knox Hadoop Gateway Project

2013-02-12 Thread Tom White
This looks like an interesting and useful project. I'd like to volunteer as
a mentor on it.

Thanks,
Tom
On 11 Feb 2013 14:56, "Kevin Minder"  wrote:

> Knox Gateway Proposal
>
> == Abstract ==
>
> Knox Gateway is a system that provides a single point of secure access for
> Apache Hadoop clusters.
>
> == Proposal ==
>
> The Knox Gateway (“Gateway” or “Knox”) is a system that provides a single
> point of authentication and access for Apache Hadoop services in a cluster.
> The goal is to simplify Hadoop security for both users (i.e. who access the
> cluster data and execute jobs) and operators (i.e. who control access and
> manage the cluster). The Gateway runs as a server (or cluster of servers)
> that serve one or more Hadoop clusters.
>
> Provide perimeter security to make Hadoop security setup easier
> Support authentication and token verification security scenarios
> Deliver users a single cluster end-point that aggregates capabilities for
> data and jobs
> Enable integration with enterprise and cloud identity management
> environments
>
> == Background ==
>
> An Apache Hadoop cluster is presented to consumers as a loose collection
> of independent services. This makes it difficult for users to interact with
> Hadoop since each service maintains it’s own method of access and security.
> As well, for operators, configuration and administration of a secure Hadoop
> cluster is a complex and many Hadoop clusters are insecure as a result.
>
> == Rationale ==
>
> Organizations that are struggling with Hadoop cluster security result in
> a) running Hadoop without security or b) slowing adoption of Hadoop. The
> Gateway aims to provide perimeter security that integrates more easily into
> existing organizations’ security infrastructure. Doing so will simplify
> security for these organizations and benefit all Hadoop stakeholders (i.e.
> users and operators). Additionally, making a dedicated perimeter security
> project part of the Apache Hadoop ecosystem will prevent fragmentation in
> this area and further increase the value of Hadoop as a data platform.
>
> == Current Status ==
>
> Prototype available, developed by the list of initial committers.
>
> === Meritocracy ===
>
> We desire to build a diverse developer community around Gateway following
> the Apache Way. We want to make the project open source and will encourage
> contributors from multiple organizations following the Apache meritocracy
> model.
>
> === Community ===
>
> We hope to extend the user and developer base in the future and build a
> solid open source community around Gateway. Apache Hadoop has a large
> ecosystem of open source projects, each with a strong community of
> contributors. All project communities in this ecosystem have an opportunity
> to participate in the advancement of the Gateway project because
> ultimately, Gateway will enable the security capabilities of their project
> to be more enterprise friendly.
>
> === Core Developers ===
>
> Gateway is currently being developed by several engineers from Hortonworks
> - Kevin Minder, Larry McCay, John Speidel, Tom Beerbower and Sumit Mohanty.
> All the engineers have deep expertise in middleware, security & identity
> systems and are quite familiar with the Hadoop ecosystem.
>
> === Alignment ===
>
> The ASF is a natural host for Gateway given that it is already the home of
> Hadoop, Hive, Pig, HBase, Oozie and other emerging big data software
> projects. Gateway is designed to solve the security challenges familiar to
> the Hadoop ecosystem family of projects.
>
> == Known Risks ==
>
> === Orphaned products & Reliance on Salaried Developers ===
>
> The core developers plan to work full time on the project. We believe that
> this project will be of general interest to many Hadoop users and will
> attract a diverse set of contributors. We intend to demonstrate this by
> having contributors from several organizations recognized as committers by
> the time Knox graduates from incubation.
>
> === Inexperience with Open Source ===
>
> All of the core developers are active users and followers of open source.
> As well, Hortonworks has a strong heritage of success with contributions to
> Apache Hadoop Projects.
>
> === Homogeneous Developers ===
>
> The current core developers are from Hortonworks, however, we hope to
> establish a developer community that includes contributors from several
> corporations.
>
> === Reliance on Salaried Developers ===
>
> Currently, the developers are paid to do work on Gateway. However, once
> the project has a community built around it, we expect to get committers
> and developers from outside the current core developers.
>
> === Relationships with Other Apache Products ===
>
> Gateway is going to be used by the users and operators of Hadoop, and the
> Hadoop ecosystem in general.
>
> === A Excessive Fascination with the Apache Brand ===
>
> Our interest in developing Gateway in Apache project is to follow an
> established development model, as well sin

Re: [VOTE][PROPOSAL] Hadoop Development Tools

2012-11-07 Thread Tom White
+1

Cheers,
Tom

On Tue, Nov 6, 2012 at 7:57 AM, Adam Berry  wrote:
> Hello,
>
> This proposal has been open for discussion for a a few weeks, so now 
> submitting for a vote for this project to be accepted into the incubator.
>
> Cheers,
> Adam Berry
>
> = HDT (Hadoop Development Tools) =
>
> == Abstract ==
> Tools to support developing applications that use Apache Hadoop from within 
> Eclipse.
>
> == Proposal ==
> Hadoop Development Tools are a set of extensions to Eclipse providing support 
> for creating, launching and debugging distributed applications, as well as 
> interacting with HDFS filesystems. This work will build on the existing Map 
> Reduce Tools present in the Apache Hadoop project.
>
> == Background ==
> Map Reduce Tools have existed as part of contrib for Apache Hadoop. 
> Unfortunately they are source tied to a single version of Hadoop, and 
> development has stalled, with little movement past the Hadoop 0.20 line.
>
> == Rationale ==
> Support for newer versions of Hadoop from within Eclipse is regularly raised 
> on the Hadoop mailing lists, so there is a clear need to drive these tools 
> forward. Development tools generally are worked on separate from the target 
> tools/platform, separating the tools out will allow for supporting multiple 
> versions, so a developer could work with a heterogeneous environment.
>
> == Initial Goals ==
>  * Give the tools project a home of its own.
>  * Port current MapReduce tools feature set to all current release lines of 
> Hadoop in a single Eclipse install.
>  * Documentation and tutorials for all features.
>  * Publish Eclipse update site, and join Eclipse marketplace listing.
>  * Establish release cycle that combines support for Hadoop and Eclipse 
> release cycles.
>  * Look to build support for YARN, MRUnit and possibly other Hadoop-related 
> projects.
>
> == Current Status ==
> The source for the current MapReduceTools lives in the contrib section of the 
> Hadoop source. In its current implementation it is tied to the version of 
> Hadoop against which it is compiled. The layout and API that it was developed 
> with means that it can only be used with the 0.20 or 1.0 Hadoop releases, the 
> new layout and YARN api introduced with the 0.23 and 2.0 lines are not 
> supported.
>
>
> === Meritocracy ===
> Several people and companies have already expressed an interest in 
> contributing to this project, and we hope to attract additional interest 
> during the proposal discussion. We plan to invest and support a meritocracy 
> that attracts, invites, and supports newcomers to build a vibrant and  
> diverse community.
>
> === Community ===
> The target community is developers who are working developing Map/Reduce 
> applications against Hadoop. Given the success of Hadoop the target group is 
> likely to be quite large. Separation from the Hadoop community would make it 
> easier to support multiple versions of hadoop, as well as merging the release 
> cycles of Hadoop and Eclipse to provide predictable iteration and improvement 
> in the toolset.
>
> === Core Developers ===
> The initial list of developers includes people experienced with Hadoop and 
> developing against the Eclipse platform.
>  * Adam Berry (amberry at yahoo-inc dot com)
>  * Jeffrey Zemerick (jeffrrey at mtnfog dot com)
>  * Evert Lammerts (Evert dot Lammerts at sara dot nl)
>  * Simone Gianni (simoneg at apache dot org)
>
> === Alignment ===
> Hadoop Development Tools aligns with both Hadoop and Eclipse. Hadoop as the 
> platform for the development target, and Eclipse as the IDE platform used as 
> the base for the tools.
>
> == Known Risks ==
>
> === Orphaned Products ===
>
> === Inexperience with Open Source ===
> The committers have experience with Apache and Eclipse open source 
> development.
>
> === Reliance on Salaried Developers ===
> Hadoop Development Tools will be developed with a mix of salaried and 
> volunteer time.
>
> === Relationships with Other Apache Projects ===
> Hadoop Development Tools is closely related to Apache Hadoop.
>
> === An Excessive Fascination with the Apache Brand ===
> Given the success of Hadoop and associated projects, Apache is the natural 
> place for the Hadoop Development Tools. Chris Mattman suggested the Apache 
> Incubator as appropriate on the Hadoop general mailing list following the 
> success that MRUnit had taking the path from Hadoop contrib to an Apache top 
> level project.
>
> == Documentation ==
> Documentation for the current tools can be found at 
> http://wiki.apache.org/hadoop/EclipsePlugIn
>
> == Initial Source ==
> http://svn.apache.org/repos/asf/hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/eclipse-plugin/
>
> ==  Source and Intellectual Property Submission Plan ==
> The source, and any suggested initial patches, are already hosted either in 
> Apache’s Subversion or JIRA.
>
> ==  External Dependencies ==
> Eclipse Platform
> Eclipse Java Development Tools
>
> ==  Cryptography ==
> Hadoop Developm

Re: [VOTE] Graduate Bigtop podling from Apache Incubator

2012-09-13 Thread Tom White
+1

Tom

On Wed, Sep 12, 2012 at 5:08 PM, Roman Shaposhnik  wrote:
> This is a call for vote to graduate Bigtop podling from Apache Incubator
>
> The Apache Bigtop project entered incubator in June of 2011. Since then we
> have grown the community in users and contributors, and we've made
> significant improvements to the project. Following the Apache guidelines we
> have made four releases, we are preparing a 5th major and 6th maintenance
> releases, and we've added two new committers. The current set of committers
> and PPMC members are from different organizations and have demonstrated
> interest in growing the community further. We have learned the basis to manage
> the different aspects of an Apache project. The community of Bigtop is active,
> healthy and growing and has demonstrated the ability to self-govern using
> accepted Apache practices. Bigtop community has voted to proceed with
> graduation [1] and the result can be found at [2].
>
> Please cast your votes:
>
> [  ] +1 Graduate Bigtop podling from Apache Incubator
> [  ] +0 Indifferent to the graduation status of Bigtop podling
> [  ] -1 Reject graduation of Bigtop podling from Apache Incubator
>
> This vote will remain open for at least 72 hours from now (till 15 Sep
> 2012, NOON PST). Please find the proposed board resolution below.
>
> [1] http://s.apache.org/SGm
> [2] http://s.apache.org/LOA
>
> Thanks,
> Roman Shaposhnik
>
> X. Establish the Apache Bigtop Project
>
>WHEREAS, the Board of Directors deems it to be in the best
>interests of the Foundation and consistent with the
>Foundation's purpose to establish a Project Management
>Committee charged with the creation and maintenance of
>open-source software related to a system for integration,
>packaging, deployment and validation of a big data management
>software distribution based on Apache Hadoop
>for distribution at no charge to the public.
>
>NOW, THEREFORE, BE IT RESOLVED, that a Project Management
>Committee (PMC), to be known as the "Apache Bigtop Project",
>be and hereby is established pursuant to Bylaws of the
>Foundation; and be it further
>
>RESOLVED, that the Apache Bigtop Project be and hereby is
>responsible for the creation and maintenance of software
>related to a system for
>open-source software related to a system for integration,
>packaging, deployment and validation of a big data management
>software distribution based on Apache Hadoop; and be it further
>
>RESOLVED, that the office of "Vice President, Apache Bigtop" be
>and hereby is created, the person holding such office to
>serve at the direction of the Board of Directors as the chair
>of the Apache Bigtop Project, and to have primary responsibility
>for management of the projects within the scope of
>responsibility of the Apache Bigtop Project; and be it further
>
>RESOLVED, that the persons listed immediately below be and
>hereby are appointed to serve as the initial members of the
>    Apache Bigtop Project:
>
> * Alan Gatesga...@apache.org
> * Patrick Hunt  ph...@apache.org
> * Steve Loughranste...@apache.org
> * Tom White tomwh...@apache.org
>
> * Alejandro Abdelnurt...@apache.org
> * Andrew Bayer  aba...@apache.org
> * Konstantin Boudnikc...@apache.org
> * Stephen Chu   s...@apache.org
> * Bruno Mahébm...@apache.org
> * Peter Linnell plinn...@apache.org
> * James Pagejamesp...@apache.org
> * Patrick Taylor Ramsey p...@apache.org
> * Roman Shaposhnik  r...@apache.org
> * Michael Stack st...@apache.org
> * Andrei Savu   as...@apache.org
> * Edward J. Yoonedwardy...@apache.org
>
> * Andre Arcilla arci...@apache.org
> * Eli Collins   e...@apache.org
> * Travis Crawford   traviscrawf...@apache.org
> * John Sichij...@apache.org
> * Owen O'Malley omal...@apache.org
>
>NOW, THEREFORE, BE IT FURTHER RESOLVED, that Roman Shaposhnik
>be appointed to the office of Vice President, Apache Bigtop,
>to serve in accordance with and subject to the direction of the
>Board of Directors and the Bylaws of the Foundation until
>death, resignation, retirement,

Re: [VOTE] Apache Crunch (incubating) 0.3.0 Release Candidate 1

2012-09-12 Thread Tom White
+1

Verified signature and checksums. Ran a RAT report. Checked for binary
and SCM files. Read LICENSE, NOTICE, DISCLAIMER files - all look
correct. Built from source successfully.

Cheers,
Tom

On Tue, Sep 11, 2012 at 1:58 PM, Josh Wills  wrote:
> Hello everyone,
>
> This is a call for a vote on releasing the following candidate as Apache
> Crunch 0.3.0 (incubating). This will be our first release. A vote was held
> on the developer mailing list and passed with 4 +1s:
>
> http://markmail.org/thread/yvtvog5lrj3a7gep
>
> +1s:
> phunt (IPMC)
> jwills (binding)
> greid (binding)
> mafr (binding)
>
> We need two additional IPMC votes.
>
> The release fixes the issues listed here:
>
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12313526&version=12322446
>
> Please download, test, and vote by September 14th at 6AM Pacific Time.
>
> Source files:
> http://people.apache.org/~jwills/crunch-0.3.0-incubating-RC1/
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapachecrunch-040/
>
> The tag to be voted upon:
> https://git-wip-us.apache.org/repos/asf?p=incubator-crunch.git;a=tag;h=4666bd889f9b641d7c0157bc4401a1b985fedc89
>
> Crunch's KEYS file:
> http://www.apache.org/dist/incubator/crunch/KEYS
>
> The vote will be open for 72 hours.
>
> [ ] +1  approve
> [ ] +0  no opinion
> [ ] -1  disapprove (and reason why)
>
> Thank you,
> Josh

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Blur into the Apache Incubator

2012-07-25 Thread Tom White
+1

Tom

On Fri, Jul 20, 2012 at 12:42 PM, Aaron McCurry  wrote:
> I would like to call a vote for accepting Blur for incubation in the
> Apache Incubator. The full proposal is available below.
>
> Please cast your vote:
>
> [ ] +1, bring Blur into Incubator
> [ ] +0, I don't care either way,
> [ ] -1, do not bring Blur into Incubator, because...
>
> This vote will be open for 72 hours and only votes from the Incubator
> PMC are binding.
>
> Thank you for your consideration!
>
> Aaron
>
> http://wiki.apache.org/incubator/BlurProposal
>
> = Blur Proposal =
>
> == Abstract ==
> Blur is a search platform capable of searching massive amounts of data
> in a cloud computing environment. Blur leverages several existing
> Apache projects, including Apache Lucene, Apache Hadoop, Apache
> !ZooKeeper and Apache Thrift.  Both bulk and near real time (NRT)
> updates are possible with Blur.  Bulk updates are accomplished using
> Hadoop Map/Reduce and NRT are performed through direct Thrift calls.
>
> == Proposal ==
> Blur is an open source search platform capable of querying massive
> amounts of data at incredible speeds. Rather than using the flat,
> document-like data model used by most search solutions, Blur allows
> you to build rich data models and search them in a semi-relational
> manner similar to joins while querying a relational database. Using
> Blur, you can get precise search results against terabytes of data at
> Google-like speeds.  Blur leverages multiple open source projects
> including Hadoop, Lucene, Thrift and !ZooKeeper to create an
> environment where structured data can be transformed into an index
> that runs on a Hadoop cluster.  Blur uses the power of Map/Reduce for
> bulk indexing into Blur.  Server failures are handled automatically by
> using !ZooKeeper for cluster state and HDFS for index storage.
>
> == Background ==
> Blur was created by Aaron !McCurry in 2010. Blur was developed to
> solve the challenges in dealing with searching huge quantities of data
> that the traditional RDBMS solutions could not cope with while still
> providing JOIN-like capabilities to query the data.  Several other
> open source projects have implemented aspects of this design including
> elasticsearch, Katta and Apache Solr.
>
> == Rationale ==
> There is a need for a distributed search capability within the Hadoop
> ecosystem. Currently, there are no other search solutions that
> natively leverage HDFS and the failover features of Hadoop in the same
> manner as the Blur project. The communities we expect to be most
> interested in such a project are government, health care, and other
> industries where scalability is a concern. We have made much progress
> in developing this project over the past 2 years and believe both the
> project and the interested communities would benefit from this work
> being openly available and having open development.  In future
> versions of Blur the API will more closely follow the API’s provided
> in Lucene so that systems that already use Lucene can more easily
> scale with Blur. Blur can be viewed as a query execution engine that
> Lucene based solutions can utilize when scale becomes an issue.
>
> == Initial Goals ==
> The initial goals of the project are:
>  * To migrate the Blur codebase, issue tracking and wiki from
> github.com and integrate the project with the ASF infrastructure.
>  * Add new committers to the project and grow the community in "The Apache 
> Way".
>
> == Current Status ==
>
> === Meritocracy ===
> Blur was initially developed by Aaron !McCurry in June 2010.  Since
> then Blur has continued to evolve with the support of a small
> development team at Near Infinity.  As a part of the Apache Software
> Foundation, the Apache Blur team intends to strongly encourage the
> community to help with and contribute to the project.  Apache Blur
> will actively seek potential committers and help them become familiar
> with the codebase.
>
> === Community ===
> A small community has developed around Blur and several project teams
> are currently using Blur for their big data search capability. The
> source code is currently available on GitHub and there is a dedicated
> website (blur.io) that provides an overview of the project. Blur has
> been shared with several members of the Apache community and has been
> presented at the Bay Area HUG (see
> http://www.meetup.com/hadoop/events/20109471/).
>
> === Core Developers ===
> The current developers are employed by Near Infinity Corporation, but
> we anticipate interest developing among other companies.
>
> === Alignment ===
> Blur is built on top of a number of Apache projects; Hadoop, Lucene,
> !ZooKeeper, and Thrift. It builds with Maven.  During the course of
> Blur development, a couple of patches have been committed back to the
> Lucene project, including LUCENE-2205 and LUCENE-2215.  Due to the
> strong relationship with the before mentioned Apache projects, the
> incubator is a good match for Blur.
>
> == Known Ri

Re: [VOTE] Release Oozie 3.2.0-incubating (candidate 1)

2012-06-05 Thread Tom White
+1 I checked the signature and checksums, inspected the source tag,
ran a RAT report, checked the disclaimer, notice and license files.

Tom

On Fri, Jun 1, 2012 at 6:35 PM, Alejandro Abdelnur  wrote:
> Dear Incubator PMCs,
>
> Oozie Incubator community has voted on the Oozie 3.2.0-incubating release.
>
> The tally of the votes is:
>
> Results: 8 +1 votes, 0 -1 votes
>
>  developers/users: 3 +1 (mona, jeremy, roman)
>  committers: 4 +1 (harsh, virag, angelo, tucu)
>  mentors: 1 +1 (alan)
>
> Below you'll find the email sent to the Oozie developer alias with the VOTE
> request. There you'll find the details of the release and where to download
> the release artifact.
>
> Vote closes on Monday JUN/04 17:00 PST.
>
> Thanks
>
> Alejandro
>
> -- Forwarded message --
> From: Alejandro Abdelnur 
> Date: Tue, May 29, 2012 at 3:23 PM
> Subject: [VOTE] Release Oozie 3.2.0-incubating (candidate 1)
> To: oozie-...@incubator.apache.org
>
>
> Dear Oozie community,
>
> The release candidate 1 for for Oozie 3.2.0-incubating is available.
>
> Changes from candidate 0:
>
> * removes native files and adds instructions on how to test Pipe
> * consolidate rat report
> * minor fix in the demo example
>
>
> Some of the new features are:
>
> * Hive action
> * Sqoop action
> * Shell action
> * Tool to create/upgrade database schema
> * Kerberos SPNEGO authentication
> * Proxy User capabilities for Oozie users
> * Job ACLs support
> * Improved Job information over HTTP REST API
> * Improved Workflow validation
> * Sharelib directories per action
> * EL functions for Workflow and Coordinator applications
>
> In addition it includes several improvements for performance and
> stability and several bug fixes.
>
> Keys used to sign the release are available at
> http://svn.apache.org/viewvc/incubator/oozie/trunk/KEYS?view=markup.
>
> Please download, test, and try it out:
>
>   http://people.apache.org/~tucu/oozie-3.2.0-incubating-candidate-1
>
> The release, md5 signature, gpg signature, and rat report can all be
> found at the above URL.
>
> Vote closes on Friday JUN/01 15:30 PST.
>
> Thanks
>
> --
> Alejandro

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Crunch into the Apache Incubator

2012-05-24 Thread Tom White
ctured binary data formats
> like JSON, Apache Avro, and Apache Thrift, and is designed to be
> extensible to support working with any kind of data format that Java
> supports in its native form.
>
> == Initial Goals ==
>
> Crunch is currently in its first major release with a considerable
> number of enhancement requests, tasks, and issues recorded towards its
> future development. The initial goal of this project will be to
> continue to build community in the spirit of the "Apache Way", and to
> address the highly requested features and bug-fixes towards the next
> dot release.
>
> Some goals include:
>  * To stand up a sustaining Apache-based community around the Crunch codebase.
>  * Improved documentation of Java libraries and best practices.
>  * Support the ability to "fuse" logically independent pipeline stages
> that aggregate the same data in different ways into a single
> !MapReduce job.
>  * Performance, usability, and robustness improvements.
>  * Improving diagnostic reporting and debugging for individual !MapReduce 
> jobs.
>  * Providing a centralized place for contributed extensions and
> domain-specific applications.
>
> = Current Status =
>
> == Meritocracy ==
>
> Crunch was initially developed by Josh Wills in September 2011 at
> Cloudera. Developers external to Cloudera provided feedback, suggested
> features and fixes and implemented extensions of Crunch. Cloudera's
> engineering team has since maintained the project with Josh Wills, Tom
> White, and Brock Noland dedicated towards its improvement.
> Contributors to Crunch include developers from multiple organizations,
> including businesses and universities.
>
> == Community ==
>
> Crunch is currently used by a number of organizations all over the
> world. Crunch has an active and growing user and developer community
> with active participation in
> [[https://groups.google.com/a/cloudera.org/group/crunch-users/topics|user]]
> and 
> [[https://groups.google.com/a/cloudera.org/group/crunch-dev/topics|developer]]
> mailing lists.
>
> Since open sourcing the project, there have been eight individuals
> from five organizations who have contributed code.
>
> == Core Developers ==
>
> The core developers for Crunch are:
>  * Brock Noland: Wrote many of the test cases, user documentation, and
> contributed several bug fixes.
>  * Josh Wills: Josh wrote much of the original Crunch code.
>  * Gabriel Reid: Gabriel significantly improved Crunch's handling of
> Avro data and has contributed several bug fixes for the core planner.
>  * Tom White: Tom added several libraries for common !MapReduce
> pipeline operations, including the sort library and a library of set
> operations.
>  * Christian Tzolov: Christian has contributed several bug fixes for
> the Avro serialization module and the unit testing framework.
>  * Robert Chu: Robert did the left/right/outer join implementations
> for Crunch and fixed several bugs in the runtime configuration logic.
>
> Several of the core developers of Crunch have contributed towards
> Hadoop or related Apache projects and are familiar with Apache
> principles and philosophy for community driven software development.
>
> == Alignment ==
>
> Crunch complements several current Apache projects. It complements
> Hadoop !MapReduce by providing a higher-level API for developing
> complex data processing pipelines that require a sequence of
> !MapReduce jobs to perform. Crunch also supports Apache HBase in order
> to simplify the process of writing !MapReduce jobs that execute over
> HBase tables. Crunch makes extensive use of the Apache Avro data
> format as an internal data representation process that makes
> !MapReduce jobs execute quickly and efficiently.
>
> = Known Risks =
>
> == Orphaned Products ==
>
> Crunch is already deployed in production at multiple companies and
> they are actively participating in creating new features. Crunch is
> getting traction with developers and thus the risks of it being
> orphaned are minimal.
>
> == Inexperience with Open Source ==
>
> All code developed for Crunch has been open sourced by Cloudera under
> Apache 2.0 license.  All committers to Crunch are intimately familiar
> with the Apache model for open-source development and are experienced
> with working with new contributors.
>
> == Homogeneous Developers ==
>
> The initial set of committers is from a reduced set of organizations.
> However, we expect that once approved for incubation, the project will
> attract new contributors from diverse organizations and will thus grow
> organically. The submission of patches from developers from several
> different organizations is a strong indicat

Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-24 Thread Tom White
According to Clutch [1] the project has added 8 committers since it
entered incubation. Regarding diversity, committers from over four
organizations are actively involved in Flume development, which is
pretty healthy. There does seem to be a need to have more diversity at
the PPMC level, however, so that's something that could be worked on.

Tom

[1] http://incubator.apache.org/clutch.html

On Thu, May 24, 2012 at 2:06 PM, Dave Fisher  wrote:
>
> On May 24, 2012, at 11:49 AM, Ralph Goers wrote:
>
>>
>> On May 24, 2012, at 10:40 AM, Arvind Prabhakar wrote:
>>
>>> Hi,
>>>
>>> On Thu, May 24, 2012 at 12:19 AM, Ralph Goers 
>>> wrote:
>>>
 The ONLY issue I see for Flume to graduate is diversity.  No one will
 convince me that the current makeup constitutes diversity of any kind.

 Perhaps I shouldn't have brought up the mailing list issues as that was
 only meant in the spirit of trying to offer some advice on how more
 diversity could be achieved.  Flume is really the only community I
 participate in that contains Cloudera employees so I do find myself
 wondering if the way the project is run is because that is the way all
 projects with a large number of Cloudera employees are run.  That might
 make all of those participants comfortable but might create a barrier to
 others.

>>>
>>> Here are the committers who have been active in the past three months:
>>>
>>> * Brock Noland (Cloudera)
>>> * Hari Shreedharan  (Cloudera)
>>> * Jarek Jarcec Cecho (AVG Technologies)
>>> * Juhani Connolly   (CyberAgent)
>>> * Mike Percy (Cloudera)
>>> * Mingjie Lai (Trend Micro)
>>> * Prasad Mujumdar (Cloudera)
>>> * Will McQueen (Cloudera)
>>> * Arvind Prabhakar (Cloudera)
>>>
>>> There are four companies represented in this list: AVG Technologies,
>>> Cloudera, CyberAgent and Trend Micro. Compared to other projects that have
>>> successfully graduated from Incubator in the past, this meets the diversity
>>> requirements very well.
>>
>> I was mistaken and the list above is indeed correct.  For some reason I 
>> thought a couple of them had become Cloudera employees.
>>
>> However, none of those three are currently on the PPMC.  When you look at 
>> the PPMC list you should also include a few more Cloudera people who do 
>> participate in release votes and PPMC issues. Most, if not all, of the 
>> non-Cloudera PMC members don't.
>
> I started reading some of the Flume website and I think that when you go to 
> the main Wiki page:
>
> https://cwiki.apache.org/confluence/display/FLUME/Index
>
> When you click on the "Flume Cookbook" the resource is at cloudera.org.
>
> http://archive.cloudera.com/cdh/3/flume/Cookbook/
>
> This page lists "flume-...@cloudera.org" and is a file with a revision dated 
> May 7, 2012.
>
> You can make you own conclusions, but it looks like podling resources need to 
> be migrated to the ASF.
>
> Regards,
> Dave
>
>>
>>
>>
>>>
>>>

 In any case - I'm not insisting that the way the project is run needs to
 change. I'm simply saying I cannot support graduation with the current
 makeup of the committers and PMC. I don't have a hard and fast ratio -
 gaining 10 new unaffiliated committers who don't do much isn't nearly as
 good as 2 or 3 who are very active.  Ultimately the project needs to figure
 out how to solve this.

>>>
>>> Stating that some committers "who don't do much isn't nearly as good as 2
>>> or 3 who are very active" is an unfair characterization. This is more
>>> unfair for those who are part of the project but have not been active
>>> lately due to whatever reasons, but have played a foundational role in
>>> getting the project to a point where it is today. I think they are as
>>> important as any other committer who may be very active at the moment.
>>> Merit once earned, never expires [1].
>>>
>>> [1] http://www.apache.org/dev/committers.html#committer-set-term
>>
>> I think you misunderstood my point or I didn't state it very well.  
>> Diversity isn't achieved simply by having bodies.  IOW I am not suggesting 
>> offering commit rights to people who haven't earned it just to meet some 
>> ratio.  However, I am not suggesting the project has ever even considered 
>> doing that.
>>
>> Ralph
>>
>>
>>
>> -
>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> For additional commands, e-mail: general-h...@incubator.apache.org
>>
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Apache MRUnit from Incubator

2012-05-04 Thread Tom White
+1 to graduate MRUnit.

Cheers,
Tom

On Thu, May 3, 2012 at 7:31 PM, Jim Donofrio  wrote:
> We havent heard anything +1 or -1 from any IPMC members besides our mentors.
> Any thoughts on this vote?
>
> We released 0.9.0-incubating on Tuesday so we have completed 4 releases and
> added 4 new commiters since the beginning of incubation
>
> To resummarize the current vote is below:
>
>
> 7 +1's
> 0 0's
> 0 -1's
>
> IPMC +1
> Patrick Hunt
> Chris Mattmann
>
> PPMC +1
> Brock Noland
> Dave Beech
> Jim Donofrio
> Jarek Jarcec Cecho
>
> Others +1
> Joey Echeverria
>
> On 04/28/2012 12:11 PM, Mattmann, Chris A (388J) wrote:
>>
>> Hi Jim,
>>
>> Yep, we need more VOTEs than 2 (3 I believe, but it would be nice to have
>> a bit more -- though not required). There's been a lot of traffic on
>> general@incbuator
>> lately so folks are probably just busy.
>>
>> I would wait until tonight or tomorrow and poll for some more VOTEs on the
>> VOTE thread.
>>
>> Once we get the required VOTEs, you can close the VOTE, and I can add the
>> resolution to
>> the board agenda.
>>
>> Cheers,
>> Chris
>>
>> On Apr 28, 2012, at 6:35 AM, Jim Donofrio wrote:
>>
>>> How many IPMC votes are required for graduation?
>>>
>>> We got 2 IPMC votes so far from mentors but havent gotten any on the
>>> general@ list. Since the vote has been open for more than 72 hours, does
>>> this mean we cant graduate yet?
>>>
>>>
>>> On 04/23/2012 11:56 PM, Jim Donofrio wrote:

 We havent heard anything on the DISCUSS thread since posting it over 72
 hours ago so I am starting a VOTE thread following Chris Mattmann's
 recommendation. I will leave the vote open for 72 hours.

 The current vote is below copying from the community vote [2] that
 passed:

 7 +1's
 0 0's
 0 -1's

 IPMC +1
 Patrick Hunt
 Chris Mattmann

 PPMC +1
 Brock Noland
 Dave Beech
 Jim Donofrio
 Jarek Jarcec Cecho

 Others +1
 Joey Echeverria


 In the last MRUnit incubator report [1] the 3 blockers were:
 * Grow the community size and diversity
 * Make another incubating release
 * Construct an MRUnit website to replace the existing stub

 We have since:
 * Added 2 new committers/PPMC members
 * 0.9.0-incubating will get released soon, pending one more IPMC +1
 * We have a new website

  From the beginning of incubation we have:
 * Added 4 new committers/PPMC members
 * Done 4 releases once 0.9.0-incubating is released soon, pending one
 more IPMC +1
 * Created a real website

 [1]:

 http://incubator.apache.org/mrunit/ppmc/incubator_reports.html#march-2012
 [2]:

 http://mail-archives.apache.org/mod_mbox/incubator-mrunit-dev/201204.mbox/%3C4F91FED1.2010609%40gmail.com%3E


 X. Establish the Apache MRUnit Project

 WHEREAS, the Board of Directors deems it to be in the best
 interests of the Foundation and consistent with the
 Foundation's purpose to establish a Project Management
 Committee charged with the creation and maintenance of
 open-source software related to unit testing Apache Hadoop map
 reduce jobs for distribution at no charge to the public.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management
 Committee (PMC), to be known as the "Apache MRUnit Project",
 be and hereby is established pursuant to Bylaws of the
 Foundation; and be it further

 RESOLVED, that the Apache MRUnit Project be and hereby is
 responsible for the creation and maintenance of software
 related to unit testing Apache Hadoop map reduce jobs;
 and be it further

 RESOLVED, that the office of "Vice President, Apache MRUnit" be
 and hereby is created, the person holding such office to
 serve at the direction of the Board of Directors as the chair
 of the Apache MRUnit Project, and to have primary responsibility
 for management of the projects within the scope of
 responsibility of the Apache MRUnit Project; and be it further

 RESOLVED, that the persons listed immediately below be and
 hereby are appointed to serve as the initial members of the
 Apache MRUnit Project:

 * Brock Noland br...@apache.org
 * Patrick Hunt ph...@apache.org
 * Nigel Daley ni...@apache.org
 * Eric Sammer esam...@apache.org
 * Aaron Kimball kimba...@apache.org
 * Konstantin Boudnik c...@apache.org
 * Garrett Wu g...@apache.org
 * Jim Donofrio jdonof...@apache.org
 * Jarek Jarcec Cecho jar...@apache.org
 * Dave Beech dbe...@apache.org

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Brock Noland
 be appointed to the office of Vice President, Apache MRUnit, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification,
 or until a s

Re: [VOTE] Release MRUnit version 0.9.0-incubating-rc2

2012-04-26 Thread Tom White
+1 checked checksums, signatures, license/notice/disclaimer, svn tag.

Tom

On Wed, Apr 18, 2012 at 1:22 PM, Brock Noland  wrote:
> This is an incubator release for Apache MRUnit, version 0.9.0-incubating.
>
> It fixes the following issues:
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311292&version=12316360
>
> *** Please download, test and vote by [3 working days after sending].
>
> Note that we are voting upon the source (tag), binaries are provided
> for convenience.
>
> Source and binary files:
> http://people.apache.org/~brock/mrunit-0.9.0-incubating-candidate-2/
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapachemrunit-070/
>
> The tag to be voted upon:
> http://svn.apache.org/repos/asf/incubator/mrunit/tags/release-0.9.0-incubating/
>
> MRUnit's KEYS file containing PGP keys we use to sign the release:
> http://svn.apache.org/repos/asf/incubator/mrunit/dist/KEYS
>
> Note that the Incubator PMC needs to vote upon the release after a
> successful PPMC vote before any release can be made official.
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Apache Mesos 0.9.0-incubating (RC5)

2012-04-26 Thread Tom White
+1 checked checksums and signatures; license, notice, disclaimer and
svn tag all look correct.

Tom

On Thu, Apr 19, 2012 at 12:53 PM, Benjamin Hindman  wrote:
> Please vote on releasing the following candidate as Apache Mesos
> (incubating) version 0.9.0. This will be the first incubator release for
> Mesos in Apache, but the sixth release candidate.
>
> Changes since RC4:
>  * Updated NOTICE to include project name and copyright date as well as to
> include third-party licences.
>  * Changed one of our third-party components to be included as an archive
> of it's source rather than a binary bundle (Python egg).
>  * Added DISCLAIMER.
>
> The candidate for Mesos 0.9.0-incubating release is available at:
>
> http://people.apache.org/~benh/mesos-0.9.0-incubating-RC5/mesos-0.9.0-incubating.tar.gz
>
> The tag to be voted on:
>
> https://svn.apache.org/repos/asf/incubator/mesos/tags/release-0.9.0-incubating-RC5
>
> The MD5 checksum of the tarball can be found at:
>
> http://people.apache.org/~benh/mesos-0.9.0-incubating-RC5/mesos-0.9.0-incubating.tar.gz.md5
>
> The signature of the tarball can be found at:
>
> http://people.apache.org/~benh/mesos-0.9.0-incubating-RC5/mesos-0.9.0-incubating.tar.gz.asc
>
> Mesos' KEYS file, containing the PGP keys used to sign the release:
>  http://svn.apache.org/repos/asf/incubator/mesos/dist/KEYS
>
> Please vote on releasing this package as Apache Mesos 0.9.0-incubating!
>
> The vote is open until Monday, April 23rd at 8 pm (a bit more than 72 hours
> since it's over the weekend) and passes if a majority of at least 3 +1 IPMC
> votes are cast.
>
> [ ] +1 Release this package as Apache Mesos 0.9.0-incubating
> [ ] -1 Do not release this package because ...
>
> To learn more about Apache Mesos, please see
> http://incubator.apache.org/mesos.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] CloudStack for Apache Incubator

2012-04-10 Thread Tom White
+1 (binding)

Tom

On Mon, Apr 9, 2012 at 6:32 PM, Kevin Kluge  wrote:
> Hi All.  I'd like to call for a VOTE for CloudStack to enter the Incubator.  
> The proposal is available at [1] and I have also included it below.   Please 
> vote with:
> +1: accept CloudStack into Incubator
> +0: don't care
> -1: do not accept CloudStack into Incubator (please explain the objection)
>
> The vote is open for at least 72 hours from now (until at least 19:00 US-PST 
> on April 12, 2012).
>
> Thanks for the consideration.
>
> -kevin
>
> [1] http://wiki.apache.org/incubator/CloudStackProposal
>
>
>
>
> Abstract
>
> CloudStack is an IaaS ("Infrastracture as a Service") cloud orchestration 
> platform.
>
> Proposal
>
> CloudStack provides control plane software that can be used to create an IaaS 
> cloud. It includes an HTTP-based API for user and administrator functions and 
> a web UI for user and administrator access. Administrators can provision 
> physical infrastructure (e.g., servers, network elements, storage) into an 
> instance of CloudStack, while end users can use the CloudStack self-service 
> API and UI for the provisioning and management of virtual machines, virtual 
> disks, and virtual networks.
>
> Citrix Systems, Inc. submits this proposal to donate the CloudStack source 
> code, documentation, websites, and trademarks to the Apache Software 
> Foundation ("ASF").
>
> Background
>
> Amazon and other cloud pioneers invented IaaS clouds. Typically these clouds 
> provide virtual machines to end users. CloudStack additionally provides 
> baremetal OS installation to end users via a self-service interface. The 
> management of physical resources to provide the larger goal of cloud service 
> delivery is known as "orchestration". IaaS clouds are usually described as 
> "elastic" -- an elastic service is one that allows its user to rapidly scale 
> up or down their need for resources.
>
> A number of open source projects and companies have been created to implement 
> IaaS clouds. Cloud.com started CloudStack in 2008 and released the source 
> under GNU General Public License version 3 ("GPL v3") in 2010. Citrix 
> acquired Cloud.com, including CloudStack, in 2011. Citrix re-licensed the 
> CloudStack source under Apache License v2 in April, 2012.
>
> Rationale
>
> IaaS clouds provide the ability to implement datacenter operations in a 
> programmable fashion. This functionality is tremendously powerful and 
> benefits the community by providing:
>
> - More efficient use of datacenter personnel
> - More efficient use of datacenter hardware
> - Better responsiveness to user requests
> - Better uptime/availability through automation
>
> While there are several open source IaaS efforts today, none are governed by 
> an independent foundation such as ASF. Vendor influence and/or proprietary 
> implementations may limit the community's ability to choose the hardware and 
> software for use in the datacenter. The community at large will benefit from 
> the ability to enhance the orchestration layer as needed for particular 
> hardware or software support, and to implement algorithms and features that 
> may reduce cost or increase user satisfaction for specific use cases. In this 
> respect the independent nature of the ASF is key to the long term health and 
> success of the project.
>
> Initial Goals
>
> The CloudStack project has two initial goals after the proposal is accepted 
> and the incubation has begun.
>
> The Cloudstack Project's first goal is to ensure that the CloudStack source 
> includes only third party code that is licensed under the Apache License or 
> open source licenses that are approved by the ASF for use in ASF projects. 
> The CloudStack Project has begun the process of removing third party code 
> that is not licensed under an ASF approved license. This is an ongoing 
> process that will continue into the incubation period. Third party code 
> contributed to CloudStack under the CloudStack contribution agreement was 
> assigned to Cloud.com in exchange for distributing CloudStack under GPLv3. 
> The CloudStack project has begun the process of amending the previous 
> CloudStack contribution agreements to obtain consent from existing 
> contributors to change the CloudStack project's license. In the event that an 
> existing contributor does not consent to this change, the project is prepared 
> to remove that contributor's code. Additionally, there are binary 
> dependencies on redistributed libraries that are not provided with an 
> ASF-approved license. Finally, the CloudStack has source files incorporated 
> from third parties that were not provided with an ASF-approved license. We 
> have begun the process of re-writing this software. This is an ongoing 
> process that will extend into the incubation period. These issues are 
> discussed in more detail later in the proposal.
>
> Although CloudStack is open source, many design documents and discussions 
> that should have been publicly

Re: [VOTE] Release MRUnit version 0.8.1-incubating

2012-02-25 Thread Tom White
+1

Verified checksums and signature, ran unit tests, checked license and
notice. I also successfully ran MRUnit tests against Hadoop 1.0 and
0.23 code.

Tom

On Fri, Feb 17, 2012 at 12:22 PM, Brock Noland  wrote:
> This is an incubator release for Apache MRUnit, version 0.8.1-incubating.
>
> It fixes the following issues:
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311292&version=12319881
>
> *** Please download, test and vote by [3 working days after sending].
>
> Note that we are voting upon the source (tag), binaries are provided
> for convenience.
>
> Source and binary files:
> http://people.apache.org/~brock/mrunit-0.8.1-incubating-candidate-1/
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapachemrunit-022/
>
> The tag to be voted upon:
> http://svn.apache.org/repos/asf/incubator/mrunit/tags/release-0.8.1-incubating/
>
> MRUnit's KEYS file containing PGP keys we use to sign the release:
> http://svn.apache.org/repos/asf/incubator/mrunit/dist/KEYS
>
> Note that the Incubator PMC needs to vote upon the release after a
> successful PPMC vote before any release can be made official.
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Sqoop version 1.4.1-incubating

2012-02-13 Thread Tom White
+1 Unpacked and ran the unit tests successfully. Checksums and
signature are good. RAT passes. LICENSE, NOTICE and DISCLAIMER look
ok.

Tom

On Fri, Feb 10, 2012 at 11:27 AM, Jarek Jarcec Cecho  wrote:
> This is the second incubator release for Apache Sqoop, version 
> 1.4.1-incubating.
>
> We already have one IPMC vote on dev list (Oliver Lamy) and we're looking for 
> additional two.
>
> *** Please cast your vote by 2012-02-14 ***
>
> Dev list vote thread:
> http://markmail.org/message/alpqeqwliq4xbbcc
>
> The list of fixed issues:
> http://svn.apache.org/repos/asf/incubator/sqoop/branches/branch-1.4.1/CHANGES.txt
>
> The tarball (*.tar.gz), signature (*.asc), checksum (*.md5),
> and test result (log/*.ant_test.log):
> http://people.apache.org/~jarcec/sqoop-1.4.1-incubating-rc3/
>
> The tag to be voted upon:
> http://svn.apache.org/repos/asf/incubator/sqoop/tags/release-1.4.1-rc3
>
> The KEYS file:
> http://www.apache.org/dist/incubator/sqoop/KEYS

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Sqoop version 1.4.0-incubating (with release candidate rc1)

2011-11-23 Thread Tom White
+1

Checked signatures/md5, ran RAT, tag looks good.

Cheers,
Tom

On Sun, Nov 20, 2011 at 11:40 PM, Bilung Lee  wrote:
> This will be the first incubator release for Apache Sqoop, version
> 1.4.0-incubating.
>
> We got two IPMC votes from our dev list and are looking for a third.
>  Thanks!
>
> *** Please cast the vote by November 23, 2011 ***
>
> Dev list vote result:
> http://markmail.org/message/jehsoo2vi6xboovu
>
> The list of fixed issues:
> http://svn.apache.org/repos/asf/incubator/sqoop/branches/branch-1.4.0/CHANGES.txt
>
> The tarball (*.tar.gz), signature (*.asc), checksum (*.md5), license audit
> result (log/*.ant_releaseaudit.log), and test result (log/*.ant_test.log):
> http://people.apache.org/~blee/sqoop-1.4.0-incubating-rc1/
>
> The tag to be voted upon:
> http://svn.apache.org/repos/asf/incubator/sqoop/tags/release-1.4.0-rc1
>
> The KEYS file:
> http://www.apache.org/dist/incubator/sqoop/KEYS
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release MRUnit version 0.5.0-incubating

2011-11-22 Thread Tom White
+1

Signatures and checksums look good. I could successfully build from
source. RAT ran cleanly.

Cheers,
Tom

On Fri, Nov 4, 2011 at 1:19 PM, Brock Noland  wrote:
> This is the first incubator release for Apache MRUnit, version
> 0.5.0-incubating.
>
> It fixes the following issues:
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311292&version=12316328
>
> *** Please download, test and vote by [3 working days after sending].
>
> Note that we are voting upon the source (tag), binaries are provided
> for convenience.
>
> Source and binary files:
> http://people.apache.org/~brock/mrunit-0.5.0-incubating-candidate-0
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapachemrunit-149/
>
> The tag to be voted upon:
> http://svn.apache.org/repos/asf/incubator/mrunit/tags/release-0.5.0-incubating/
>
> MRUnit's KEYS file containing PGP keys we use to sign the release:
> http://svn.apache.org/repos/asf/incubator/mrunit/dist/KEYS
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release for Bigtop version 0.2.0-incubating RC2

2011-11-06 Thread Tom White
+1

I verified checksums and signature. NOTICE, LICENSE, and DISCLAIMER
files look good. I ran RAT and the files without licenses seem to be
test data.

Tom

On Fri, Nov 4, 2011 at 2:00 PM, Roman Shaposhnik  wrote:
> This takes care of Patrick's RAT concerns. From now on you can run RAT by:
>  $ mvn -Prelease install
> at the top level of a project. This is THE only change between RC1 and RC2
> and given how cosmetic it is I'd like to think that all the existing
> +1 votes don't
> need to be recast. Please let me know if that's not the case.
>
> =
> This is the second incubator release for Apache Bigtop, version
> 0.2.0-incubating.
>
> It fixes the following issues:
>  https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12317591&projectId=12311420
>
> *** Please download, test, and vote by Saturday, November 5
>
> Note that we are voting on the source (tag): release-0.2.0-incubating-RC2
>
> Source tarball, checksums, signature:
>    http://people.apache.org/~rvs/bigtop-0.2.0-incubating-RC2/
>
> The tag to be voted on:
>   
> https://svn.apache.org/repos/asf/incubator/bigtop/tags/release-0.2.0-incubating-RC2/
>
> Bigtop's KEYS file, containing the PGP keys used to sign the release:
>   http://svn.apache.org/repos/asf/incubator/bigtop/dist/KEYS
>
> Note that the Incubator PMC needs to vote on the release after a successful
> PPMC vote before any release can be made official.
>
> Thanks!
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Whirr version 0.6.0-incubating

2011-08-24 Thread Tom White
+1

I checked signatures and checksums, ran RAT, and started a cluster.

Cheers,
Tom

On Mon, Aug 22, 2011 at 10:37 PM, Andrei Savu  wrote:
> This is the first incubator release for Apache Whirr, version 0.6.0-incubating
> and the last release as an incubator project.
>
> It fixes the following issues:
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=1230&version=12316468
>
> *** Please download, test and vote by August 26, 2011.
>
> Note that we are voting upon the source (tag), binaries are provided
> for convenience.
>
> Source and binary files:
> http://people.apache.org/~asavu/whirr-0.6.0-incubating-candidate-0
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapachewhirr-058
>
> The tag to be voted upon:
> http://svn.apache.org/repos/asf/incubator/whirr/tags/release-0.6.0-incubating
>
> Whirr's KEYS file containing PGP keys we use to sign the release:
> http://svn.apache.org/repos/asf/incubator/whirr/dist/KEYS
>
> Note that the Incubator PMC needs to vote upon the release after a
> successful PPMC vote before any release can be made official.
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release for Bigtop version 0.1.0-incubating RC2

2011-08-24 Thread Tom White
+1

I verified signatures and checksums, and ran RAT.

Cheers,
Tom

On Mon, Aug 22, 2011 at 11:07 AM, Andrew Bayer  wrote:
> This is the first incubator release for Apache Bigtop, version
> 0.1.0-incubating.
>
> It fixes the following issues:
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12317549&styleName=Html&projectId=12311420
>
> *** Please download, test, and vote by Thursday, August 24 (3 working days
> from now)
>
> Note that we are voting on the source (tag).
>
> Source tarball, checksums, signature:
> http://people.apache.org/~abayer/bigtop-0.1.0-incubating-candidate-2/
>
> The tag to be voted on:
> http://svn.apache.org/repos/asf/incubator/bigtop/tags/release-0.1.0-incubating-RC2
> (svn rev. 1160352)
>
> Bigtop's KEYS file, containing the PGP keys used to sign the release:
> http://svn.apache.org/repos/asf/incubator/bigtop/dist/KEYS
>
> Note that the Incubator PMC needs to vote on the release after a successful
> PPMC vote before any release can be made official.
>
> Thanks!
>
> A.
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE][RESULT] Graduation of the Whirr Podling

2011-08-09 Thread Tom White
On Tue, Aug 9, 2011 at 12:44 AM, Robert Burrell Donkin
 wrote:
> On Tue, Aug 9, 2011 at 8:28 AM, Tom White  wrote:
>> The vote passes, with 4 binding +1 votes (plus an additional two IPMC
>> votes from the Whirr dev thread [1]), 1 non-binding +1 vote, no 0
>> votes, and no -1 votes.
>>
>> The binding +1 votes:
>> Chris A. Mattmann
>> Mohammad Nour El-Din
>> Doug Cutting
>> Stefan Seelmann
>
> (Just for the record) I tried to cast another binding +1 vote using
> the boxes [1] (but on reflection I made it easy to miss by cluttering
> the VOTE thread with noise)

Sorry I missed that in the tally. Thanks for voting.

Cheers,
Tom

>
> Congratulations on graduation
>
> Robert
>
> http://mail-archives.apache.org/mod_mbox/incubator-general/201108.mbox/%3CCAKTa1mjReAgUj7Sqx2NJjCL3C49g0FTKWE4w4NzcPky+=go...@mail.gmail.com%3E
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE][RESULT] Graduation of the Whirr Podling

2011-08-09 Thread Tom White
The vote passes, with 4 binding +1 votes (plus an additional two IPMC
votes from the Whirr dev thread [1]), 1 non-binding +1 vote, no 0
votes, and no -1 votes.

The binding +1 votes:
Chris A. Mattmann
Mohammad Nour El-Din
Doug Cutting
Stefan Seelmann

Non-binding +1 votes:

Hyunsik Choi

Thanks everyone for voting.
Tom

[1] 
http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201107.mbox/%3ccabqr8u8nttkqxjerp-txnn4jljrvqyrsrqxuesjiyq2td53...@mail.gmail.com%3E

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Graduation of the Whirr Podling

2011-08-03 Thread Tom White
Hi everyone,

The Whirr podling joined the incubator in May 2010. Since then it has
made 5 releases following Apache guidelines, added 4 new committers,
and added 3 new PPMC members. The community is healthy and growing,
and we've shown an ability to self-govern using accepted Apache
practices.

The Whirr podling has now voted to graduate:

Vote: 
http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201107.mbox/%3ccabqr8u_+mk8w_z-4vu-+9mwg+c+r4c1kmuhxa1drmulujnu...@mail.gmail.com%3E
Result: 
http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201107.mbox/%3ccabqr8u8nttkqxjerp-txnn4jljrvqyrsrqxuesjiyq2td53...@mail.gmail.com%3E

The vote received 7 PPMC approvals, of which 3 were also IPMC members
(Patrick Hunt, Doug Cutting, and myself).

I would like to ask the IPMC to approve the graduation.

[  ] +1 - I approve of the Whirr graduation
[  ] +0 - I have no opinion
[  ] -1 - There's an issue with graduation at this time, which is

Voting will be open for 72 hours. Please find the proposed board
resolution below.

Thanks
Tom

## Resolution to create a TLP from graduating Incubator podling

X. Establish the Apache Whirr Project

   WHEREAS, the Board of Directors deems it to be in the best
   interests of the Foundation and consistent with the
   Foundation's purpose to establish a Project Management
   Committee charged with the creation and maintenance of
   open-source software related to running services on cloud
   infrastructure for distribution at no charge to the public.

   NOW, THEREFORE, BE IT RESOLVED, that a Project Management
   Committee (PMC), to be known as the "Apache Whirr Project",
   be and hereby is established pursuant to Bylaws of the
   Foundation; and be it further

   RESOLVED, that the Apache Whirr Project be and hereby is
   responsible for the creation and maintenance of software
   related to running services on cloud infrastructure;
   and be it further

   RESOLVED, that the office of "Vice President, Apache Whirr" be
   and hereby is created, the person holding such office to
   serve at the direction of the Board of Directors as the chair
   of the Apache Whirr Project, and to have primary responsibility
   for management of the projects within the scope of
   responsibility of the Apache Whirr Project; and be it further

   RESOLVED, that the persons listed immediately below be and
   hereby are appointed to serve as the initial members of the
   Apache Whirr Project:

 * Adrian Cole  
 * Lars George  
 * Patrick Hunt 
 * Tibor Kiss   
 * Johan Oskarsson  
 * Andrew Purtell   
 * Andrei Savu  
 * Tom White

   NOW, THEREFORE, BE IT FURTHER RESOLVED, that Tom White
   be appointed to the office of Vice President, Apache Whirr, to
   serve in accordance with and subject to the direction of the
   Board of Directors and the Bylaws of the Foundation until
   death, resignation, retirement, removal or disqualification,
   or until a successor is appointed; and be it further

   RESOLVED, that the initial Apache Whirr PMC be and hereby is
   tasked with the creation of a set of bylaws intended to
   encourage open development and increased participation in the
   Apache Whirr Project; and be it further

   RESOLVED, that the Apache Whirr Project be and hereby
   is tasked with the migration and rationalization of the Apache
   Incubator Whirr podling; and be it further

   RESOLVED, that all responsibilities pertaining to the Apache
   Incubator Whirr podling encumbered upon the Apache Incubator
   Project are hereafter discharged.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Oozie to join the Incubator

2011-07-01 Thread Tom White
+1

Tom

On Wed, Jun 29, 2011 at 8:10 PM, Mohammad Islam  wrote:
> Hi All,
>
> The discussion about Oozie proposal is settling down. Therefore I would like 
> to
> initiate a vote to accept Oozie as an Apache Incubator project.
>
> The latest proposal is pasted at the end and it could be found in the wiki as
> well:
>
> http://wiki.apache.org/incubator/OozieProposal
>
>
> The related discussion thread is at:
> http://www.mail-archive.com/general@incubator.apache.org/msg29633.html
>
>
> Please cast your votes:
>
> [  ] +1 Accept Oozie for incubation
> [  ] +0 Indifferent to Oozie incubation
> [  ] -1 Reject Oozie for incubation
>
> This vote will close 72 hours  from now.
>
> Regards,
> Mohammad
>
>
> Abstract
> Oozie is a server-based workflow scheduling and coordination system to manage
> data processing jobs for Apache HadoopTM.
>
> Proposal
> Oozie is an  extensible, scalable and reliable system to define, manage,
> schedule,  and execute complex Hadoop workloads via web services. More
> specifically, this includes:
>
>        * XML-based declarative framework to specify a job or a complex 
> workflow of
> dependent jobs.
>
>        * Support different types of job such as Hadoop Map-Reduce, Pipe, 
> Streaming,
> Pig, Hive and custom java applications.
>
>        * Workflow scheduling based on frequency and/or data availability.
>        * Monitoring capability, automatic retry and failure handing of jobs.
>        * Extensible and pluggable architecture to allow arbitrary grid 
> programming
> paradigms.
>
>        * Authentication, authorization, and capacity-aware load throttling to 
> allow
> multi-tenant software as a service.
>
> Background
> Most data  processing applications require multiple jobs to achieve their 
> goals,
> with inherent dependencies among the jobs. A dependency could be  sequential,
> where one job can only start after another job has finished.  Or it could be
> conditional, where the execution of a job depends on the  return value or 
> status
> of another job. In other cases, parallel  execution of multiple jobs may be
> permitted – or desired – to exploit  the massive pool of compute nodes 
> provided
> by Hadoop.
>
> These  job dependencies are often expressed as a Directed Acyclic Graph, also
> called a workflow. A node in the workflow is typically a job (a  computation 
> on
> the grid) or another type of action such as an eMail  notification. 
> Computations
> can be expressed in map/reduce, Pig, Hive or  any other programming paradigm
> available on the grid. Edges of the graph  represent transitions from one node
> to the next, as the execution of a  workflow proceeds.
>
> Describing  a workflow in a declarative way has the advantage of decoupling 
> job
> dependencies and execution control from application logic. Furthermore,  the
> workflow is modularized into jobs that can be reused within the same  workflow
> or across different workflows. Execution of the workflow is  then driven by a
> runtime system without understanding the application  logic of the jobs. This
> runtime system specializes in reliable and  predictable execution: It can 
> retry
> actions that have failed or invoke a  cleanup action after termination of the
> workflow; it can monitor  progress, success, or failure of a workflow, and 
> send
> appropriate alerts  to an administrator. The application developer is relieved
> from  implementing these generic procedures.
>
> Furthermore,  some applications or workflows need to run in periodic intervals
> or  when dependent data is available. For example, a workflow could be  
> executed
> every day as soon as output data from the previous 24 instances  of another,
> hourly workflow is available. The workflow coordinator  provides such 
> scheduling
> features, along with prioritization, load  balancing and throttling to 
> optimize
> utilization of resources in the  cluster. This makes it easier to maintain,
> control, and coordinate  complex data applications.
>
> Nearly  three years ago, a team of Yahoo! developers addressed these critical
> requirements for Hadoop-based data processing systems by developing a  new
> workflow management and scheduling system called Oozie. While it was  
> initially
> developed as a Yahoo!-internal project, it was designed and  implemented with
> the intention of open-sourcing. Oozie was released as a GitHub project in 
> early
> 2010. Oozie is used in production within Yahoo and  since it has been
> open-sourced it has been gaining adoption with  external developers
>
> Rationale
> Commonly,  applications that run on Hadoop require multiple Hadoop jobs in 
> order
> to  obtain the desired results. Furthermore, these Hadoop jobs are commonly  a
> combination of Java map-reduce jobs, Streaming map-reduce jobs, Pipes
> map-reduce jobs, Pig jobs, Hive jobs, HDFS operations, Java programs  and 
> shell
> scripts.
>
> Because  of this, developers find themselves writing ad-hoc glue programs to
> combine these Hadoop jobs

[VOTE][RESULT] Accept Bigtop for incubation

2011-06-20 Thread Tom White
The 72 hour voting period has elapsed, and the vote has passed, with
19 +1 votes (12 binding), no -1 votes, and no 0 votes.

Binding votes (IPMC members):

Chris A. Mattmann
Leif Hedstrom
Patrick Hunt
Ralph Goers
Michael McCandless
Robert Burrell Donkin
Nigel Daley
Tom White
Mohammad Nour El-Din
Ian Holsman
Alan Gates
Doug Cutting

Non-binding votes:

Ashish
Eli Collins
Arvind Prabhakar
Eric Sammer
Paul Zimdars
Edward J. Yoon
Eric Charles

Thanks to everyone who voted.

Cheers,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Bigtop for incubation

2011-06-18 Thread Tom White
+1

Tom

On Fri, Jun 17, 2011 at 10:15 AM, Tom White  wrote:
> As there are no active discussions on the proposal thread, I would
> like to initiate a vote to accept Bigtop as an Apache Incubator
> project.
>
> The proposal is available at
>
> http://wiki.apache.org/incubator/BigtopProposal?action=recall&rev=13
>
> I've also put a copy of the proposal at the end of this email.
>
> The discussion thread is available at
>
> http://mail-archives.apache.org/mod_mbox/incubator-general/201106.mbox/%3cbanlktimriyvs5g5maklqvinauz9h6s5...@mail.gmail.com%3E
>
> Please cast your votes:
>
> [  ] +1 Accept Bigtop for incubation
> [  ] +0 Indifferent to Bigtop incubation
> [  ] -1 Reject Bigtop for incubation
>
> This vote will close 72 hours from now.
>
> Thanks,
> Tom
>
> = Bigtop - Apache Hadoop Ecosystem Packaging and Test =
>
> == Abstract ==
>
> Bigtop - a project for the development of packaging and tests of the
> Hadoop ecosystem.
>
> == Proposal ==
>
> The primary goal of Bigtop is to build a community around the
> packaging and interoperability testing of Hadoop-related projects.
> This includes testing at various levels (packaging, platform, runtime,
> upgrade, etc...) developed by a community with a focus on the system
> as a whole, rather than individual projects.
>
> Build, packaging and integration test code that depends upon official
> releases of the Apache Hadoop-related projects (HDFS, MapReduce,
> HBase, Hive, Pig, ZooKeeper, etc...) will be developed and released by
> this project. As bugs and other issues are found we expect these to be
> fixed upstream.
>
> == Background ==
>
> The initial packaging and test code for Bigtop was developed by
> Cloudera to package projects from the Apache Hadoop ecosystem and
> provide a consistent, inter-operable framework.
>
> == Rationale ==
>
> Hadoop defines itself as:
>
> {{{
> The Apache Hadoop project develops open-source software for reliable,
> scalable, distributed computing. Hadoop includes these subprojects:
>
> * Hadoop Common: The common utilities that support the other Hadoop 
> subprojects.
> * HDFS: A distributed file system that provides high throughput access
> to application data.
> * MapReduce: A software framework for distributed processing of large
> data sets on compute clusters.
> }}}
>
> There are also several other Hadoop-related projects at Apache.  Some
> TLP examples include HBase, Hive, Mahout, ZooKeeper, and Pig.  There
> are also several new projects in the Incubator such as HCatalog, Hama
> and Sqoop.
>
> From a packaging and deployment perspective, the current
> loosely-coupled nature of the project has limitations:
>  1. Insufficient building against trunk versions of dependent projects
> (in the style of Apache Gump).
>  1. Insufficient testing against the trunk versions of dependent projects.
>  1. No consistent packaging for the Linux servers which provide the
> main Hadoop datacenter platform.
>  1. No functional testing against multi-machine clusters as part of
> the regular automated build process. This is due to a lack of a
> physical or virtual Hadoop cluster for testing, and not enough test
> suites designed to run against a live cluster with known datasets.
>
> The intent of this project is to build a community where the projects
> are brought together, packaged, and tested for interoperability.
>
> Projects such as Apache Whirr (incubating), which deploy and use a
> collection of Hadoop-related projects, would benefit from the
> interoperability testing done by Bigtop, rather than picking and
> testing project combinations themselves.
>
> == Initial Goals ==
>
> Much of the code for Bigtop has been released by Cloudera under the
> Apache 2.0 license for over two years.
>
> Some current goals include:
>  * create a set of packages for the Hadoop ecosystem, over a wide
> range of platforms
>  * interoperability test these projects
>  * document project sets that are known to work well together
>
> Bigtop’s release artifact would consist of a single tarball of
> packaging and test code that, when built, would produce source and
> binary Linux packages for the upstream projects.
>
> = Current Status =
>
> == Meritocracy ==
>
> Bigtop was originally developed and released as an open source
> packaging infrastructure, CDH, by Cloudera.
>
> == Community ==
>
> The community is primarily the original developers at Cloudera,
> however a number of contributions to the packaging specifications have
> been accepted from outside contributors. Growing a diverse community
> is the main reason to bring Bigtop to the Apache Incubator.
>
> == Core Developers

[VOTE] Accept Bigtop for incubation

2011-06-17 Thread Tom White
ichael Stack, Tom White) and are familiar with
Apache principals and philosophy for community driven software
development.

== Alignment ==

We expect projects in Bigtop to be drawn from Hadoop and related
projects at Apache. Bigtop will complement these projects (Hadoop,
Pig, Hive, HBase, etc...) by providing an environment for contributors
interested in building more complex data processing pipelines to work
together integrating more than a single project into a well-tested
whole.

= Known Risks =

== Orphaned Products ==

The contributors are leading vendors of Hadoop-based technologies and
have a long standing in the Hadoop community. There is minimal risk of
this work becoming non-strategic and the contributors are confident
that a larger community will form within the project in a relatively
short space of time.

== Inexperience with Open Source ==

All code developed for Bigtop has been open sourced under the Apache
2.0 license. Most committers of Bigtop project are intimately familiar
with the Apache model for open-source development and are experienced
with working with new contributors.

== Homogeneous Developers ==

The initial set of committers is from a small set of organizations and
numerous existing Apache projects. We expect that once approved for
incubation, the project will attract new contributors from more
organizations and will thus grow organically.

== Reliance on Salaried Developers ==

It is expected that Bigtop will be developed on salaried and volunteer
time, although all of the initial developers will work on it mainly on
salaried time.

== Relationships with Other Apache Products ==

Bigtop depends upon other Apache Projects including Apache Hadoop,
Apache HBase, Apache Hive, Apache Pig, Apache Zookeeper, Apache
Thrift, Apache Avro, Apache Whirr. The build system uses Apache Ant
and Apache Maven.

== An Excessive Fascination with the Apache Brand ==

We would like Bigtop to become an Apache project to further foster a
healthy community of contributors and consumers around
interoperability, testing and packaging of Hadoop projects. Since
Bigtop directly interacts with many Apache Hadoop-related projects and
solves important problems of many Hadoop users, residing in the the
Apache Software Foundation will increase interaction with the larger
community.

= Documentation =

 * Bigtop will develop its own documentation detailing how to build,
test, install, configure and debug.

= Initial Source =

 * https://github.com/cloudera/bigtop

== Source and Intellectual Property Submission Plan ==

 * The initial source is already licensed under the Apache License, Version 2.0.

https://github.com/cloudera/bigtop

== External Dependencies ==

The required external dependencies are all Apache License or
compatible licenses.

== Cryptography ==

Bigtop doesn't use cryptography itself, however Hadoop projects use
standard APIs and tools for SSH and SSL communication where necessary.

= Required  Resources =

== Mailing lists ==

 * bigtop-private (with moderated subscriptions)
 * bigtop-dev
 * bigtop-commits
 * bigtop-user

== Subversion Directory ==

https://svn.apache.org/repos/asf/incubator/bigtop

== Issue Tracking ==

JIRA BIGTOP (Bigtop)

== Other Resources ==

The existing code already has unit and integration tests so we would
like a Jenkins instance to run them whenever a new patch is submitted.
This can be added after project creation.

To test RPM & deb install/uninstall and upgrade, it is useful to have
a set of Virtual Machine images in known states, and servers that can
bring them up. It should be possible to use Apache Whirr to
choreograph the VM setup/teardown, so these tests could be performed
against VMs on developer desktops or large scale VM-hosting platforms.
For the latter, VM hosting time would be appreciated.

= Initial Committers =

 * Alejandro Abdelnur (tucu at cloudera dot com)
 * Andre Arcilla (arcilla at yahoo-inc dot com)
 * Andrew Bayer (abayer at cloudera dot com)
 * Konstantin Boudnik (cos at apache dot org)
 * Eli Collins (eli at apache dot org)
 * Travis Crawford (travis at twitter dot com)
 * Bruno Mahé (bruno at cloudera dot com)
 * Alan Gates (gates at apache dot org)
 * Patrick Hunt (phunt at apache dot org)
 * Peter Linnell (plinnell at cloudera dot com)
 * Steve Loughran (stevel at apache dot org)
 * Owen O'Malley (omalley at apache dot org)
 * James Page (James.page at canonical dot com)
 * Roman Shaposhnik (rvs at cloudera dot com)
 * John Sichi (jvs at apache dot org)
 * Michael Stack (stack at apache dot org)
 * Tom White (tomwhite at apache dot org)
 * Andrei Savu (asavu at apache dot org)
 * Edward J. Yoon (edwardyoon at apache dot org)

= Affiliations =

 * Alejandro Abdelnur, Cloudera
 * Andre Arcilla, Yahoo! Inc.
 * Andrew Bayer, Cloudera
 * Konstantin Boudnik, free lancer
 * Eli Collins, Cloudera
 * Travis Crawford, Twitter
 * Bruno Mahé, Cloudera
 * Alan Gates, Yahoo!
 * Patrick Hunt, Cloudera
 * Peter Linnell, Cloudera
 * Steve Lo

Re: [PROPOSAL] Bigtop for the Apache Incubator

2011-06-16 Thread Tom White
Alan Gates has volunteered to be a committer and mentor, so I added
him to the proposal. (For some reason his email to this list didn't
come through.)

Thanks Alan!

Tom

On Mon, Jun 13, 2011 at 9:26 PM, Tom White  wrote:
> Hi,
>
> I would like to propose Bigtop to be an Apache Incubator project.
> Bigtop is a project for the development of packaging and tests of the
> Hadoop ecosystem. The goal is to do testing at various levels
> (packaging, platform, runtime, upgrade, etc...) developed by a
> community with a focus on the system as a whole, rather than
> individual projects.
>
> Here's a link to the proposal on the wiki
> http://wiki.apache.org/incubator/BigtopProposal
>
> I've also included the initial contents below.
>
> Cheers,
> Tom
>
> = Bigtop - Apache Hadoop Ecosystem Packaging and Test =
>
> == Abstract ==
>
> Bigtop - a project for the development of packaging and tests of the
> Hadoop ecosystem.
>
> == Proposal ==
>
> The primary goal of Bigtop is to build a community around the
> packaging and interoperability testing of Hadoop-related projects.
> This includes testing at various levels (packaging, platform, runtime,
> upgrade, etc...) developed by a community with a focus on the system
> as a whole, rather than individual projects.
>
> Build, packaging and integration test code that depends upon official
> releases of the Apache Hadoop-related projects (HDFS, MapReduce,
> HBase, Hive, Pig, ZooKeeper, etc...) will be developed and released by
> this project. As bugs and other issues are found we expect these to be
> fixed upstream.
>
> == Background ==
>
> The initial packaging and test code for Bigtop was developed by
> Cloudera to package projects from the Apache Hadoop ecosystem and
> provide a consistent, inter-operable framework.
>
> == Rationale ==
>
> Hadoop defines itself as:
>
> {{{
> The Apache Hadoop project develops open-source software for reliable,
> scalable, distributed computing. Hadoop includes these subprojects:
>
> * Hadoop Common: The common utilities that support the other Hadoop 
> subprojects.
> * HDFS: A distributed file system that provides high throughput access
> to application data.
> * MapReduce: A software framework for distributed processing of large
> data sets on compute clusters.
> }}}
>
> There are also several other Hadoop-related projects at Apache.  Some
> TLP examples include HBase, Hive, Mahout, ZooKeeper, and Pig.  There
> are also several new projects in the Incubator such as HCatalog and
> Sqoop.
>
> There is limited interoperability testing done the projects
> themselves. The intent of this project is to build a community where
> the projects are brought together, packaged, and tested for
> interoperability.
>
> Projects such as Apache Whirr (incubating), which deploy and use a
> collection of Hadoop-related projects, would benefit from the
> interoperability testing done by Bigtop, rather than picking and
> testing project combinations themselves.
>
> == Initial Goals ==
>
> Much of the code for Bigtop has been released by Cloudera under the
> Apache 2.0 license for over two years.
>
> Some current goals include:
>  * create a set of packages for the Hadoop ecosystem, over a wide
> range of platforms
>  * interoperability test these projects
>  * document project sets that are known to work well together
>
> Bigtop’s release artifact would consist of a single tarball of
> packaging and test code that, when built, would produce source and
> binary Linux packages for the upstream projects.
>
> = Current Status =
>
> == Meritocracy ==
>
> Bigtop was originally developed and released as an open source
> packaging infrastructure, CDH, by Cloudera.
>
> == Community ==
>
> The community is primarily the original developers at Cloudera,
> however a number of contributions to the packaging specifications have
> been accepted from outside contributors. Growing a diverse community
> is the main reason to bring Bigtop to the Apache Incubator.
>
> == Core Developers ==
>
> The core developers for Bigtop project are:
>  * Andrew Bayer has extensive expertise with build tools, specifically
> Jenkins continuous integration and Maven.
>  * Peter Linnell has contributed to the RPM packaging.
>  * Bruno Mahé has overseen much of the development of the RPM and
> Debian packaging system.
>  * Roman Shaposhnik designed and implemented the system testing framework.
>
> Many of the committers to the Bigtop project have contributed towards
> Hadoop or related Apache projects (Alejandro Abdelnur, Eli Collins,
> Patrick Hunt, John Sichi, Michael Stack, Tom White) and are familiar
> with Apache principals and 

Re: [PROPOSAL] Bigtop for the Apache Incubator

2011-06-16 Thread Tom White
Owen,

Thanks for offering to help out.

Cheers,
Tom

On Thu, Jun 16, 2011 at 7:13 AM, Owen O'Malley  wrote:
> On Mon, Jun 13, 2011 at 9:26 PM, Tom White  wrote:
>
>> Hi,
>>
>> I would like to propose Bigtop to be an Apache Incubator project.
>> Bigtop is a project for the development of packaging and tests of the
>> Hadoop ecosystem. The goal is to do testing at various levels
>> (packaging, platform, runtime, upgrade, etc...) developed by a
>> community with a focus on the system as a whole, rather than
>> individual projects.
>>
>
> I'd like to join as a committer and mentor.
>
> -- Owen
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Bigtop for the Apache Incubator

2011-06-15 Thread Tom White
Edward, Andrei - thanks for volunteering to help out as initial
committers. Please add your names to the proposal on the wiki.

Cheers,
Tom

On Wed, Jun 15, 2011 at 6:15 AM, Andrei Savu  wrote:
> I'm happy to see this project joining the Foundation. As an Apache
> Whirr committer I feel a lot of pain when it comes to testing
> compatibility for multiple services across multiple operating systems
> (and multiple cloud providers).
>
> I would love to join the Bigtop team as one of the initial committers
> an do some of testing work we are already doing in Whirr.
>
> PS: having ready to use .rpm and .deb packages sounds really great!
>
> Cheers,
>
> -- Andrei Savu / andreisavu.ro
>
> On Tue, Jun 14, 2011 at 7:26 AM, Tom White  wrote:
>> Hi,
>>
>> I would like to propose Bigtop to be an Apache Incubator project.
>> Bigtop is a project for the development of packaging and tests of the
>> Hadoop ecosystem. The goal is to do testing at various levels
>> (packaging, platform, runtime, upgrade, etc...) developed by a
>> community with a focus on the system as a whole, rather than
>> individual projects.
>>
>> Here's a link to the proposal on the wiki
>> http://wiki.apache.org/incubator/BigtopProposal
>>
>> I've also included the initial contents below.
>>
>> Cheers,
>> Tom
>>
>> = Bigtop - Apache Hadoop Ecosystem Packaging and Test =
>>
>> == Abstract ==
>>
>> Bigtop - a project for the development of packaging and tests of the
>> Hadoop ecosystem.
>>
>> == Proposal ==
>>
>> The primary goal of Bigtop is to build a community around the
>> packaging and interoperability testing of Hadoop-related projects.
>> This includes testing at various levels (packaging, platform, runtime,
>> upgrade, etc...) developed by a community with a focus on the system
>> as a whole, rather than individual projects.
>>
>> Build, packaging and integration test code that depends upon official
>> releases of the Apache Hadoop-related projects (HDFS, MapReduce,
>> HBase, Hive, Pig, ZooKeeper, etc...) will be developed and released by
>> this project. As bugs and other issues are found we expect these to be
>> fixed upstream.
>>
>> == Background ==
>>
>> The initial packaging and test code for Bigtop was developed by
>> Cloudera to package projects from the Apache Hadoop ecosystem and
>> provide a consistent, inter-operable framework.
>>
>> == Rationale ==
>>
>> Hadoop defines itself as:
>>
>> {{{
>> The Apache Hadoop project develops open-source software for reliable,
>> scalable, distributed computing. Hadoop includes these subprojects:
>>
>> * Hadoop Common: The common utilities that support the other Hadoop 
>> subprojects.
>> * HDFS: A distributed file system that provides high throughput access
>> to application data.
>> * MapReduce: A software framework for distributed processing of large
>> data sets on compute clusters.
>> }}}
>>
>> There are also several other Hadoop-related projects at Apache.  Some
>> TLP examples include HBase, Hive, Mahout, ZooKeeper, and Pig.  There
>> are also several new projects in the Incubator such as HCatalog and
>> Sqoop.
>>
>> There is limited interoperability testing done the projects
>> themselves. The intent of this project is to build a community where
>> the projects are brought together, packaged, and tested for
>> interoperability.
>>
>> Projects such as Apache Whirr (incubating), which deploy and use a
>> collection of Hadoop-related projects, would benefit from the
>> interoperability testing done by Bigtop, rather than picking and
>> testing project combinations themselves.
>>
>> == Initial Goals ==
>>
>> Much of the code for Bigtop has been released by Cloudera under the
>> Apache 2.0 license for over two years.
>>
>> Some current goals include:
>>  * create a set of packages for the Hadoop ecosystem, over a wide
>> range of platforms
>>  * interoperability test these projects
>>  * document project sets that are known to work well together
>>
>> Bigtop’s release artifact would consist of a single tarball of
>> packaging and test code that, when built, would produce source and
>> binary Linux packages for the upstream projects.
>>
>> = Current Status =
>>
>> == Meritocracy ==
>>
>> Bigtop was originally developed and released as an open source
>> packaging infrastructure, CDH, by Cloudera.
>>
>> == Community ==
>>
>> The community is primar

Re: [PROPOSAL] Bigtop for the Apache Incubator

2011-06-14 Thread Tom White
Steve,

Thanks for signing up.

Bigtop should not maintain branches or patchsets on top of upstream
projects. Any bugs or compatibility problems that are found will need
to be resolved in the upstream projects and then released in those
projects. This makes things harder for Bigtop, but, like you, I hope
having committers from the upstream projects involved in Bigtop will
give projects further impetus to make releases in a timely manner.

Cheers,
Tom

On Tue, Jun 14, 2011 at 4:51 AM, Steve Loughran  wrote:
> I've added more on the limitations of the current process (not synchronised
> releases, not enough automated testing on multiple-host clusters), and on a
> risk of the project: the upstream projects need to care about and work on
> more synchronized releases.
>
> Cloudera managed to address that problem by patching their forks of the
> apache projects, then push back the patches by way of JIRA patches. Bigtop
> should not get into the game of having its own branches of any of the ASF
> projects, instead it should focus on getting its changes into trunk of the
> various projects SVN trees, and on having near-synchronized releases of the
> various projects
>
> That requires
>  -engagement from the committer teams to get patches in. Having committers
> from the relevant projects in the BigTop project would address this.
>  -support from the PMCs to do releases on a regular schedule. That's hard,
> but something like a six-monthly schedule for the big projects with the up
> and coming projects releasing in a proper factor of that timescale (3, 2, 1
> month), could help.
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[PROPOSAL] Bigtop for the Apache Incubator

2011-06-13 Thread Tom White
Hi,

I would like to propose Bigtop to be an Apache Incubator project.
Bigtop is a project for the development of packaging and tests of the
Hadoop ecosystem. The goal is to do testing at various levels
(packaging, platform, runtime, upgrade, etc...) developed by a
community with a focus on the system as a whole, rather than
individual projects.

Here's a link to the proposal on the wiki
http://wiki.apache.org/incubator/BigtopProposal

I've also included the initial contents below.

Cheers,
Tom

= Bigtop - Apache Hadoop Ecosystem Packaging and Test =

== Abstract ==

Bigtop - a project for the development of packaging and tests of the
Hadoop ecosystem.

== Proposal ==

The primary goal of Bigtop is to build a community around the
packaging and interoperability testing of Hadoop-related projects.
This includes testing at various levels (packaging, platform, runtime,
upgrade, etc...) developed by a community with a focus on the system
as a whole, rather than individual projects.

Build, packaging and integration test code that depends upon official
releases of the Apache Hadoop-related projects (HDFS, MapReduce,
HBase, Hive, Pig, ZooKeeper, etc...) will be developed and released by
this project. As bugs and other issues are found we expect these to be
fixed upstream.

== Background ==

The initial packaging and test code for Bigtop was developed by
Cloudera to package projects from the Apache Hadoop ecosystem and
provide a consistent, inter-operable framework.

== Rationale ==

Hadoop defines itself as:

{{{
The Apache Hadoop project develops open-source software for reliable,
scalable, distributed computing. Hadoop includes these subprojects:

* Hadoop Common: The common utilities that support the other Hadoop subprojects.
* HDFS: A distributed file system that provides high throughput access
to application data.
* MapReduce: A software framework for distributed processing of large
data sets on compute clusters.
}}}

There are also several other Hadoop-related projects at Apache.  Some
TLP examples include HBase, Hive, Mahout, ZooKeeper, and Pig.  There
are also several new projects in the Incubator such as HCatalog and
Sqoop.

There is limited interoperability testing done the projects
themselves. The intent of this project is to build a community where
the projects are brought together, packaged, and tested for
interoperability.

Projects such as Apache Whirr (incubating), which deploy and use a
collection of Hadoop-related projects, would benefit from the
interoperability testing done by Bigtop, rather than picking and
testing project combinations themselves.

== Initial Goals ==

Much of the code for Bigtop has been released by Cloudera under the
Apache 2.0 license for over two years.

Some current goals include:
 * create a set of packages for the Hadoop ecosystem, over a wide
range of platforms
 * interoperability test these projects
 * document project sets that are known to work well together

Bigtop’s release artifact would consist of a single tarball of
packaging and test code that, when built, would produce source and
binary Linux packages for the upstream projects.

= Current Status =

== Meritocracy ==

Bigtop was originally developed and released as an open source
packaging infrastructure, CDH, by Cloudera.

== Community ==

The community is primarily the original developers at Cloudera,
however a number of contributions to the packaging specifications have
been accepted from outside contributors. Growing a diverse community
is the main reason to bring Bigtop to the Apache Incubator.

== Core Developers ==

The core developers for Bigtop project are:
 * Andrew Bayer has extensive expertise with build tools, specifically
Jenkins continuous integration and Maven.
 * Peter Linnell has contributed to the RPM packaging.
 * Bruno Mahé has overseen much of the development of the RPM and
Debian packaging system.
 * Roman Shaposhnik designed and implemented the system testing framework.

Many of the committers to the Bigtop project have contributed towards
Hadoop or related Apache projects (Alejandro Abdelnur, Eli Collins,
Patrick Hunt, John Sichi, Michael Stack, Tom White) and are familiar
with Apache principals and philosophy for community driven software
development.

== Alignment ==

We expect projects in Bigtop to be drawn from Hadoop and related
projects at Apache. Bigtop will complement these projects (Hadoop,
Pig, Hive, HBase, etc...) by providing an environment for contributors
interested in building more complex data processing pipelines to work
together integrating more than a single project into a well tested
whole.

= Known Risks =

== Orphaned Products ==

The contributors are leading vendors of Hadoop-based technologies and
have a long standing in the Hadoop community. There is minimal risk of
this work becoming non-strategic and the contributors are confident
that a larger community will form within the project in a relatively
short space of time.

== Inexperience with O

Re: [VOTE] [RESULT] Accept Sqoop for Incubation

2011-06-11 Thread Tom White
Thanks Arvind. I'll start going through the steps in
http://incubator.apache.org/guides/mentor.html#bootstrap over the
coming days.

Cheers,
Tom

On Sat, Jun 11, 2011 at 2:43 PM, arv...@cloudera.com
 wrote:
> With 19 +1 votes (11 binding), no -1 votes, and no 0 votes, the vote passes.
>
> Binding votes
>
>  Chris Mattmann
>  Sanjiva Weerawarana
>  Ralph Goers
>  Julien Vermillard
>  Mark Struberg
>  Tommaso Teofili
>  Leo Simons
>  Christian Grobmeier
>  Niall Pemberton
>  Patrick Hunt
>  Tom White
>
> Non-binding votes
>
>  Ioannis Canellos
>  Nigel Daley
>  Edward J. Yoon
>  Olivier Lamy
>  Steve Loughran
>  Phillip Rhodes
>  Eric Sammer
>  Michael McCandless
>
>
> The binding votes were counted based on the Incubator PMC membership
> list located at:
> http://people.apache.org/committers-by-project.html#incubator-pmc
>
> Thanks everyone who voted.
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Flume to join the Incubator.

2011-06-08 Thread Tom White
ct with several other Apache
> projects including Apache HBase and Apache Cassandra.
>
> Flume's functionality has some indirect or direct overlap with the
> functionality of Apache Chukwa but has several significant architectural
> diffferences.  Both systems can be used to collect log data to write to
> hdfs.  However, Chukwa's primary goals are the analytic and monitoring
> aspects of a Hadoop cluster.  Instead of focusing on analytics, Flume
> focuses primarily upon data transport and integration with a wide set of
> data sources and data destinations.   Architecturally, Chukwa components are
> individually and statically configured.  It also depends upon Hadoop
> MapReduce for its core functionality.  In contrast, Flume's components are
> dynamically and centrally configured and does not depend directly upon
> Hadoop MapReduce.  Furthermore, Flume provides a more general model for
> handling data and enables integration with projects such as Apache Hive,
> data stores such as Apache HBase, Apache Cassandra and Voldemort, and
> several Apache Lucene-related projects.
>
> == An Excessive Fascination with the Apache Brand ==
>
> We would like Flume to become an Apache project to further foster a healthy
> community of contributors and consumers around the project.  Since Flume
> directly interacts with many Apache Hadoop-related projects by solves an
> important problem of many Hadoop users, residing in the Apache Software
> Foundation will increase interaction with the larger community.
>
> = Documentation =
>
>  * All Flume documentation (User Guide, Developer Guide, Cookbook, and
> Windows Guide) is maintained within Flume sources and can be built directly.
>  * Cloudera provides documentation specific to its distribution of Flume at:
> http://archive.cloudera.com/cdh/3/flume/
>  * Flume wiki at GitHub: https://github.com/cloudera/flume/wiki
>  * Flume jira at Cloudera: https://issues.cloudera.org/browse/flume
>
> = Initial Source =
>
>  * https://github.com/cloudera/flume/tree/
>
> == Source and Intellectual Property Submission Plan ==
>
>  * The initial source is already licensed under the Apache License, Version
> 2.0. https://github.com/cloudera/flume/blob/master/LICENSE
>
> == External Dependencies ==
>
> The required external dependencies are all Apache License or compatible
> licenses. Following components with non-Apache licenses are enumerated:
>
>  * org.arabidopsis.ahocorasick : BSD-style
>
> Non-Apache build tools that are used by Flume are as follows:
>
>  * AsciiDoc: GNU GPLv2
>  * FindBugs: GNU LGPL
>  * Cobertura: GNU GPLv2
>  * PMD : BSD-style
>
> == Cryptography ==
>
> Flume uses standard APIs and tools for SSH and SSL communication where
> necessary.
>
> = Required  Resources =
>
> == Mailing lists ==
>
>  * flume-private (with moderated subscriptions)
>  * flume-dev
>  * flume-commits
>  * flume-user
>
> == Subversion Directory ==
>
> https://svn.apache.org/repos/asf/incubator/flume
>
> == Issue Tracking ==
>
> JIRA Flume (FLUME)
>
> == Other Resources ==
>
> The existing code already has unit and integration tests so we would like a
> Jenkins instance to run them whenever a new patch is submitted. This can be
> added after project creation.
>
> = Initial Committers =
>
>  * Andrew Bayer (abayer at cloudera dot com)
>  * Jonathan Hsieh (jon at cloudera dot com)
>  * Patrick Hunt (phunt at cloudera dot com)
>  * Aaron Kimball (akimball83 at gmail dot com)
>  * Bruce Mitchener (bruce.mitchener at gmail dot com)
>  * Arvind Prabhakar (arvind at cloudera dot com)
>  * Ahmed Radwan (ahmed at cloudera dot com)
>  * Henry Robinson (henry at cloudera dot com)
>  * Eric Sammer (esammer at cloudera dot com)
>  * Derek Deeter (ddeeterctrb at gmail dot com)
>
> = Affiliations =
>
>  * Andrew Bayer, Cloudera
>  * Jonathan Hsieh, Cloudera
>  * Patrick Hunt, Cloudera
>  * Aaron Kimball, Odiago
>  * Bruce Mitchener, Independent
>  * Arvind Prabhakar, Cloudera
>  * Ahmed Radwan, Cloudera
>  * Henry Robinson, Cloudera
>  * Eric Sammer, Cloudera
>  * Derek Deeter, Intuit
>
>
> = Sponsors =
>
> == Champion ==
>
>  * Nigel Daley
>
> == Nominated Mentors ==
>
>  * Tom White
>  * Nigel Daley
>  * Ralph Goers
>  * Patrick Hunt
>
> == Sponsoring Entity ==
>
>  * Apache Incubator PMC
>
>
> --
> // Jonathan Hsieh (shay)
> // Software Engineer, Cloudera
> // j...@cloudera.com
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Sqoop for Incubation

2011-06-08 Thread Tom White
+1

Tom

On Tue, Jun 7, 2011 at 8:39 PM, arv...@cloudera.com  wrote:
> As there are no active discussions on the [PROPOSAL] thread for a few
> days now, I will like to initiate the vote to accept Sqoop as an
> Apache Incubator project. The proposal discussion thread and full text
> of the proposal can be found at the following locations:
>
> Discussion Thread:
> http://www.mail-archive.com/general@incubator.apache.org/msg27726.html
> Proposal: http://wiki.apache.org/incubator/SqoopProposal
>
> Please cast your votes:
>
> [  ] +1 Accept Sqoop for incubation
> [  ] +0 Indifferent to Sqoop incubation
> [  ]  -1 Reject Sqoop for incubation
>
> This vote will close 72 hours from now.
>
> Thanks and Regards,
> Arvind Prabhakar
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] [RESULT] Release Whirr version 0.5.0-incubating (candidate 1)

2011-06-03 Thread Tom White
This vote has now passed with 3 binding +1s (2 from the dev list
vote), no +0's, and no -1's.

The binding votes were:

Stefan Seelmann, +1
Doug Cutting, +1 (on dev vote thread)
Tom White, +1 (on dev vote thread)

The non-binding votes (including those from the dev list vote) were as follows:

Andrei Savu, +1
Patrick Hunt, +0
Lars George, +1
Adrian Cole, +1
Johan Oskarsson, +1
Tibor Kiss, +1

I'll push the artifacts out.

Thanks to everyone who voted!

Cheers,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Whirr version 0.5.0-incubating (candidate 1)

2011-06-03 Thread Tom White
Thanks for taking a look, Stefan.

On Thu, Jun 2, 2011 at 9:18 AM, Stefan Seelmann  wrote:
> +1 (binding)
>
> Signature and checksums of source and binary tarball are ok.
> DISCLAIMER file is there.
> NOTICE and LICENSE files look ok.
> Build from sources worked under Linux (tests failed under MacOSX)
>
> Two notes for future releases:
> - The tarballs are signed with a 1024Bit DSA key. According to
> http://apache.org/dev/release-signing.html a 4096 bits key and SHA256
> should be used.

Thanks for the pointer - I'll update my key for next time.

> - Both, source and binary tarballs contain two log files in the cli folder.

This has already been fixed in trunk so it doesn't happen for the next release.

Cheers,
Tom

>
> Kind Regards,
> Stefan
>
>
> On Tue, May 31, 2011 at 7:23 AM, Tom White  wrote:
>> Please vote on the following release candidate for Apache Whirr,
>> version 0.5.0-incubating. We already received two binding IPMC +1
>> votes for the PPMC release vote on whirr-dev.
>>
>> PPMC release vote thread:
>> http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201105.mbox/%3cbanlktimd_d7ar611s46w6_bh6anuy78...@mail.gmail.com%3E
>>
>> It fixes the following issues:
>> https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12316248&styleName=Html&projectId=1230
>>
>> Note that we are voting upon the source (tag), binaries are provided
>> for convenience.
>>
>> Source and binary files:
>> http://people.apache.org/~tomwhite/whirr-0.5.0-incubating-candidate-1
>>
>> Maven staging repo:
>> https://repository.apache.org/content/repositories/orgapachewhirr-012
>>
>> The tag to be voted upon:
>> http://svn.apache.org/repos/asf/incubator/whirr/tags/release-0.5.0-incubating
>>
>> Whirr's KEYS file containing PGP keys we use to sign the release:
>> http://svn.apache.org/repos/asf/incubator/whirr/dist/KEYS
>>
>> The vote is open for 72 hours.
>>
>> [ ] +1
>> [ ] +0
>> [ ] -1
>>
>> Thanks,
>> Tom
>>
>> -
>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> For additional commands, e-mail: general-h...@incubator.apache.org
>>
>>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Release Whirr version 0.5.0-incubating (candidate 1)

2011-05-31 Thread Tom White
Please vote on the following release candidate for Apache Whirr,
version 0.5.0-incubating. We already received two binding IPMC +1
votes for the PPMC release vote on whirr-dev.

PPMC release vote thread:
http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201105.mbox/%3cbanlktimd_d7ar611s46w6_bh6anuy78...@mail.gmail.com%3E

It fixes the following issues:
https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12316248&styleName=Html&projectId=1230

Note that we are voting upon the source (tag), binaries are provided
for convenience.

Source and binary files:
http://people.apache.org/~tomwhite/whirr-0.5.0-incubating-candidate-1

Maven staging repo:
https://repository.apache.org/content/repositories/orgapachewhirr-012

The tag to be voted upon:
http://svn.apache.org/repos/asf/incubator/whirr/tags/release-0.5.0-incubating

Whirr's KEYS file containing PGP keys we use to sign the release:
http://svn.apache.org/repos/asf/incubator/whirr/dist/KEYS

The vote is open for 72 hours.

[ ] +1
[ ] +0
[ ] -1

Thanks,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Whirr version 0.4.0-incubating

2011-03-21 Thread Tom White
On Fri, Mar 18, 2011 at 6:28 PM, sebb  wrote:
> On 18 March 2011 22:02, Tom White  wrote:
>> On Fri, Mar 18, 2011 at 10:30 AM, sebb  wrote:
>>> On 18 March 2011 16:43, Stefan Bodewig  wrote:
>>>> On 2011-03-18, sebb wrote:
>>>>
>>>>> But the main issue is that the binary distribution contains lots of
>>>>> 3rd party products which are not mentioned in either the NOTICE file
>>>>> or the LICENSE file.
>>>>
>>>> They likely are supposed to be in the - unfortunately empty - license
>>>> files inside the lib directory.
>>>>
>>>>> Whether it requires attribution or not, 3rd party product licenses
>>>>> must be recorded in the LICENSE file.
>>
>> Right, we definitely need to fix this, per
>> http://incubator.apache.org/guides/releasemanagement.html#best-practice-license.
>>
>>>>
>>>>> The standard method is to include the text in the file, but it may be
>>>>> allowable to just include a pointer to the license elsewhere in the
>>>>> distribution.
>>>>
>>>> This pointer is missing, you are correct.
>>>>
>>>>> I think these issues are sufficient to block the release.
>>>>
>>>> Of the binary "convenience build".  If the whirr project wanted to
>>>> release the source tarball alone, the problems you have found wouldn't
>>>> apply.  The source tarball looks good to me.
>>>
>>> The NOTICE file includes attributions for two products that are not present.
>>> AIUI it's important that NOTICE only contains *required* attributions
>>> because the NOTICEs have to passed on to downstream users.
>>
>> Both products are present (jsr250-api-1.0.jar and jersey-core-1.4.jar)
>
> Sorry, it was not clear - I was referring to the source tarball, which
> does not contain any 3rd party libraries.
>
>> and these are the only ones that contain required attributions (as far
>> as I can tell). So I believe the NOTICE file is correct.
>
> It may be for the binary release, but if it is decided to release
> source only, it is not correct.

Whirr can't be used without these libraries, so it seems relevant to
include references to them in NOTICE. Or are you suggesting a
different NOTICE file for each distribution?

I've made all the other changes that you and Stefan suggested for the
next release candidate.

Thanks,
Tom

>
>>>
>>> The  divider lines should be removed (not a blocker) and
>>> the year should be updated.
>>>
>>> Also, the lib directory is full of licence files for products that are
>>> not present.
>>> It's not necessary for everything in SVN to be in the source archive,
>>> though everything in the source archive must be in SVN (or be
>>> derivable directly from it)
>>> This is confusing.
>>
>> It looks like the old LICENSE files in lib were mistakenly not removed
>> when the JAR versions were updated (e.g. guava-r06-LICENSE.txt ->
>> guava-r08-LICENSE.txt) or removed since the last release. I agree this
>> is confusing and they should be removed.
>>
>> Thanks for taking the time to check the release candidate.
>>
>> Cheers,
>> Tom
>>
>>>
>>>> Stefan
>>>>
>>>> -
>>>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>>>> For additional commands, e-mail: general-h...@incubator.apache.org
>>>>
>>>>
>>>
>>> -
>>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>>> For additional commands, e-mail: general-h...@incubator.apache.org
>>>
>>>
>>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Whirr version 0.4.0-incubating

2011-03-18 Thread Tom White
On Fri, Mar 18, 2011 at 10:30 AM, sebb  wrote:
> On 18 March 2011 16:43, Stefan Bodewig  wrote:
>> On 2011-03-18, sebb wrote:
>>
>>> But the main issue is that the binary distribution contains lots of
>>> 3rd party products which are not mentioned in either the NOTICE file
>>> or the LICENSE file.
>>
>> They likely are supposed to be in the - unfortunately empty - license
>> files inside the lib directory.
>>
>>> Whether it requires attribution or not, 3rd party product licenses
>>> must be recorded in the LICENSE file.

Right, we definitely need to fix this, per
http://incubator.apache.org/guides/releasemanagement.html#best-practice-license.

>>
>>> The standard method is to include the text in the file, but it may be
>>> allowable to just include a pointer to the license elsewhere in the
>>> distribution.
>>
>> This pointer is missing, you are correct.
>>
>>> I think these issues are sufficient to block the release.
>>
>> Of the binary "convenience build".  If the whirr project wanted to
>> release the source tarball alone, the problems you have found wouldn't
>> apply.  The source tarball looks good to me.
>
> The NOTICE file includes attributions for two products that are not present.
> AIUI it's important that NOTICE only contains *required* attributions
> because the NOTICEs have to passed on to downstream users.

Both products are present (jsr250-api-1.0.jar and jersey-core-1.4.jar)
and these are the only ones that contain required attributions (as far
as I can tell). So I believe the NOTICE file is correct.

>
> The  divider lines should be removed (not a blocker) and
> the year should be updated.
>
> Also, the lib directory is full of licence files for products that are
> not present.
> It's not necessary for everything in SVN to be in the source archive,
> though everything in the source archive must be in SVN (or be
> derivable directly from it)
> This is confusing.

It looks like the old LICENSE files in lib were mistakenly not removed
when the JAR versions were updated (e.g. guava-r06-LICENSE.txt ->
guava-r08-LICENSE.txt) or removed since the last release. I agree this
is confusing and they should be removed.

Thanks for taking the time to check the release candidate.

Cheers,
Tom

>
>> Stefan
>>
>> -
>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> For additional commands, e-mail: general-h...@incubator.apache.org
>>
>>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Whirr version 0.3.0-incubating

2011-01-29 Thread Tom White
With 3 +1's (binding votes from Doug Cutting and Tom White were made
on the whirr-dev vote thread) and no -1's the vote passes. I'll roll
out the release.

Thanks,
Tom

On Thu, Jan 27, 2011 at 1:04 PM, Steve Loughran  wrote:
> On 26/01/2011 05:30, Tom White wrote:
>>
>> This is the third incubator release for Apache Whirr, version
>> 0.3.0-incubating. We already received one binding IPMC +1 vote for the
>> PPMC release vote on whirr-dev, so are looking for two more.
>
>> The vote is open for 72 hours.
>>
>
> +1
>
> Ship it! Steve
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Release Whirr version 0.3.0-incubating

2011-01-25 Thread Tom White
This is the third incubator release for Apache Whirr, version
0.3.0-incubating. We already received one binding IPMC +1 vote for the
PPMC release vote on whirr-dev, so are looking for two more.

PPMC release vote thread:
http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201101.mbox/%3caanlktikdg83pbi+71pj0zcxs7pe_rs4musfavcwnx...@mail.gmail.com%3E

The issues fixed for 0.3.0-incubating
https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12315487&styleName=Html&projectId=1230

Source and binary files:
http://people.apache.org/~tomwhite/whirr-0.3.0-incubating-candidate-0

Maven staging repo:
https://repository.apache.org/content/repositories/orgapachewhirr-065

The tag to be voted upon:
http://svn.apache.org/repos/asf/incubator/whirr/tags/release-0.3.0-incubating

Whirr's KEYS file containing PGP keys we use to sign the release:
http://svn.apache.org/repos/asf/incubator/whirr/dist/KEYS

The vote is open for 72 hours.

[ ] +1
[ ] +0
[ ] -1

Thanks,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Mesos to enter the incubator

2010-12-21 Thread Tom White
m design and current features:
> http://mesos.berkeley.edu/mesos_tech_report.pdf (paper to appear at NSDI
> 2011 conference)
>
>
>
> = Initial Source =
>
> Mesos has been under development since spring 2009 by a team of graduate
> students and researchers. It is currently hosted on GitHub under a BSD
> license at http://github.com/mesos/mesos.
>
>
>
> = External Dependencies =
>
> The dependencies all have Apache compatible licenses, including BSD, MIT,
> Boost, and Apache 2.0.
>
>
>
> = Cryptography =
>
> Not applicable.
>
>
>
> = Required Resources =
>
> == Mailing Lists ==
>
>  * mesos-private for private PMC discussions (with moderated subscriptions)
>  * mesos-dev
>  * mesos-commits
>  * mesos-user
>
>
>
> == Subversion Directory ==
>
> https://svn.apache.org/repos/asf/incubator/mesos
>
>
>
> == Issue Tracking ==
>
> JIRA Mesos (MESOS)
>
>
>
> == Other Resources ==
>
> The existing code already has unit tests, so we would like a Hudson instance
> to run them whenever a new patch is submitted. This can be added after
> project
> creation.
>
>
>
> = Initial Committers =
>
>  * Ali Ghodsi (ali at sics dot se)
>  * Benjamin Hindman (benh at eecs dot berkeley dot edu)
>  * Andy Konwinski (andyk at eecs dot berkeley dot edu)
>  * Matei Zaharia (matei at apache dot org)
>
> A CLA is already on file for Matei Zaharia.
>
>
> = Affiliations =
>
>  * Ali Ghodsi (UC Berkeley / Swedish Institute of Computer Science)
>  * Benjamin Hindman (UC Berkeley)
>  * Andy Konwinski (UC Berkeley)
>  * Matei Zaharia (UC Berkeley)
>
>
>
> = Sponsors =
>
> == Champion ==
>
> Tom White
>
> == Nominated Mentors ==
>
>  * Dhruba Borthakur
>  * Brian McCallister
>  * Tom White
>
> == Sponsoring Entity ==
>
> Incubator PMC
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Mesos Project

2010-12-16 Thread Tom White
Thanks David. I think we can run a separate vote thread once this
proposal thread has run its course. Other recent proposal threads seem
to have general votes of support on them. As you say, this phase is a
time to respond to any questions that are raised, attract more
mentors, and so on.

Cheers,
Tom

On Wed, Dec 15, 2010 at 4:12 PM, David Crossley  wrote:
> David Crossley wrote:
>> The Proposal phase is for discussing the proposal,
>> encouraging people, attracting mentors, ensuring that
>> the proposal is ready.
>>
>> Then the Vote phase is for voting.
>
> Dunno what to do now that it has started :-)
> They gather steam, and we don't want to re-do it.
>
> Perhaps add "Vote" to the subject line and declare
> the end time, then tally everything at the end.
>
> -David
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Mesos Project

2010-12-15 Thread Tom White
sed in the Alignment section.
>
>
>
> = Documentation =
>
> Information about Mesos can be found at http://mesos.berkeley.edu.
> The following sources may be useful to start with:
>
>  * Documentation for GitHub release: http://github.com/mesos/mesos/wiki
>  * Presentation at Hadoop User Group: 
> http://www.cs.berkeley.edu/~matei/talks/2010/hug_mesos.pdf
>  * Tech report on system design and current features: 
> http://mesos.berkeley.edu/mesos_tech_report.pdf (paper to appear at NSDI 2011 
> conference)
>
>
>
> = Initial Source =
>
> Mesos has been under development since spring 2009 by a team of graduate
> students and researchers. It is currently hosted on GitHub under a BSD
> license at http://github.com/mesos/mesos.
>
>
>
> = External Dependencies =
>
> The dependencies all have Apache compatible licenses, including BSD, MIT,
> Boost, and Apache 2.0.
>
>
>
> = Cryptography =
>
> Not applicable.
>
>
>
> = Required Resources =
>
> == Mailing Lists ==
>
>  * mesos-private for private PMC discussions (with moderated subscriptions)
>  * mesos-dev
>  * mesos-commits
>  * mesos-user
>
>
>
> == Subversion Directory ==
>
> https://svn.apache.org/repos/asf/incubator/mesos
>
>
>
> == Issue Tracking ==
>
> JIRA Mesos (MESOS)
>
>
>
> == Other Resources ==
>
> The existing code already has unit tests, so we would like a Hudson instance
> to run them whenever a new patch is submitted. This can be added after project
> creation.
>
>
>
> = Initial Committers =
>
>  * Ali Ghodsi (ali at sics dot se)
>  * Benjamin Hindman (benh at eecs dot berkeley dot edu)
>  * Andy Konwinski (andyk at eecs dot berkeley dot edu)
>  * Matei Zaharia (matei at apache dot org)
>
> A CLA is already on file for Matei Zaharia.
>
>
> = Affiliations =
>
>  * Ali Ghodsi (UC Berkeley / Swedish Institute of Computer Science)
>  * Benjamin Hindman (UC Berkeley)
>  * Andy Konwinski (UC Berkeley)
>  * Matei Zaharia (UC Berkeley)
>
>
>
> = Sponsors =
>
> == Champion ==
>
> Tom White
>
> == Nominated Mentors ==
>
>  * Dhruba Borthakur
>  * Brian McCallister
>  * Tom White
>
> == Sponsoring Entity ==
>
> Incubator PMC
>
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Gora into the Apache Incubator

2010-09-22 Thread Tom White
 abstraction layer was orthogonal to the web crawler, and we
> decided to extract it to a separate project with much wider goals. Thus
> Gora, as a project, was born. The initial code is developed by Enis and
> Dogacan with ant.com’s sponsorship.
>
> The code can be found at http://github.com/enis/gora/.
>
> External Dependencies
> External dependencies excluding Apache projects are as follows
> *
> * JDOM - http://jdom.org/ -  Apache-style license
> *
> * SQL Builder - http://openhms.sourceforge.net/sqlbuilder/ - Artistic
> License, LGPL. SQL Builder is intended to be removed from the source due to
> technical reasons anyway.
> *
> * HSQLDB - http://hsqldb.org/ - BSD-style license
> *
> * JUnit - http://junit.org - Common Public License 1.0
> *
> * SLF4J - http://www.slf4j.org/ - MIT License
> *
> * Google Guava Libraries - http://code.google.com/p/guava-libraries/ -
> Apache License 2.0
>
> Required Resources
> Mailing Lists
> * gora-private (with moderated subscriptions)
> * gora-dev
> * gora-commits
> Subversion Directory
> * http://svn.apache.org/repos/asf/incubator/gora
>
> Issue Tracking
> * JIRA (GORA)
> Other Resources
> We need a wiki at http://wiki.apache.org. Currently, we have a wiki at
> Github, Since there is not a lot of pages there, we can manually move the
> pages to the wiki at wiki.apache.org.
>
> Initial Committers
> *    Name         email                  Affiliation   Timezone
> *    Enis Söztutar      enis [at] apache.org         Konneka             +3
> *    Doğacan Güney      dogacan [at] apache.org      Konneka             +3
> *    Sertan Alkan       sertanalkan [at] gmail.com   Konneka             +3
> *    Julien Nioche      jnioche [at] apache.org      DigitalPebble
> <http://wiki.apache.org/incubator/DigitalPebble>        +1
> *    Andrzej Bialecki   ab [at] apache.org           Sigram
> *    Andrew Hart        ahart [at] apache.org        NASA JPL            -8
> *    Dave Woollard      woollard [at] apache.org     NASA JPL            -8
> *    Henry Saputra      hsaputra [at] apache.org     Yahoo!              -8
>
>  Affiliations
>  All of the parties are affiliated with companies and organizations that are
>  familiar with the development of open source . Most of the original Gora
>  development was sponsored by ant.com, however we expect that the amount of
>  volunteer work will increase, and more developers will come on board.
>
>  Sponsors
>  Champion
>  * Chris Mattmann (mattmann AT apache DOT org)
>  Nominated Mentors
>  * Chris Mattmann (mattmann AT apache DOT org)
>  * Andrzej Bialecki (ab AT apache DOT org )
>  * Tom White (tomwhite AT apache DOT org)
>  Sponsoring Entity
>  Apache Incubator. Successful graduation can result in either being a TLP,
> or a subproject of  Hadoop, since most of the community is projected to
> overlap.
>
> ++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.mattm...@jpl.nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Whirr version 0.1.0-incubating

2010-09-18 Thread Tom White
With 3 binding +1 IPMC votes (Ant Elder, Doug Cutting, and Tom White)
and no 0 or -1 votes, the vote passes. I'll proceed with the final
steps of the release.

Thanks to everyone who helped with the release!

Tom

BTW I've opened https://issues.apache.org/jira/browse/WHIRR-91 to
address the improvements that Ant raised for the next release.

On Tue, Sep 14, 2010 at 11:19 AM, Tom White  wrote:
> This is the first incubator release for Apache Whirr, version
> 0.1.0-incubating. We already received one binding IPMC +1 vote for the
> PPMC release vote on whirr-dev, so are looking for two more.
>
> PPMC release vote thread:
> http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201009.mbox/%3caanlktinio1np6d+gbnm4w6jjcg-6koe7x8begkuxr...@mail.gmail.com%3e
>
> The issues fixed for 0.1.0-incubating
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12315111&styleName=Html&projectId=1230
>
> Source and binary files:
> http://people.apache.org/~tomwhite/whirr-releases/
>
> Maven staging repo:
> https://repository.apache.org/content/repositories/orgapachewhirr-009
>
> The tag to be voted upon:
> http://svn.apache.org/repos/asf/incubator/whirr/tags/release-0.1.0-incubating
>
> The vote is open for 72 hours.
>
> [ ] +1
> [ ] +0
> [ ] -1
>
> Thanks,
> Tom
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Gora to enter Incubator

2010-09-14 Thread Tom White
+1 Sounds very interesting. I'd be happy to help out as a mentor.

Cheers,
Tom

On Mon, Sep 13, 2010 at 6:10 AM, Enis Soztutar  wrote:
> Hi all,
>
> We would like to announce the Proposal for Gora, an ORM for Colum Stores,
> for the Apache Incubation. We believe that Gora can find a nice home at
> Apache.
>
> Wiki of the proposal can be found at
> http://wiki.apache.org/incubator/GoraProposal
>
> The proposal is as below.
>
>
> = Gora Proposal for Apache Incubation =
>
> == Abstract ==
> Gora is an ORM framework for column stores such as Apache HBase and Apache
> Cassandra with a specific focus on Hadoop.
>
> == Proposal ==
> Although there are various excellent ORM frameworks for relational
> databases, data modeling in NoSQL data stores differ profoundly from their
> relational cousins. Moreover, data-model agnostic frameworks such as JDO are
> not sufficient for use cases, where one needs to use the full power of the
> data models in column stores. Gora fills this gap by giving the user an
> easy-to-use ORM framework with data store specific mappings and built in
> Apache Hadoop support.
>
> The overall goal for Gora is to become the standard data representation and
> persistence framework for big data. The roadmap of Gora can be grouped as
> follows.
>
>  * Data Persistence : Persisting objects to Column stores such as HBase,
> Cassandra, Hypertable; key-value stores such as Voldermort, Redis, etc; SQL
> databases, such as MySQL, HSQLDB, flat files in local file system of Hadoop
> HDFS.
>  * Data Access : An easy to use Java-friendly common API for accessing the
> data regardless of its location.
>  * Indexing : Persisting objects to Lucene and Solr indexes,
> accessing/querying the data with Gora API.
>  * Analysis : Accesing the data and making analysis through adapters for
> Apache Pig, Apache Hive and Cascading
>  * MapReduce support : Out-of-the-box and extensive MapReduce (Apache
> Hadoop) support for data in the data store.
>
> == Background ==
> ORM stands for Object Relation Mapping. It is a technology which abstacts
> the persistency layer
> (mostly Relational Databases) so that plain domain level objects can be
> used, without the cumbersome effort to save/load the data to and from the
> database. Gora differs from current solutions in that:
>  * Gora is specially focussed at NoSQL data stores, but also has limited
> support for SQL databases
>  * The main use case for Gora is to access/analyze big data using Hadoop.
>  * Gora uses Avro for bean definition, not byte code enhancement or
> annotations
>  * Object-to-data store mappings are backend specific, so that full data
> model can be utilized.
>  * Gora is simple since it ignores complex SQL mappings
>  * Gora will support persistence, indexing and anaysis of data, using Pig,
> Lucene, Hive, etc
>
> == Rationale ==
> ORM frameworks are nothing new. But with the explosion of data generated in
> Terabytes and even Petabytes, NoSQL data stores are gaining ever-increasing
> popularity. Coupled with limited support to already-proven Apache Hadoop
> support in current ORM frameworks, there was a need for a new project.
>
> Gora is currently hosted at Github. However, Gora has ties to ASF in many
> ways. As detailed in the proposal section, Gora will be a high level client
> for many Apache projects and subprojects including Hadoop(common, hdfs, and
> mapreduce), HBase, Cassandra, Avro, Lucene, Solr, Pig, and Hive. Gora
> already uses Hadoop, HBase, Cassandra and Avro. Moreover, Gora started its
> life inside Apache Nutch project, and now Nutch trunk uses Gora as a
> library. Even more, the initial set of committers are all ASF members.
> Therefore, we think that Apache will be an excellent home for Gora.
>
> == Initial Goals ==
> Initial goals for Gora can be summarized as:
>  * Iron out the remaining issues with HBase, Cassandra and SQL support.
>  * Make the first release before the end of the year.
>  * Improve documentation
>  * Support for Cascading
>
> == Current Status ==
> === Meritocracy ===
> Current commit rights belong to the initial list of committers four of who
> are also ASF members. All the developers have extensive experience with
> Apache projects. We honor the meritocracy policy of ASF foundation.
>
> === Community ===
> Gora’s community mostly overlap with that of Nutch, Hadoop, HBase, Avro and
> Cassandra. We
> have a small community for now (5 initial committers, 18 people tracking the
> project at Github), but have been piggybacking the Nutch community for a
> while. If Gora is accepted to Apache Incubator, we expect more traction.
> Moreover, with the increasing popularity of NoSQL databases, we expect more
> users.
>
> === Core Developers ===
> Gora was started by the initial code base inside Apache Nutch by Doğacan
> Güney. Then Enis Söztutar has refactored and re-architected the project out
> of Nutch. Later Julien Nioche, Andrzej Bialecki and Doğacan has ported Nutch
> to use the newly formed project. Later, Sertan

Re: [PROPOSAL] Gora to enter Incubator

2010-09-14 Thread Tom White
I posted a little earlier volunteering to be a mentor, but it looks
like it may be in the moderation queue. Anyway, +1 to the proposal,
and happy to help out if you still need a mentor.

Cheers,
Tom

On Tue, Sep 14, 2010 at 2:44 PM, Mattmann, Chris A (388J)
 wrote:
> Hi Folks,
>
> FYI, if any mentors out there have free cycles and are interested, we are 
> looking for 1 more mentor to fulfill the Incubator mentor requirements.
>
> Thanks,
> Chris
>
>
>
> On 9/13/10 6:10 AM, "Enis Soztutar"  wrote:
>
> Hi all,
>
> We would like to announce the Proposal for Gora, an ORM for Colum Stores,
> for the Apache Incubation. We believe that Gora can find a nice home at
> Apache.
>
> Wiki of the proposal can be found at
> http://wiki.apache.org/incubator/GoraProposal
>
> The proposal is as below.
>
>
> = Gora Proposal for Apache Incubation =
>
> == Abstract ==
> Gora is an ORM framework for column stores such as Apache HBase and Apache
> Cassandra with a specific focus on Hadoop.
>
> == Proposal ==
> Although there are various excellent ORM frameworks for relational
> databases, data modeling in NoSQL data stores differ profoundly from their
> relational cousins. Moreover, data-model agnostic frameworks such as JDO are
> not sufficient for use cases, where one needs to use the full power of the
> data models in column stores. Gora fills this gap by giving the user an
> easy-to-use ORM framework with data store specific mappings and built in
> Apache Hadoop support.
>
> The overall goal for Gora is to become the standard data representation and
> persistence framework for big data. The roadmap of Gora can be grouped as
> follows.
>
>  * Data Persistence : Persisting objects to Column stores such as HBase,
> Cassandra, Hypertable; key-value stores such as Voldermort, Redis, etc; SQL
> databases, such as MySQL, HSQLDB, flat files in local file system of Hadoop
> HDFS.
>  * Data Access : An easy to use Java-friendly common API for accessing the
> data regardless of its location.
>  * Indexing : Persisting objects to Lucene and Solr indexes,
> accessing/querying the data with Gora API.
>  * Analysis : Accesing the data and making analysis through adapters for
> Apache Pig, Apache Hive and Cascading
>  * MapReduce support : Out-of-the-box and extensive MapReduce (Apache
> Hadoop) support for data in the data store.
>
> == Background ==
> ORM stands for Object Relation Mapping. It is a technology which abstacts
> the persistency layer
> (mostly Relational Databases) so that plain domain level objects can be
> used, without the cumbersome effort to save/load the data to and from the
> database. Gora differs from current solutions in that:
>  * Gora is specially focussed at NoSQL data stores, but also has limited
> support for SQL databases
>  * The main use case for Gora is to access/analyze big data using Hadoop.
>  * Gora uses Avro for bean definition, not byte code enhancement or
> annotations
>  * Object-to-data store mappings are backend specific, so that full data
> model can be utilized.
>  * Gora is simple since it ignores complex SQL mappings
>  * Gora will support persistence, indexing and anaysis of data, using Pig,
> Lucene, Hive, etc
>
> == Rationale ==
> ORM frameworks are nothing new. But with the explosion of data generated in
> Terabytes and even Petabytes, NoSQL data stores are gaining ever-increasing
> popularity. Coupled with limited support to already-proven Apache Hadoop
> support in current ORM frameworks, there was a need for a new project.
>
> Gora is currently hosted at Github. However, Gora has ties to ASF in many
> ways. As detailed in the proposal section, Gora will be a high level client
> for many Apache projects and subprojects including Hadoop(common, hdfs, and
> mapreduce), HBase, Cassandra, Avro, Lucene, Solr, Pig, and Hive. Gora
> already uses Hadoop, HBase, Cassandra and Avro. Moreover, Gora started its
> life inside Apache Nutch project, and now Nutch trunk uses Gora as a
> library. Even more, the initial set of committers are all ASF members.
> Therefore, we think that Apache will be an excellent home for Gora.
>
> == Initial Goals ==
> Initial goals for Gora can be summarized as:
>  * Iron out the remaining issues with HBase, Cassandra and SQL support.
>  * Make the first release before the end of the year.
>  * Improve documentation
>  * Support for Cascading
>
> == Current Status ==
> === Meritocracy ===
> Current commit rights belong to the initial list of committers four of who
> are also ASF members. All the developers have extensive experience with
> Apache projects. We honor the meritocracy policy of ASF foundation.
>
> === Community ===
> Gora’s community mostly overlap with that of Nutch, Hadoop, HBase, Avro and
> Cassandra. We
> have a small community for now (5 initial committers, 18 people tracking the
> project at Github), but have been piggybacking the Nutch community for a
> while. If Gora is accepted to Apache Incubator, we expect more traction.
> Moreover, with the incr

[VOTE] Release Whirr version 0.1.0-incubating

2010-09-14 Thread Tom White
This is the first incubator release for Apache Whirr, version
0.1.0-incubating. We already received one binding IPMC +1 vote for the
PPMC release vote on whirr-dev, so are looking for two more.

PPMC release vote thread:
http://mail-archives.apache.org/mod_mbox/incubator-whirr-dev/201009.mbox/%3caanlktinio1np6d+gbnm4w6jjcg-6koe7x8begkuxr...@mail.gmail.com%3e

The issues fixed for 0.1.0-incubating
https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12315111&styleName=Html&projectId=1230

Source and binary files:
http://people.apache.org/~tomwhite/whirr-releases/

Maven staging repo:
https://repository.apache.org/content/repositories/orgapachewhirr-009

The tag to be voted upon:
http://svn.apache.org/repos/asf/incubator/whirr/tags/release-0.1.0-incubating

The vote is open for 72 hours.

[ ] +1
[ ] +0
[ ] -1

Thanks,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Restored missing Bluesky/Clerezza parts of this month's report

2010-05-17 Thread Tom White
On Mon, May 17, 2010 at 1:35 PM, Bertrand Delacretaz
 wrote:
> Hi Noel,
>
> FYI I just fixed the report (although it said "closed") at
> http://wiki.apache.org/incubator/May2010 - the beginning of Clerezza
> report had been mistakenly deleted in revision 19.
>
> Looks like there are other inconsistencies in the wiki report, Amber,
> Hise and Whirr are missing but according to
> http://wiki.apache.org/incubator/ReportingSchedule they should be
> there.

I read the page to mean that the first report for Amber and Whirr
would be June, then July, August, before falling into a quarterly
schedule. Please say if I've got this wrong, so we can put together a
report for Whirr.

Cheers
Tom

>
> -Bertrand
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Whirr for Incubation

2010-05-11 Thread Tom White
The vote passes with 11 +1s (6 binding), zero 0s and zero -1s.

The next step is to create the podling and infrastructure.

Thanks,
Tom

On Mon, May 10, 2010 at 6:50 AM, Donald Woods  wrote:
>> [X] +1 Accept Whirr for incubation
>
>
> -Donald
>
>
> On 5/5/10 2:06 PM, Tom White wrote:
>> We've added three mentors since starting the proposal thread, so I
>> would like to start the vote to accept Whirr into the Apache
>> Incubator.
>>
>> The proposal is included below and is also at:
>> http://wiki.apache.org/incubator/WhirrProposal
>>
>> Please cast your votes:
>>
>> [ ] +1 Accept Whirr for incubation
>> [ ] +0 Don't care
>> [ ] -1 Reject for the following reason:
>>
>> Thanks,
>>
>> Tom
>>
>> = Whirr, a library of cloud services =
>>
>> == Abstract ==
>> Whirr will be a set of libraries for running cloud services.
>>
>> == Proposal ==
>> Whirr will provide code for running a variety of software services on
>> cloud infrastructure. It will provide bindings in several languages
>> (e.g. Python and Java) for popular cloud providers to make it easy to
>> start and stop services like Hadoop clusters. The project will not be
>> limited to a particular set of services, rather it will be expected
>> that a range of services are developed, as determined by the project
>> contributors. Possible services include Hadoop, HBase, !ZooKeeper,
>> Cassandra.
>>
>> == Background ==
>> The ability to run services on cloud providers is very useful,
>> particularly for proofs of concept, testing, and also ad hoc
>> production work. Bringing up clusters in the cloud is non-trivial,
>> since careful choreography is required. (Designing an interface that
>> is convenient as well as secure is also a challenge in a cloud
>> context.)  Making services that runs on a variety of cloud providers
>> is harder, even with the availability of libraries like libcloud and
>> jclouds, since each platform's quirks and extra features must be
>> considered (and either worked around, or possibly taken advantage of,
>> as appropriate) . Whirr will facilitate sharing of best practices,
>> both for a particular service (such as Hadoop configuration on a
>> particular provider), and for common cloud operations (such as
>> installation of dependencies across cloud providers). It will provide
>> a space to share good configurations and will encode service-specific
>> knowledge.
>>
>> == Rationale ==
>> There are already scripts in the Hadoop project that allow users to
>> run Hadoop clusters on Amazon EC2 and other cloud providers. While
>> users have found these scripts useful, their current home as a Hadoop
>> Common contrib project has the following limitations:
>>  * Tying the scripts' release cycle to Hadoop's means that it is
>> difficult to distribute updates to the scripts which are changing fast
>> (new features and bugfixes).
>>  * The scripts support multiple versions of Hadoop, so it makes more
>> sense to distribute them separately from Hadoop itself.
>>  * They are general: people want to contribute code for non-Hadoop
>> services like Cassandra (for example:
>> http://github.com/johanoskarsson/cassandra-ec2).
>>  * Having a uniform approach to running services in the cloud, hosted
>> in one project, makes launching sets of complementary services easier
>> for the user. Today, the scripts and libraries hosted within each
>> project (e.g. in Hadoop, HBase, Cassandra) have slightly different
>> conventions and semantics, and are likely to diverge over time.
>> Building a community around cloud infrastructure services will help
>> enforce a common approach to running services in the cloud.
>>
>> == Initial Goals ==
>>  * Provide a new home for the existing Hadoop cloud scripts.
>>  * Add more services (e.g. HBase)
>>  * Develop Java libraries for Hadoop clusters
>>  * Add new cloud providers by taking advantage of libcloud and jclouds.
>>  * (Future) Run on own hardware, so users can take advantage of the
>> same interface to control services running locally or in the cloud.
>>
>> == Current Status ==
>> === Meritocracy ===
>> The Hadoop scripts were originally created by Tom White, and have had
>> a substantial number of contributions from members of the Hadoop
>> community. By becoming its own project, significant contributors to
>> Whirr would become committers, and allow the project to grow.
>>
>> === Community ===
>> The community interested in cloud service infrastructur

Re: [VOTE] Accept Whirr for Incubation

2010-05-11 Thread Tom White
The vote passes with 11 +1s (6 binding), zero 0s and zero -1s.

The next step is to create the podling and infrastructure.

Thanks,
Tom

On Mon, May 10, 2010 at 6:50 AM, Donald Woods  wrote:
>> [X] +1 Accept Whirr for incubation
>
>
> -Donald
>
>
> On 5/5/10 2:06 PM, Tom White wrote:
>> We've added three mentors since starting the proposal thread, so I
>> would like to start the vote to accept Whirr into the Apache
>> Incubator.
>>
>> The proposal is included below and is also at:
>> http://wiki.apache.org/incubator/WhirrProposal
>>
>> Please cast your votes:
>>
>> [ ] +1 Accept Whirr for incubation
>> [ ] +0 Don't care
>> [ ] -1 Reject for the following reason:
>>
>> Thanks,
>>
>> Tom
>>
>> = Whirr, a library of cloud services =
>>
>> == Abstract ==
>> Whirr will be a set of libraries for running cloud services.
>>
>> == Proposal ==
>> Whirr will provide code for running a variety of software services on
>> cloud infrastructure. It will provide bindings in several languages
>> (e.g. Python and Java) for popular cloud providers to make it easy to
>> start and stop services like Hadoop clusters. The project will not be
>> limited to a particular set of services, rather it will be expected
>> that a range of services are developed, as determined by the project
>> contributors. Possible services include Hadoop, HBase, !ZooKeeper,
>> Cassandra.
>>
>> == Background ==
>> The ability to run services on cloud providers is very useful,
>> particularly for proofs of concept, testing, and also ad hoc
>> production work. Bringing up clusters in the cloud is non-trivial,
>> since careful choreography is required. (Designing an interface that
>> is convenient as well as secure is also a challenge in a cloud
>> context.)  Making services that runs on a variety of cloud providers
>> is harder, even with the availability of libraries like libcloud and
>> jclouds, since each platform's quirks and extra features must be
>> considered (and either worked around, or possibly taken advantage of,
>> as appropriate) . Whirr will facilitate sharing of best practices,
>> both for a particular service (such as Hadoop configuration on a
>> particular provider), and for common cloud operations (such as
>> installation of dependencies across cloud providers). It will provide
>> a space to share good configurations and will encode service-specific
>> knowledge.
>>
>> == Rationale ==
>> There are already scripts in the Hadoop project that allow users to
>> run Hadoop clusters on Amazon EC2 and other cloud providers. While
>> users have found these scripts useful, their current home as a Hadoop
>> Common contrib project has the following limitations:
>>  * Tying the scripts' release cycle to Hadoop's means that it is
>> difficult to distribute updates to the scripts which are changing fast
>> (new features and bugfixes).
>>  * The scripts support multiple versions of Hadoop, so it makes more
>> sense to distribute them separately from Hadoop itself.
>>  * They are general: people want to contribute code for non-Hadoop
>> services like Cassandra (for example:
>> http://github.com/johanoskarsson/cassandra-ec2).
>>  * Having a uniform approach to running services in the cloud, hosted
>> in one project, makes launching sets of complementary services easier
>> for the user. Today, the scripts and libraries hosted within each
>> project (e.g. in Hadoop, HBase, Cassandra) have slightly different
>> conventions and semantics, and are likely to diverge over time.
>> Building a community around cloud infrastructure services will help
>> enforce a common approach to running services in the cloud.
>>
>> == Initial Goals ==
>>  * Provide a new home for the existing Hadoop cloud scripts.
>>  * Add more services (e.g. HBase)
>>  * Develop Java libraries for Hadoop clusters
>>  * Add new cloud providers by taking advantage of libcloud and jclouds.
>>  * (Future) Run on own hardware, so users can take advantage of the
>> same interface to control services running locally or in the cloud.
>>
>> == Current Status ==
>> === Meritocracy ===
>> The Hadoop scripts were originally created by Tom White, and have had
>> a substantial number of contributions from members of the Hadoop
>> community. By becoming its own project, significant contributors to
>> Whirr would become committers, and allow the project to grow.
>>
>> === Community ===
>> The community interested in cloud service infrastructur

[VOTE] Accept Whirr for Incubation

2010-05-05 Thread Tom White
We've added three mentors since starting the proposal thread, so I
would like to start the vote to accept Whirr into the Apache
Incubator.

The proposal is included below and is also at:
http://wiki.apache.org/incubator/WhirrProposal

Please cast your votes:

[ ] +1 Accept Whirr for incubation
[ ] +0 Don't care
[ ] -1 Reject for the following reason:

Thanks,

Tom

= Whirr, a library of cloud services =

== Abstract ==
Whirr will be a set of libraries for running cloud services.

== Proposal ==
Whirr will provide code for running a variety of software services on
cloud infrastructure. It will provide bindings in several languages
(e.g. Python and Java) for popular cloud providers to make it easy to
start and stop services like Hadoop clusters. The project will not be
limited to a particular set of services, rather it will be expected
that a range of services are developed, as determined by the project
contributors. Possible services include Hadoop, HBase, !ZooKeeper,
Cassandra.

== Background ==
The ability to run services on cloud providers is very useful,
particularly for proofs of concept, testing, and also ad hoc
production work. Bringing up clusters in the cloud is non-trivial,
since careful choreography is required. (Designing an interface that
is convenient as well as secure is also a challenge in a cloud
context.)  Making services that runs on a variety of cloud providers
is harder, even with the availability of libraries like libcloud and
jclouds, since each platform's quirks and extra features must be
considered (and either worked around, or possibly taken advantage of,
as appropriate) . Whirr will facilitate sharing of best practices,
both for a particular service (such as Hadoop configuration on a
particular provider), and for common cloud operations (such as
installation of dependencies across cloud providers). It will provide
a space to share good configurations and will encode service-specific
knowledge.

== Rationale ==
There are already scripts in the Hadoop project that allow users to
run Hadoop clusters on Amazon EC2 and other cloud providers. While
users have found these scripts useful, their current home as a Hadoop
Common contrib project has the following limitations:
 * Tying the scripts' release cycle to Hadoop's means that it is
difficult to distribute updates to the scripts which are changing fast
(new features and bugfixes).
 * The scripts support multiple versions of Hadoop, so it makes more
sense to distribute them separately from Hadoop itself.
 * They are general: people want to contribute code for non-Hadoop
services like Cassandra (for example:
http://github.com/johanoskarsson/cassandra-ec2).
 * Having a uniform approach to running services in the cloud, hosted
in one project, makes launching sets of complementary services easier
for the user. Today, the scripts and libraries hosted within each
project (e.g. in Hadoop, HBase, Cassandra) have slightly different
conventions and semantics, and are likely to diverge over time.
Building a community around cloud infrastructure services will help
enforce a common approach to running services in the cloud.

== Initial Goals ==
 * Provide a new home for the existing Hadoop cloud scripts.
 * Add more services (e.g. HBase)
 * Develop Java libraries for Hadoop clusters
 * Add new cloud providers by taking advantage of libcloud and jclouds.
 * (Future) Run on own hardware, so users can take advantage of the
same interface to control services running locally or in the cloud.

== Current Status ==
=== Meritocracy ===
The Hadoop scripts were originally created by Tom White, and have had
a substantial number of contributions from members of the Hadoop
community. By becoming its own project, significant contributors to
Whirr would become committers, and allow the project to grow.

=== Community ===
The community interested in cloud service infrastructure is currently
spread across many smaller projects, and one of the main goals of this
project is to build a vibrant community to share best practices and
build common infrastructure. For example, this project would provide a
home to facilitate collaboration between the groups of Hadoop and
HBase developers who are building cloud services.

=== Core developers ===
Tom White wrote most of the original code and is familiar with open
source and Apache-style development, being a Hadoop committer and an
ASF member. There have been a number of contributors who have provided
patches to these scripts over time. Andrew Purtell who created the
HBase cloud scripts is a HBase committer. Johan Oskarsson (Hadoop and
Cassandra committer) ported the scripts to Cassandra.

=== Alignment ===
Whirr complements libcloud, currently in the Incubator. Libcloud
provides multi-cloud provider support, while Whirr will provide
multi-service support in the cloud. Whirr will build cloud components
for several Apache projects, such as Hadoop, HBase, !ZooKeeper,
Cassandra, and hopefully more.

== Kn

Re: [PROPOSAL] Whirr Project

2010-04-29 Thread Tom White
Patrick,

That sounds great - thanks for volunteering. I've added you to the
initial committer list on the proposal page at
http://wiki.apache.org/incubator/WhirrProposal.

Steve Loughran has volunteered to be a mentor; I've added his name too.

I would like to send a vote on this proposal to the Incubator PMC in
the next few days, so if anyone else would like to mentor (and we'd
love to have more mentors), please sign up soon.

Thanks,
Tom

On Wed, Apr 28, 2010 at 11:13 PM, Patrick Hunt  wrote:
> Tom, I'd be happy to be a committer (or contributor) for Whirr. I'm very
> interested to see ZooKeeper supported. Also very interested in
> non-production uses (such as testing).
>
> Regards,
>
> Patrick
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Whirr Project

2010-04-23 Thread Tom White
Thanks for the advice, Niclas. I've sent a mail to the PMC private list.

Cheers,
Tom

On Thu, Apr 22, 2010 at 5:25 PM, Niclas Hedhman  wrote:
> On Fri, Apr 23, 2010 at 7:58 AM, Tom White  wrote:
>>> You do know any ASF member can sign up to be an incubator mentor, right? If
>>> I count correctly you have two on your list :)
>>
>> I didn't know that. I've added myself as a mentor.
>
> Just send a mail to PMC private list that you wish to be added to the
> PMC and subscribe to the private list. Then you are set to go.
>
>
> Cheers
> --
> Niclas Hedhman, Software Developer
> http://www.qi4j.org - New Energy for Java
>
> I  live here; http://tinyurl.com/2qq9er
> I  work here; http://tinyurl.com/2ymelc
> I relax here; http://tinyurl.com/2cgsug
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Whirr Project

2010-04-22 Thread Tom White
On Thu, Apr 22, 2010 at 3:46 PM, Leo Simons  wrote:
>
> ...and pasted inline below (as is customary). The proposal looks fine to me.
> Like you mention your initial group of committers is a bit small which is a
> risk but hey, cloud is hot, go build community :)

That's the plan.

> You do know any ASF member can sign up to be an incubator mentor, right? If
> I count correctly you have two on your list :)

I didn't know that. I've added myself as a mentor.

@Doug - thanks for volunteering to be a mentor - I've added you to the
wiki page too.

Cheers,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Whirr Project

2010-04-16 Thread Tom White
Hi Chris,

Thanks for your interest in the proposal. Using Whirr in other
projects, like OODT, is definitely a use case I'm interested in. Also
Solr would make a good service to include in Whirr.

Cheers,
Tom

On Thu, Apr 15, 2010 at 10:28 PM, Mattmann, Chris A (388J)
 wrote:
> Hi Tom,
>
> This sounds interesting. It would be interesting to look at e.g., how we 
> could take some of the OODT [1] configurations and use Whirr as a front-end 
> to run them in the cloud. I also know the folks in Solr are using Zookeeper, 
> so they may benefit from this project as well.
>
> I don't have a huge amount of time since I'm overcommitted at work, and with 
> 2 podlings I'm participating in (OODT and SIS), but my +1 to move forward 
> with this as it seems useful. I will try and monitor the lists and if I get 
> the time to help out at some point, I'll bug you on list ^_^
>
> Cheers,
> Chris
>
> [1] http://incubator.apache.org/projects/oodt.html
>
>
> On 4/15/10 2:42 PM, "Tom White"  wrote:
>
> I would like to propose Whirr as an incubator proposal.
>
> Whirr will be a set of libraries for running cloud services, such as
> Hadoop or Cassandra. The initial code (for Hadoop) is hosted as a
> Hadoop contrib module, but I believe it would flourish as its own
> project with its own community.
>
> The proposal is on the incubator wiki at
> http://wiki.apache.org/incubator/WhirrProposal.
>
> We look forward to hearing feedback and questions on the proposal.
> Also, we don't have any mentors yet, so please sign up if you are
> interested in helping out.
>
> Thanks,
> Tom
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>
>
>
> ++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.mattm...@jpl.nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[PROPOSAL] Whirr Project

2010-04-15 Thread Tom White
I would like to propose Whirr as an incubator proposal.

Whirr will be a set of libraries for running cloud services, such as
Hadoop or Cassandra. The initial code (for Hadoop) is hosted as a
Hadoop contrib module, but I believe it would flourish as its own
project with its own community.

The proposal is on the incubator wiki at
http://wiki.apache.org/incubator/WhirrProposal.

We look forward to hearing feedback and questions on the proposal.
Also, we don't have any mentors yet, so please sign up if you are
interested in helping out.

Thanks,
Tom

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org