+1(binding)
 Accept Spot into the Apache Incubator

Regards,
Uma

On 9/20/16, 11:15 AM, "Doug Cutting" <cutt...@apache.org> wrote:

>Following the discussion thread, I would like to call a vote on
>accepting Spot into the Apache Incubator.
>
>[] +1 Accept Spot into the Apache Incubator
>[] +0 Abstain.
>[] -1 Do not accept Spot into the Apache Incubator because ...
>
>This vote will run for the usual 72 hours.
>
>The proposal is attached, but you can also access it on the wiki:
>   https://wiki.apache.org/incubator/SpotProposal
>
>Thanks,
>
>Doug
>
>= SpotProposal =
>
>== Abstract ==
>
>Spot is an open source platform for network telemetry (packet, flow,
>and proxy at the moment) built on an open data model and Apache
>Hadoop.
>
>== Proposal ==
>
>Spot (formerly Open Network Insight, or ONI) is an open source
>solution for network telemetry (packet, flow, and proxy at the moment)
>built on an open data model and Apache Hadoop. It provides ingestion
>and transformation of binary data, scalable machine learning, and
>interactive visualization for identifying threats in network flows and
>DNS packets.
>
>Spot has a pluggable architecture that can accommodate multiple open
>data models. Although cybersecurity/network-intrusion analysis is the
>initial use case for Spot, we are actively encouraging the
>contribution of new models that will enable other adjacent
>applications, such as fraud detection or IT-operational analytics such
>as performance and health monitoring. Because these models are open,
>users maintain control of their own data.
>
>More information on Spot can be found at the existing project website
>at http://open-network-insight.org/.
>
>== Background ==
>
>It almost goes without saying that cybersecurity is an acute and
>paramount concern globally, for organizations of all types and
>sizes. Fortunately, thanks to the availability of massively scalable
>(in the PBs) data infrastructure, security professionals can now make
>authentically data-driven decisions about how they protect their
>assets. For example, records of network traffic, captured as network
>flows, are often stored and analyzed for use in network management,
>and this same information can provide valuable insights into network
>vulnerabilities.
>
>Cybersecurity is just one example, however: There are other examples
>of adjacent use cases, such as user fraud detection or IT-operations
>analytics, that would benefit from the combination of Spot
>functionality and PB-scale data sets for analysis.
>
>== Rationale ==
>
>Although cybersecurity is its initial use case/data model, Spot is
>intended to more generally tackle the dual challenges of facilitating
>the development of big data-driven analytic solutions, while helping
>vendors avoid having to create one/off infrastructure for each use
>case. Spot will eliminate issues related to vendor data models that
>create silos between solutions, and that make it difficult for users
>to consume these innovations from multiple vendors. In summary, Spot
>will accelerate the development of new massively scalable analytic
>applications that give users more flexibility, and more choices.
>
>As an initial effort, we are now seeking to build an ecosystem of
>developers, data scientists, and security professionals to make Spot
>the open, community-driven, cybersecurity platform standard it needs
>to become. By bringing Spot to Apache, we hope to galvanize these
>groups to cooperate in this highly matrixed effort, and to build a
>global, and diverse, Spot community.
>
>== Initial Goals ==
>
>Move the existing codebase, website, documentation, and mailing lists
>to Apache-hosted infrastructure Work with the infrastructure team to
>implement and approve our build and testing workflows in the context
>of the ASF Incremental development and releases per Apache guidelines
>
>== Current Status ==
>
>=== Releases ===
>
>Spot has undergone one public release (1.0). This initial release was
>not performed in the typical ASF fashion; we will adopt the ASF source
>release process upon joining the incubator.
>
>=== Source ===
>
>Spot¹s source, including core platform and associated submodules, is
>currently hosted in several GitHub repositories under the indicated
>licenses:
>
> * Core (Apache License 2.0)
> * Oni-ingest (Apache License 2.0)
> * Oni-ml (Apache License 2.0
> * Oni-oa (BSD & MIT)
> * Oni-setup (Apache License 2.0)
> * Oni-nfdump (BSD)
> * Oni-lda-c (GNU General Public License version 2)
>
>The repositories will be transitioned to Apache¹s git hosting during
>incubation.  Issues related to GPL code will be resolved during
>incubation.
>
>
>=== Issue Tracking ===
>
>Spot¹s bug and feature tracking is hosted on Github at:
>
> * https://github.com/Open-Network-Insight/open-network-insight/issues
>
>Issue tracking will be transitioned to Apache¹s JIRA instance during
>incubation.
>
>=== Code review ===
>
>Spot maintainers currently use ³LGTM² (Looks Good to Me) in comments
>on the code review to indicate acceptance, with at least three LGTMs
>required to approve the merge.
>
>=== Community discussion ===
>
>A Spot Slack channel is available at:
>
> * https://opennetworkinsights.slack.com/messages/general/ (Invites
>request via http://open-network-insight.org:3000/)
>
>Community discussion options will be expanded considerably when
>apache.org mailing lists are available.
>
>=== Meritocracy ===
>
>We intend to adhere to a meritocratic approach to electing new
>committers and PMC members. We also believe that contributions can
>come in forms other than just code. We will encourage contributions
>and participation of all types, and ensure that contributors are
>appropriately recognized and that PMC memberships are appropriately
>earned.
>
>=== Community ===
>
>Though Spot is a relatively new project, it has already seen promising
>adoption:
>
> * Intel is the original development sponsor for Spot.
> * Cloudera is strong advocate for open source cybersecurity solutions
>and Apache Hadoop, and a supporter of Spot.
> * Cloudwick¹s OAS cybersecurity solution is built on Spot.
> * Accenture¹s Cyber Intelligence Platform solution is built on Spot.
> * Centrify has announced its intention to contribute identity-based
>security features to Spot¹s network-intrusion detection data model.
> * Webroot has announced its intention to contribute endpoint-security
>functionality.
> * Cybraics has announced its intention to contribute network-security
>functionality.
> * Jask has announced its intention to contribute network-security
>functionality.
>
>As described in the ³Rationale² section, we believe that building on
>and expanding the Spot community will be a key aspect in its success.
>
>=== Core Developers ===
>
>Spot was initially developed as a project at Intel, and most of the
>contributions to date have been from developers employed by that
>company. By bringing Spot to Apache, we hope to diversify its
>developer community more rapidly.
>
>=== Alignment ===
>
>Spot is built on Apache Hadoop, Apache Kafka, and Apache Spark, and as
>more functionality is built out, integration with other Apache
>projects is very likely.
>
>== Known Risks ==
>
>=== Orphaned products ===
>
>The risk of Spot being abandoned is low. Intel has made substantial
>investments already, Cloudera has publicly expressed the importance of
>Spot as a ³killer app² for Apache Hadoop, and Cloudwick and Accenture
>both have offerings that are built on Spot/CDH.
>
>=== Inexperience with Open Source ===
>
>Most of Spot¹s initial committers have experience in open source
>development, although not necessarily within the ASF. Those Spot
>developers who have little open source experience or are not Apache
>committers are eager to learn ASF practices as a means of improving
>project governance and diversifying the developer community.
>
>=== Homogenous Developers ===
>
>As mentioned previously, Intel developers are mostly responsible for
>what Spot code exists, to date. As a benefit of ASF governance, we
>hope to scale-up contributions from new developers and community
>members and eventually, develop them into committers by adhering to
>Apache¹s meritocratic principles.
>
>=== Reliance on Salaried Developers ===
>
>To date, all Spot code has been written by salaried developers
>(chiefly employed by Intel).
>
>=== Relationships with Other Apache Products ===
>
>Spot is currently related to the following other Apache projects:
>
> * Apache Hadoop
> * Apache Spark
> * Apache Kafka
>
>We look forward to continuing to integrate and collaborate with these
>communities.
>
>=== A Excessive Fascination with the Apache Brand ===
>
>Although most (not all) of the initial committers are not currently
>Apache committers, they are resolved to learning, with the help of the
>more experienced committers/project mentors/champion, the Apache
>Way. We believe that adhering to these principles will be of great
>value with respect to meeting long-term project goals, including
>facilitating widespread adoption.
>
>== Documentation ==
>
>Spot functionality is divided into different repositories, with each
>repository containing the relevant developer documentation:
>
> * oni-ingest
> * oni-ml
> * oni-oa
> * oni-setup
> * oni-nfdump
> * oni-lda-c
>
>An Installation Guide is published in the project wiki:
> * https://github.com/Open-Network-Insight/open-network-insight/wiki
>The Spot (currently Open Network Insight) website is managed via a
>Wordpress instance hosted by Bluehost:
> * http://open-network-insight.org/
>A Docker-based demo is available via Docker Hub:
> * https://hub.docker.com/r/opennetworkinsight/oni-demo/
>
>== Initial Source ==
>
>The Spot codebase is currently hosted on GitHub and will be
>transitioned to the ASF repositories during incubation. Spot and its
>submodules are currently licensed under several different licenses.
>
>No trademarks or domain names for Spot have been registered to date,
>and it will be up to the ASF¹s discretion to do so. The project¹s
>current website at open-network-insight.org will be redirected to
>spot.incubator.apache.org during incubation.
>
>Some portions of the code are imported from other open source projects
>under the Apache 2.0, BSD, or MIT licenses.
>
>== External Dependencies ==
>
>The full set of dependencies and licenses are:
> * Jupyter: BSD
> * D3js: BSD
> * Nfdump: BSD
> * Wireshark: GNU General Public License version 2
> * Apache Hadoop: Apache License 2.0
> * Apache Spark: Apache License 2.0
> * JQuery: MIT
> * ReactJS: BSD
> * Bootstrap: MIT
>
>Issues related to GPL dependencies will be resolved during incubation.
>
>== Cryptography ==
>
>Spot does not currently include any cryptography-related code.
>
>== Required Resources ==
>
>=== Developer and user mailing lists ===
>
> * priv...@spot.incubator.apache.org (PMC)
> * comm...@spot.incubator.apache.org (git push emails)
> * iss...@spot.incubator.apache.org (JIRA issue feed)
> * d...@spot.incubator.apache.org (code reviews plus dev discussion)
> * u...@spot.incubator.apache.org (user questions)
>
>=== Repository ===
>
> * git://git.apache.org/spot
>
>=== Issue Tracker ===
>
>We would like to import our current JIRA project into the ASF JIRA,
>such that our historical commit messages and code comments continue to
>reference the appropriate bug numbers.
>
>== Initial Committers ==
>
> * Grant Babb
> * Ricardo Barona
> * Cesar Berho
> * Jarek Jarcec Cecho
> * Michael Czerny
> * Nick Gamb
> * Sai Ganji
> * Gabriela Lima Garza
> * Victor Gonzalez
> * Mark Grover
> * Morris Hicks
> * Ritu Kama
> * Austin Leahy
> * Ashrith Mekala
> * Diego Ortiz
> * Sudharshan Rao PakalaSai
> * Srinivasa Reddy
> * Alan Ross
> * Everardo Lopez Sandoval
> * Nathan Segerlind
> * Vartika Singh
> * Nathanael Smith
> * Carlos Villavicencio
>
>== Affiliations ==
>
> * Grant Babb: Jask
> * Ricardo Barona : Intel
> * Cesar Berho: Intel
> * Jarek Jarcec Cecho: StreamSets
> * Michael Czerny: Cybraics
> * Nick Gamb: Centrify
> * Sai Ganji: Cloudwick
> * Gabriela Lima Garza: Intel
> * Victor Gonzalez: Intel
> * Mark Grover: Cloudera
> * Morris Hicks: Cloudera
> * Ritu Kama: Intel
> * Austin Leahy: eBay
> * Ashrith Mekala: Cloudwick
> * Diego Ortiz: Intel
> * Sudharshan Rao PakalaSai: Cloudwick
> * Srinivasa Reddy: Cloudera
> * Alan Ross: Intel
> * Everardo Lopez Sandoval: Intel
> * Nathan Segerlind: Intel
> * Vartika Singh: Cloudera
> * Nathanael Smith: Intel
> * Carlos Villavicencio: Intel
>
>== Sponsors ==
>
>=== Champion ===
>
> * Doug Cutting - Cloudera
>
>=== Nominated Mentors ===
>
> * Brock Noland - ASF Member, phData
> * Jarek Jarcec Cecho - ASF Member, StreamSets
> * Andrei Savu - Cloudera
> * Uma Maheswara Rao G - Intel
>
>=== Sponsoring Entity ===
>
>The Apache Incubator.
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>For additional commands, e-mail: general-h...@incubator.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org

Reply via email to