+1 Sent from my iPhone
> On Sep 22, 2016, at 12:57 AM, Tom White <tom.e.wh...@gmail.com> wrote: > > +1 > > Tom > >> On Tue, Sep 20, 2016 at 7:15 PM, Doug Cutting <cutt...@apache.org> wrote: >> Following the discussion thread, I would like to call a vote on >> accepting Spot into the Apache Incubator. >> >> [] +1 Accept Spot into the Apache Incubator >> [] +0 Abstain. >> [] -1 Do not accept Spot into the Apache Incubator because ... >> >> This vote will run for the usual 72 hours. >> >> The proposal is attached, but you can also access it on the wiki: >> https://wiki.apache.org/incubator/SpotProposal >> >> Thanks, >> >> Doug >> >> = SpotProposal = >> >> == Abstract == >> >> Spot is an open source platform for network telemetry (packet, flow, >> and proxy at the moment) built on an open data model and Apache >> Hadoop. >> >> == Proposal == >> >> Spot (formerly Open Network Insight, or ONI) is an open source >> solution for network telemetry (packet, flow, and proxy at the moment) >> built on an open data model and Apache Hadoop. It provides ingestion >> and transformation of binary data, scalable machine learning, and >> interactive visualization for identifying threats in network flows and >> DNS packets. >> >> Spot has a pluggable architecture that can accommodate multiple open >> data models. Although cybersecurity/network-intrusion analysis is the >> initial use case for Spot, we are actively encouraging the >> contribution of new models that will enable other adjacent >> applications, such as fraud detection or IT-operational analytics such >> as performance and health monitoring. Because these models are open, >> users maintain control of their own data. >> >> More information on Spot can be found at the existing project website >> at http://open-network-insight.org/. >> >> == Background == >> >> It almost goes without saying that cybersecurity is an acute and >> paramount concern globally, for organizations of all types and >> sizes. Fortunately, thanks to the availability of massively scalable >> (in the PBs) data infrastructure, security professionals can now make >> authentically data-driven decisions about how they protect their >> assets. For example, records of network traffic, captured as network >> flows, are often stored and analyzed for use in network management, >> and this same information can provide valuable insights into network >> vulnerabilities. >> >> Cybersecurity is just one example, however: There are other examples >> of adjacent use cases, such as user fraud detection or IT-operations >> analytics, that would benefit from the combination of Spot >> functionality and PB-scale data sets for analysis. >> >> == Rationale == >> >> Although cybersecurity is its initial use case/data model, Spot is >> intended to more generally tackle the dual challenges of facilitating >> the development of big data-driven analytic solutions, while helping >> vendors avoid having to create one/off infrastructure for each use >> case. Spot will eliminate issues related to vendor data models that >> create silos between solutions, and that make it difficult for users >> to consume these innovations from multiple vendors. In summary, Spot >> will accelerate the development of new massively scalable analytic >> applications that give users more flexibility, and more choices. >> >> As an initial effort, we are now seeking to build an ecosystem of >> developers, data scientists, and security professionals to make Spot >> the open, community-driven, cybersecurity platform standard it needs >> to become. By bringing Spot to Apache, we hope to galvanize these >> groups to cooperate in this highly matrixed effort, and to build a >> global, and diverse, Spot community. >> >> == Initial Goals == >> >> Move the existing codebase, website, documentation, and mailing lists >> to Apache-hosted infrastructure Work with the infrastructure team to >> implement and approve our build and testing workflows in the context >> of the ASF Incremental development and releases per Apache guidelines >> >> == Current Status == >> >> === Releases === >> >> Spot has undergone one public release (1.0). This initial release was >> not performed in the typical ASF fashion; we will adopt the ASF source >> release process upon joining the incubator. >> >> === Source === >> >> Spot’s source, including core platform and associated submodules, is >> currently hosted in several GitHub repositories under the indicated >> licenses: >> >> * Core (Apache License 2.0) >> * Oni-ingest (Apache License 2.0) >> * Oni-ml (Apache License 2.0 >> * Oni-oa (BSD & MIT) >> * Oni-setup (Apache License 2.0) >> * Oni-nfdump (BSD) >> * Oni-lda-c (GNU General Public License version 2) >> >> The repositories will be transitioned to Apache’s git hosting during >> incubation. Issues related to GPL code will be resolved during >> incubation. >> >> >> === Issue Tracking === >> >> Spot’s bug and feature tracking is hosted on Github at: >> >> * https://github.com/Open-Network-Insight/open-network-insight/issues >> >> Issue tracking will be transitioned to Apache’s JIRA instance during >> incubation. >> >> === Code review === >> >> Spot maintainers currently use “LGTM” (Looks Good to Me) in comments >> on the code review to indicate acceptance, with at least three LGTMs >> required to approve the merge. >> >> === Community discussion === >> >> A Spot Slack channel is available at: >> >> * https://opennetworkinsights.slack.com/messages/general/ (Invites >> request via http://open-network-insight.org:3000/) >> >> Community discussion options will be expanded considerably when >> apache.org mailing lists are available. >> >> === Meritocracy === >> >> We intend to adhere to a meritocratic approach to electing new >> committers and PMC members. We also believe that contributions can >> come in forms other than just code. We will encourage contributions >> and participation of all types, and ensure that contributors are >> appropriately recognized and that PMC memberships are appropriately >> earned. >> >> === Community === >> >> Though Spot is a relatively new project, it has already seen promising >> adoption: >> >> * Intel is the original development sponsor for Spot. >> * Cloudera is strong advocate for open source cybersecurity solutions >> and Apache Hadoop, and a supporter of Spot. >> * Cloudwick’s OAS cybersecurity solution is built on Spot. >> * Accenture’s Cyber Intelligence Platform solution is built on Spot. >> * Centrify has announced its intention to contribute identity-based >> security features to Spot’s network-intrusion detection data model. >> * Webroot has announced its intention to contribute endpoint-security >> functionality. >> * Cybraics has announced its intention to contribute network-security >> functionality. >> * Jask has announced its intention to contribute network-security >> functionality. >> >> As described in the “Rationale” section, we believe that building on >> and expanding the Spot community will be a key aspect in its success. >> >> === Core Developers === >> >> Spot was initially developed as a project at Intel, and most of the >> contributions to date have been from developers employed by that >> company. By bringing Spot to Apache, we hope to diversify its >> developer community more rapidly. >> >> === Alignment === >> >> Spot is built on Apache Hadoop, Apache Kafka, and Apache Spark, and as >> more functionality is built out, integration with other Apache >> projects is very likely. >> >> == Known Risks == >> >> === Orphaned products === >> >> The risk of Spot being abandoned is low. Intel has made substantial >> investments already, Cloudera has publicly expressed the importance of >> Spot as a “killer app” for Apache Hadoop, and Cloudwick and Accenture >> both have offerings that are built on Spot/CDH. >> >> === Inexperience with Open Source === >> >> Most of Spot’s initial committers have experience in open source >> development, although not necessarily within the ASF. Those Spot >> developers who have little open source experience or are not Apache >> committers are eager to learn ASF practices as a means of improving >> project governance and diversifying the developer community. >> >> === Homogenous Developers === >> >> As mentioned previously, Intel developers are mostly responsible for >> what Spot code exists, to date. As a benefit of ASF governance, we >> hope to scale-up contributions from new developers and community >> members and eventually, develop them into committers by adhering to >> Apache’s meritocratic principles. >> >> === Reliance on Salaried Developers === >> >> To date, all Spot code has been written by salaried developers >> (chiefly employed by Intel). >> >> === Relationships with Other Apache Products === >> >> Spot is currently related to the following other Apache projects: >> >> * Apache Hadoop >> * Apache Spark >> * Apache Kafka >> >> We look forward to continuing to integrate and collaborate with these >> communities. >> >> === A Excessive Fascination with the Apache Brand === >> >> Although most (not all) of the initial committers are not currently >> Apache committers, they are resolved to learning, with the help of the >> more experienced committers/project mentors/champion, the Apache >> Way. We believe that adhering to these principles will be of great >> value with respect to meeting long-term project goals, including >> facilitating widespread adoption. >> >> == Documentation == >> >> Spot functionality is divided into different repositories, with each >> repository containing the relevant developer documentation: >> >> * oni-ingest >> * oni-ml >> * oni-oa >> * oni-setup >> * oni-nfdump >> * oni-lda-c >> >> An Installation Guide is published in the project wiki: >> * https://github.com/Open-Network-Insight/open-network-insight/wiki >> The Spot (currently Open Network Insight) website is managed via a >> Wordpress instance hosted by Bluehost: >> * http://open-network-insight.org/ >> A Docker-based demo is available via Docker Hub: >> * https://hub.docker.com/r/opennetworkinsight/oni-demo/ >> >> == Initial Source == >> >> The Spot codebase is currently hosted on GitHub and will be >> transitioned to the ASF repositories during incubation. Spot and its >> submodules are currently licensed under several different licenses. >> >> No trademarks or domain names for Spot have been registered to date, >> and it will be up to the ASF’s discretion to do so. The project’s >> current website at open-network-insight.org will be redirected to >> spot.incubator.apache.org during incubation. >> >> Some portions of the code are imported from other open source projects >> under the Apache 2.0, BSD, or MIT licenses. >> >> == External Dependencies == >> >> The full set of dependencies and licenses are: >> * Jupyter: BSD >> * D3js: BSD >> * Nfdump: BSD >> * Wireshark: GNU General Public License version 2 >> * Apache Hadoop: Apache License 2.0 >> * Apache Spark: Apache License 2.0 >> * JQuery: MIT >> * ReactJS: BSD >> * Bootstrap: MIT >> >> Issues related to GPL dependencies will be resolved during incubation. >> >> == Cryptography == >> >> Spot does not currently include any cryptography-related code. >> >> == Required Resources == >> >> === Developer and user mailing lists === >> >> * priv...@spot.incubator.apache.org (PMC) >> * comm...@spot.incubator.apache.org (git push emails) >> * iss...@spot.incubator.apache.org (JIRA issue feed) >> * d...@spot.incubator.apache.org (code reviews plus dev discussion) >> * u...@spot.incubator.apache.org (user questions) >> >> === Repository === >> >> * git://git.apache.org/spot >> >> === Issue Tracker === >> >> We would like to import our current JIRA project into the ASF JIRA, >> such that our historical commit messages and code comments continue to >> reference the appropriate bug numbers. >> >> == Initial Committers == >> >> * Grant Babb >> * Ricardo Barona >> * Cesar Berho >> * Jarek Jarcec Cecho >> * Michael Czerny >> * Nick Gamb >> * Sai Ganji >> * Gabriela Lima Garza >> * Victor Gonzalez >> * Mark Grover >> * Morris Hicks >> * Ritu Kama >> * Austin Leahy >> * Ashrith Mekala >> * Diego Ortiz >> * Sudharshan Rao PakalaSai >> * Srinivasa Reddy >> * Alan Ross >> * Everardo Lopez Sandoval >> * Nathan Segerlind >> * Vartika Singh >> * Nathanael Smith >> * Carlos Villavicencio >> >> == Affiliations == >> >> * Grant Babb: Jask >> * Ricardo Barona : Intel >> * Cesar Berho: Intel >> * Jarek Jarcec Cecho: StreamSets >> * Michael Czerny: Cybraics >> * Nick Gamb: Centrify >> * Sai Ganji: Cloudwick >> * Gabriela Lima Garza: Intel >> * Victor Gonzalez: Intel >> * Mark Grover: Cloudera >> * Morris Hicks: Cloudera >> * Ritu Kama: Intel >> * Austin Leahy: eBay >> * Ashrith Mekala: Cloudwick >> * Diego Ortiz: Intel >> * Sudharshan Rao PakalaSai: Cloudwick >> * Srinivasa Reddy: Cloudera >> * Alan Ross: Intel >> * Everardo Lopez Sandoval: Intel >> * Nathan Segerlind: Intel >> * Vartika Singh: Cloudera >> * Nathanael Smith: Intel >> * Carlos Villavicencio: Intel >> >> == Sponsors == >> >> === Champion === >> >> * Doug Cutting - Cloudera >> >> === Nominated Mentors === >> >> * Brock Noland - ASF Member, phData >> * Jarek Jarcec Cecho - ASF Member, StreamSets >> * Andrei Savu - Cloudera >> * Uma Maheswara Rao G - Intel >> >> === Sponsoring Entity === >> >> The Apache Incubator. >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org >> For additional commands, e-mail: general-h...@incubator.apache.org > > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org