+1
On Wed, Aug 15, 2018 at 9:36 AM Dave Fisher <dave2w...@comcast.net> wrote: > +1 (binding) > > Sent from my iPhone > > > On Aug 15, 2018, at 9:27 AM, P. Taylor Goetz <ptgo...@apache.org> wrote: > > > > After a brief discussion [1] I would like to call a VOTE to accept DLab > into the Apache Incubator. The full proposal is available on the wiki[2] > and is pasted below in text form as well. > > > > This vote will run at least 72 hours. Please VOTE as follows: > > > > [ ] +1 Accept DLab into the Apache Incubator > > [ ] +0 No opinion > > [ ] -1 Do not accept DLab into the Apache Incubator because… > > > > -Taylor > > > > [1] > https://lists.apache.org/thread.html/9c96873d49f53da33260e21dc698f7c9b82eec256caf97a0e3f54943@%3Cgeneral.incubator.apache.org%3E > > [2] https://wiki.apache.org/incubator/DLabProposal > > > > > > = DLab Proposal = > > > > == Abstract == > > DLab is a platform for creating self-service, exploratory data science > environments in the cloud using best-of-breed data science tools. > > > > DLab includes a self-service web console, used to create and manage > exploratory environments. It allows teams to spin up analytical > environments with just a single click of a mouse. Once established, the > environment can be managed by an analytical team itself, leveraging simple > and easy-to-use web-based interface. > > > > == Proposal == > > In order to work effectively, data scientists rely on a varying suite of > analytics tools that are readily available. However, many of those tools > are non-trivial to set up in terms of hardware provisioning, software > installation, configuration, and deployment. Setting up a collaborative, > multi-tenant development environment for data scientists consumes > substantial IT and DevOps resources, as well as time. These factors often > combine to hinder the agility and effectiveness of data science teams > within an organization. Current solutions are largely closed source and/or > proprietary, and committing to a given solution introduces the potential > for vendor lock-in. > > > > EPAM Systems developed DLab in response to the lack of open source, > permissibly licensed solutions to better enable data science workflows. The > ALv2 was selected to encourage open development and user adoption. DLab was > open sourced on Dec 29, 2016 and is under active development with support > from EPAM Systems. > > > > We believe DLab is a unique solution with no current open source > equivalent. Our primary goals of incubation are to grow and diversify the > DLab community to ensure its long-term sustainability. > > > > == Rationale == > > DLab is a platform that provides data scientists with the ability to > self-provision, without IT support, exploratory and production environments > with their preferred set of tools installed and pre-configured. Tool > options include, but are not limited to: > > > > * Apache Spark > > * Apache Flink (planned) > > * Apache Zeppelin > > * Jupyter > > * TensorFlow + Jupyter > > * Deep Learning + Jupyter > > > > DLab leverages cloud computing providers for virtual hardware > provisioning and currently supports the following: > > > > * Amazon Web Services (AWS) > > * Microsoft Azure > > * Google Compute Platform (GCP) (under development) > > > > DLab offers git-based collaboration tools for data scientists and > developers and integrates with the following git service providers: > > > > * GItHub > > * GitLab > > * BitBucket > > > > Additionally, DLab includes the option to configure the UnGit tool in an > environment to facilitate collaboration. > > Finally, DLab integrates closely with many security and SSO offerings, > including: > > > > * LDAP > > * Microsoft Active Directory > > * AWS Identity Access Management service > > > > DLab was designed from the ground up to be highly configurable, > flexible, and extensible platform. We believe these qualities will > encourage community growth by enabling contributors to easily add new > integrations and extensions. > > > > == Initial Goals == > > The initial goal will be to move the existing codebase to Apache and > integrate with the Apache development process and infrastructure. A primary > goal of incubation will be to grow and diversify the DLab PPMC. We are well > aware that the project community is comprised of individuals from a single > company. We aim to change that during incubation. > > > > == Current Status == > > As previously mentioned, DLab is under active development at EPAM > Systems, and is being used in a number of production deployments: > > > > * [An investment company] is using DLab as an AWS-based analytics > platform for their data scientists to provide a convenient way to perform > multi-tenant data analytics. This enables data scientists to easily > provision work environments with integrated data sources based on > Elasticsearch, Apache HBase, and Neo4j, and utilizing Apache Spark. This > enabled a “one click”, self service option for users to provision an > environment with the necessary tools and data. > > > > * [An electronics manufacturing company] leverages DLab for data > quality, data exploration, and analytics. The company’s data scientists > leverage DLab to work with data sources that have been transferred to the > cloud in order to find new insights on the data, and help the > implementation team define requirements for data engineering. The main goal > is to increase the utilization of various tools by decreasing time to > deployment. > > > > * [A retail company] is using DLab as an image recognition framework, to > enable automated restocking of inventory. > > > > * [A travel company] is using DLab to create recommendation engine that > will allow end users to find more relevant accommodations faster and at a > lower cost. > > > > === Meritocracy === > > We value meritocracy and we understand that it is the basis for an open > community that encourages multiple companies and individuals to contribute > and be invested in the project’s future. We will encourage and monitor > participation and make sure to extend privileges and responsibilities to > all contributors. > > > > === Community === > > DLab is currently being used by developers at EPAM and a gowing number > of customers are actively using it in production environments. By bringing > DLab to Apache we hope to broaden and diversity the user and developer > community through open collaboration. > > > > === Core Developers === > > DLab was initially developed at EPAM Systems and is under active > development. We believe DLab will be of interest to a broad range of users > and devlopers and that incubating the project at the ASF will help us build > a diverse, sustainable community. > > > > === Alignment === > > DLab utilizes other Apache projects such as Apache Spark, Apache Toree > (incubating), and Apache Zeppelin, along with a number of other Apache > libraries. We anticipate integration with additional Apache projects as the > DLab community and interest in the project grows. > > > > == Known Risks == > > > > === Orphaned products === > > EPAM Systems is committed to the future development of DLab and > understands that graduation to a TLP, while preferable, is not the only > positive outcome of incubation. > > > > Should the DLab project be accepted by the Incubator, the prospective > PPMC would be willing to agree to a target incubation period of 2 years or > less, knowing that every Incubator project incurs a certain cost in terms > of ASF infrastructure and volunteer time. > > > > === Inexperience with Open Source === > > Many DLab contributors are already familiar with open source processes > and several of them are committers on other Apache projects. We will be > actively working with experienced Apache community members to improve our > project. > > > > === Homogenous Developers === > > The initial committers of DLab all come from EPAM Systems, though we > are committed to recruiting and developing additional committers from a > wide spectrum of industries and backgrounds. > > > > === Reliance on Salaried Developers === > > It is expected that DLab development will occur on both salaried time > and on volunteer time, after hours. All of the initial committers are paid > by EPAM Systems to contribute to this project. However, they are all > passionate about the project, and we are both confident and hopeful that > the project will continue even if no salaried developers contribute to the > project. > > > > === Relationships with Other Apache Products === > > As mentioned in the Rationale section, DLab utilizes a number of > existing Apache projects (Spark, Toree, Zeppelin, et. al.), and we expect > that list to expand as the community grows and diversifies. Any Apache > project in the big data, data science, and/or analytics space would be > potentially relevant. > > > > === A Excessive Fascination with the Apache Brand === > > We are applying to the Incubator process because we think it is the next > logical step for the DLab project after open-sourcing the code. This > proposal is not for the purpose of generating publicity. Rather, we want to > make sure to create a very inclusive and meritocratic community, outside > the umbrella of a single company. EPAM has a long history of contributing > to Apache projects and the DLab developers and contributors understand the > implication of making it an Apache project. > > > > == Required Resources == > > > > === Mailing lists === > > * d...@dlab.incubator.apache.org > > * comm...@dlab.incubator.apache.org > > * priv...@dlab.incubator.apache.org > > > > === Source control === > > * https://git-wip-us.apache.org/repos/asf/incubator-dlab > > > > === Issue tracking === > > * JIRA DLab (DLAB) > > > > == Documentation == > > * DLab Website: http://dlab.opensource.epam.com > > * DLab code base: https://github.com/epam/DLab > > * DLab Overview: https://github.com/epam/DLab/blob/master/README.md > > * DLab User Guide: > https://github.com/epam/DLab/blob/master/USER_GUIDE.md > > > > == Initial Source == > > The DLab codebase is currently hosted on Github: > https://github.com/epam/DLab > > > > == Source and Intellectual Property Submission Plan == > > The DLab source code in Github is currently licensed under Apache > License v2.0 and the copyright is assigned to EPAM Systems. If DLab becomes > an Incubator project at the ASF, EPAM Systems will transfer the source code > and trademark ownership to the Apache Software Foundation via a Software > Grant Agreement. > > > > == External Dependencies == > > To the best of our knowledge, all of DLab dependencies are distributed > under Apache compatible licenses. > > > > DLab was designed to be highly extensible, and we expect and encourage > the development of third-party extensions and plug-ins. We also understand > that any such component, if it requires a dependency forbidden by Apache > license policy, would not be eligible for inclusion in an Apache release, > and would have to be hosted, supported, etc. outside of ASF infrastructure > and labeled appropriately. > > > > === External dependencies licensed under Apache License 2.0: === > > MongoDB Java Driver - org.mongodb:mongo-java-driver ( > http://mongodb.github.io/mongo-java-driver/3.2/driver) > > > > Dropwizard (https://github.com/dropwizard/dropwizard) > > > > Dropwizard Template Config ( > https://github.com/tkrille/dropwizard-template-config) > > > > Apache Directory Server (https://github.com/apache/directory-server) > > > > Jackson (https://github.com/FasterXML/jackson) > > > > AWS Java SDK (https://github.com/aws/aws-sdk-java) > > > > Boto3 (https://github.com/boto/boto3) > > > > === External dependencies licensed under the MIT License: === > > angular2-app (https://www.npmjs.com/package/angular2-app) > > > > angular2-seed (https://www.npmjs.com/package/angular2-seed) > > > > angular2-seed-advanced ( > https://www.npmjs.org/package/angular2-seed-advanced) > > > > angular2-seed-n3UX (https://www.npmjs.com/package/angular2-seed-n3UX) > > > > http-status-enum (https://www.npmjs.com/package/http-status-enum) > > Mockito (https://github.com/mockito/mockito) > > > > ng2-translate (https://www.npmjs.com/package/ng2-translate) > > > > SLF4J (http://www.slf4j.org/) > > > > === External dependencies licensed under the CDDL License: === > > Jersey (https://github.com/jersey/jersey) > > > > === External dependencies licensed under the Python Software License > Version 2: === > > jython (https://github.com/jythontools/jython) > > > > === ASF Projects: === > > Apache Spark, Apache Toree (incubating), Apache Zeppelin > > > > == Cryptography == > > Not applicable. > > > > == Initial Committers == > > * Dmytro Liaskovskyi dmytro_liaskovs...@epam.com > > * Volodymyr Veres volodymyr_ve...@epam.com > > * Oleh Hrynets oleh_hryn...@epam.com > > * Oleh Hrynyk oleh_hry...@epam.com > > * Oleh Martushevskyi oleh_martushevs...@epam.com > > * Oleh Moskovych oleh_moskov...@epam.com > > * Vadym Kuznetsov vadym_kuznet...@epam.com > > * Usein Faradzhev usein_faradz...@epam.com > > * Bohdan Hliva bohdan_hl...@epam.com > > * Oleksandr Melnychuk oleksandr_melnych...@epam.com > > * Mikhail Teplitskiy mikhail_teplits...@epam.com > > * Vira Vitanska vira_vitan...@epam.com > > * Andriana Kovalyshyn andriana_kovalys...@epam.com > > * Oleksandr Chaparin oleksandr_chapa...@epam.com > > * Denys Shliakhov denys_shliak...@epam.com > > * Nazar Barabash nazar_barab...@epam.com > > * Yuriy Holinko yuriy_holi...@epam.com > > * Petro Kotsiuba petro_kotsi...@epam.com > > * Bogdan Rudyi bogdan_ru...@epam.com > > * Mikhail Teplitskyi mikhail_teplits...@epam.com > > > > == Sponsors == > > > > === Champion === > > * P. Taylor Goetz ptgo...@apache.org > > > > === Nominated Mentors === > > * P. Taylor Goetz ptgo...@apache.org > > * Henry Saputra hsapu...@apache.org > > > > === Interested Contributors === > > * Debo Dutta ddu...@apache.org > > > > === Sponsoring Entity === > > * The Apache Incubator > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >