Great, thank you... i will update the wiki and send a vote in the morning On Mon, Aug 20, 2018 at 18:15 William Colen <co...@apache.org> wrote:
> Luciano, > > Now I can properly volunteer myself as a mentor. > > Thank you > William > > Em seg, 20 de ago de 2018 às 19:37, William Colen <co...@apache.org> > escreveu: > > > Luciano, I am very happy to see this incubation proposal here. I am > > following this project for a while. Knowing the reputation on the > > committees involved and B2W I know it can be very successful. > > I am waiting for the board aproval for joining IPMC. Once approved I > would > > happly volunteer myself as a mentor. > > > > Em seg, 20 de ago de 2018 às 17:45, Luciano Resende < > luckbr1...@gmail.com> > > escreveu: > > > >> We are still looking for one more mentor before we put this proposal to > a > >> vote, any volunteers? > >> > >> On Wed, Aug 15, 2018 at 12:13 PM Luciano Resende <luckbr1...@gmail.com> > >> wrote: > >> > >> > We would like to start a discussion on accepting Marvin-AI as an > Apache > >> > Incubator project. > >> > > >> > The proposal is available at the incubator wiki, and also copied > below: > >> > https://wiki.apache.org/incubator/Marvin-AI > >> > > >> > As part of the initial due diligence, we have done a preliminary name > >> > search and the results are available on the JIRA below: > >> > > >> > https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-144 > >> > > >> > We are also looking for two additional mentors. > >> > > >> > > >> > Thanks in advance for your time reviewing and providing feedback. > >> > > >> > === > >> > > >> > = Marvin-AI = > >> > > >> > == Abstract == > >> > > >> > Marvin-AI is an open-source artificial intelligence (AI) platform that > >> > helps data scientists, prototype and productionalize complex solutions > >> with > >> > a scalable, low-latency, language-agnostic, and standardized > >> architecture > >> > while simplifies the process of exploration and modeling. > >> > > >> > == Proposal == > >> > > >> > Marvin helps non-experienced developers create industry-grade AI > >> > applications. It has three core components: a development environment > >> to > >> > be used during data exploration and hypothesis validation (Toolbox), a > >> > library which should be extended to create Marvin engines, and a Scala > >> > application server which interprets engines (Engine Executor). > >> > A basic premise of Marvin is that it should be language-agnostic, able > >> to > >> > interpret engines implemented in different programming languages. > >> > > >> > == Background == > >> > > >> > The Marvin AI project was initiated as an internal project at B2W > >> Digital > >> > (Brazil), the largest e-commerce company in Latin America. Nowadays, > it > >> is > >> > used by all data scientists within the B2W team. Oftentimes, data > >> > scientists don't have an extensive background in software engineering, > >> yet > >> > are in charge of creating AI applications that need to scale to high > >> > throughput and provide millisecond-level response times. At B2W, > Marvin > >> AI > >> > plays an important role in this process, abstracting advanced software > >> > engineering procedures, allowing data scientists to focus on their > >> > knowledge domain. > >> > > >> > == Rationale == > >> > > >> > With recent advances in computer architecture and a corresponding > >> increase > >> > in the amount of data generated by always-connected devices, AI > >> algorithms > >> > offer a solution to problems that have long troubled modern > >> corporations. > >> > Since AI developers come from various fields, such as statistics, > >> physics, > >> > and math, there exists a strong need for platforms which enable them > to > >> > move from prototypes to enterprise applications. Although some tools > >> claim > >> > to offer this service, in reality, there is no reliable open-source > >> > solution. > >> > > >> > == Initial Goals == > >> > > >> > The initial goals will most likely be to merge the existing codebase > >> into > >> > a single repository, migrate it to Apache, and then integrate with the > >> > Apache development process. Furthermore, we plan for incremental > >> > development and releases, as per Apache guidelines. > >> > > >> > == Current Status == > >> > > >> > === Meritocracy === > >> > > >> > Marvin already works under principles of meritocracy. Today, Marvin > >> > already has some contributors that are part of other institutions. > >> Although > >> > there is no formal process defined to become a committer, contributors > >> that > >> > make major changes/improvements to the platform are naturally granted > >> write > >> > access to the repository. > >> > > >> > > >> > === Community === > >> > > >> > Acceptance into the Apache foundation would substantially boost both > >> > Marvin's user and developer communities. The current community > includes > >> a > >> > few experienced developers that have either academic or professional > >> > experience with AI. The community is largely comprised of data > >> scientists > >> > working at B2W and other companies such as Cloudera, MIT, Qume Labs, > >> > Laguro.com, and CBYK. Also, there is a meetup group of hundreds of > >> users > >> > who meet regularly to exchange ideas about Marvin and, more generally, > >> AI. > >> > > >> > Reference to the group: https://www.meetup.com/marvin-ai/members/ > >> > > >> > === Core Developers === > >> > > >> > The core developers for Marvin are listed in the contributor's list > and > >> > initial PPMC below. These lists include B2W employees, MIT students, > >> UFSCAR > >> > researchers, independent contributors, and some employees of other > >> > companies like Cloudera, Qume Labs, Laguro.com, and CBYK. > >> > > >> > === Alignment === > >> > > >> > The initial committers strongly believe that by being part of the > Apache > >> > Software Foundation, Marvin AI will be part of a comprehensive suite > >> for AI > >> > applications that can process big data and enable enterprises to > extract > >> > value from their data lakes. Also, we hope that by integrating with > >> other > >> > Apache projects such as Apache Spark, Apache Hadoop; that this will > >> foster > >> > additional collaboration between these projects furthering the already > >> > existing integration points and expanding the community of > contributors. > >> > > >> > > >> > == Known Risks == > >> > > >> > === Orphaned products === > >> > > >> > Given the current maturity of Marvin and how well it has been received > >> at > >> > technical conferences, the risk of the project being abandoned is > >> minimal. > >> > AI is not academia-exclusive anymore, and as enterprises start to add > >> > data-science pipelines to their applications, demand for Marvin will > >> only > >> > increase. > >> > > >> > === Inexperience with Open Source === > >> > > >> > Marvin AI has been an open-source project since October 2017. The > >> project > >> > was started in a company where open-source culture is foundational. > B2W > >> > Digital runs the largest e-commerce in Latin America on top of > >> open-source > >> > projects. > >> > > >> > === Reliance on Salaried Developers === > >> > > >> > Marvin AI receives substantial efforts from salaried developers -- a > few > >> > of which were hired by companies to work exclusively for the project > -- > >> but > >> > the majority devote "after-hours" or spare time to this project. Some > >> > developers are graduate students that contribute in their free time at > >> > school. > >> > > >> > === Relationships with Other Apache Products === > >> > > >> > Marvin integrates with several Apache products, such as Hadoop (HDFS) > >> and > >> > Spark. Marvin shares some similar features with PredictionIO, > >> specifically > >> > the model application server and a design pattern that was inspired by > >> the > >> > DASE. Despite these similarities, Marvin is catered towards a > different > >> > clientele (data scientists), and for that reason, it includes many > >> critical > >> > features that are not provided by PredictionIO. > >> > > >> > === An Excessive Fascination with the Apache Brand === > >> > > >> > While the ASF brand will undoubtedly help Marvin become a successful > >> > project, Marvin is already gaining traction at companies around the > >> globe. > >> > > >> > == Documentation == > >> > > >> > http://www.marvin-ai.org > >> > > >> > > >> > == Initial Source == > >> > > >> > The current codebase is available at http://github.com/marvin-ai. > This > >> is > >> > practically the same code that will be migrating to the Apache > >> Foundation, > >> > the notable difference being that the multiple repositories will be > >> merged > >> > into a single repository (if necessary). > >> > > >> > These are the main repositories and a very simplified explanation > about > >> > each one: > >> > > >> > '''Main repositories''' > >> > > >> > * marvin-ai/marvin-python-toolbox - Data Science toolbox that helps > in > >> > the creation of new ML engines > >> > * marvin-ai/marvin-engine-executor - Component responsible for > >> > interpreting, serving and managing Marvin engines > >> > * marvin-ai/marvin-public-engines - Marvin engine examples to help > new > >> > Marvin users to build engines > >> > * marvin-ai/marvin-platform-book - Documentation in GitHub book site > >> > format > >> > > >> > '''Secondary repositories (Experimental and Initial)''' > >> > * marvin-ai/marvin-vagrant-dev - Development environment that uses > >> > VirtualBox and vagrant to non mac and Linux users; > >> > * marvin-ai/marvin-paper - Source code (latex format) of the first > >> Marvin > >> > paper published in PAPIS.io conference in Boston. > >> > * marvin-ai/marvin-cluster-admin - Admin module responsible to manage > >> > Marvin cluster; > >> > * marvin-ai/marvin-automl - AutoML module responsible to help data > >> > scientist to build machine learning models with a very simple visual > >> > interface; > >> > > >> > > >> > == External Dependencies == > >> > > >> > It is very likely that all our dependencies are using either the > Apache > >> or > >> > MIT license. Upon acceptance to the incubator, we would begin a > thorough > >> > analysis of all transitive dependencies to verify this fact and > >> introduce > >> > license checking into the build and release process. > >> > > >> > == Required Resources == > >> > > >> > === Mailing lists === > >> > > >> > * priv...@marvin-ai.incubator.apache.org (with moderated > >> subscriptions) > >> > * d...@marvin-ai.incubator.apache.org > >> > * comm...@marvin-ai.incubator.apache.org > >> > > >> > > >> > === Git Repositories === > >> > > >> > * https://git-wip-us.apache.org/repos/asf/incubator-marvin-ai.git > >> > > >> > === Issue Tracking === > >> > > >> > * JIRA (MARVIN) > >> > > >> > == Initial Committers == > >> > > >> > * Lucas Bonatto Miguel <lucasbona...@gmail.com> - Qume Labs > >> (California > >> > - USA) > >> > * Daniel Takabayashi <daniel.takabaya...@gmail.com> - B2W Digital > (São > >> > Paulo - BR) / Laguro.com (California - USA) > >> > * Bruno Piraja <bruno.pir...@b2wdigital.com> - B2W Digital (São > Paulo > >> - > >> > BR) > >> > * Zhang Yifei <zhang.yi...@b2wdigital.com> - B2W Digital (São Paulo > - > >> BR) > >> > * Harrison Wang <hwang...@mit.edu> - MIT (USA) > >> > * Brody West <bro...@mit.edu> - MIT (USA) > >> > * Rafael Novello <rafael.nove...@b2wdigital.com> - B2W Digital (São > >> > Paulo - BR) > >> > * Willian Leite <willian.le...@cbyk.com.br> - CBYK (São Paulo - BR) > >> > * Danilo Nunes <nunesdan...@gmail.com> - Qume Labs (California - > USA) > >> > * Alan Silva <alan.si...@cloudera.com> Cloudera (USA) > >> > * Jeremy Elster <jeremy.els...@b2wdigital.com> - B2W Digital (São > >> Paulo > >> > - BR) > >> > > >> > > >> > == Sponsors == > >> > > >> > === Champion === > >> > > >> > * Luciano Resende - (lresende) > >> > > >> > === Nominated Mentors === > >> > > >> > * Luciano Resende - (lresende) > >> > > >> > === Sponsoring Entity === > >> > We would like to propose the Apache Incubator to sponsor this project. > >> > > >> > -- > >> > Luciano Resende > >> > http://twitter.com/lresende1975 > >> > http://lresende.blogspot.com/ > >> > > >> > >> > >> -- > >> Luciano Resende > >> http://twitter.com/lresende1975 > >> http://lresende.blogspot.com/ > >> > > > -- Sent from my Mobile device