+1 On 14/09/2015 16:17, "Adam Fuchs" <afu...@apache.org> wrote:
>Thanks again for the healthy discussion on Rya. With that, I would like to >call a VOTE for accepting Rya as a new incubator project. > >The proposal text is included below, and is posted on the wiki here: >https://wiki.apache.org/incubator/RyaProposal > >The discussion thread on Rya starts here: >http://mail-archives.apache.org/mod_mbox/incubator-general/201509.mbox/%3C >CALt5_xJKtRcUr3WGjfrY77DYWF0-8DWi%3DzyS7hrMFTg%2BYAORjQ%40mail.gmail.com%3 >E > >The vote will be open until Thu Sep 17 15:15:00 UTC 2015. > >[ ] +1 accept Rya in the Incubator >[ ] ±0 >[ ] -1 because... > >Thanks, >Adam > > >= Rya Proposal = >== Abstract == >Rya (pronounced "ree-uh" /rēə/) is a cloud-based RDF triple store that >supports SPARQL queries. > >== Proposal == >Rya is a scalable RDF data management system built on top of Accumulo. Rya >uses novel storage methods, indexing schemes, and query processing >techniques that scale to billions of triples across multiple nodes. Rya >provides fast and easy access to the data through SPARQL, a conventional >query mechanism for RDF data. > >== Background == >RDF is a World Wide Web Consortium (W3C) standard used in describing >resources on the Web. The smallest data unit is a triple consisting of >subject, predicate, and object. Using this framework, it is very easy to >describe any resource, not just Web related. For example, if you want to >say that Alice is a professor, you can represent this as an RDF triple >like >(Alice, rdf:type, Professor). In general, RDF is an open world framework >that allows anyone to make any statement about any resource, which makes >it > a popular choice for expressing a large variety of data. > >RDF is used in conjunction with the Web Ontology Language (OWL). OWL is a >framework for describing models or ontologies for RDF. It defines >concepts, >relationships, and/or structure of RDF documents. These models can be used >to 'reason/infer' information about entities within a given domain. For >example, you can express that a Professor is a sub class of Faculty, >(Professor, rdfs:subClassOf, Faculty) and knowing that (Alice, rdf:type, >Professor), it can be inferred that (Alice, rdf:type, Faculty). > >SPARQL is an RDF query language. Similar with SQL, SPARQL has SELECT and >WHERE clauses; however, it is based on querying and retrieving RDF >triples. > >Work on Rya, a large scale distributed system for storing and querying >RDF >data, started in 2010. > >== Rationale == >With the increase in data size, there is a need for scalable systems for >storing and retrieving RDF data in a cluster of nodes. We believe that Rya >can fulfill that role. We expect that communities within government, >health >care, finance, and others who generate large amounts of RDF data will be >most interested in this project. > >From its inception, the project operated with an Apache-style license, but >it was open to mostly US government-related projects only. We believe that >having the project and the development open for all will benefit both the >project and the interested communities. > >== Current Status == >The project source code and documentation are currently hosted in a >private >repository on Github. New users are added to the repository upon request. > >=== Meritocracy === >Meritocracy is the model that we currently follow, and we want to build a >larger and more diverse developer community by becoming an Apache project. > >=== Community === >Rya has being building a community of users and developers for the past 3 >years. There is currently an active workgroup with monthly meetings and >the >number of participants in the meeting is increasing. > >=== Core Developers === >The core developers are a diverse group of people who are either >government >employees or former / current government contractors from different >companies. > >=== Alignment === >Rya is built on top of Accumulo, an Apache project. > >== Known Risks == >=== Orphaned Products === >There is a very small risk of becoming orphaned. The current contributors >are strongly committed to the project, there is a large enough number of >developers interested in contributing to the project, and we believe that >the support for the project will continue to grow from the interested >communities. > >=== Inexperience with Open Source === >The initial committers have various degrees of experience with open source >projects - from very new to experienced. This project was open source >within government from the beginning. We are aware that it will be >different and more difficult functioning in a real open source >environment. >We are enthusiastic and committed to learning the Apache way and being >successful in operating under Apache's development process. > >=== Homogenous Developers === >The current list of developers form a heterogeneous group, with people for >academia, government, and industry, collaborating from distributed >geographic locations. We aim to expand the list of contributors with the >help of the Apache incubation process. > >=== Reliance on Salaried Developers === >Many but not all of the developers working on the project are salaried >employees, paid to work on this project. They will continue to contribute >to the open source project. Some of the initial committers continued as >volunteers even if no longer employed to work on this project and they >plan >to continue supporting the project. > >=== Relationships with Other Apache Products === >Rya uses Apache Accumulo, Hadoop, Zookeeper, Maven. > > *Apache Jena API or Apache Commons RDF API could become the RDF API used >by Rya, but such a decision was not made. > *Apache Clerezza is database/triple store agnostic, and as such could be >complementary to Rya. > *Apache Stanbol focuses on providing semantic services, while Rya focuses >on providing a distributed triple store solution, with support for SPARQL >and OWL reasoning. > *Apache Marmotta provides an implementation of a Linked Data Platform, >and >overlaps in some of the goals and functionality with Rya (RDF triple >store, >SPARQL support among others). There are many opportunities for >collaboration with these projects and we are looking forward to such a >collaboration. > >=== Apache Brand === >Rya has generated interest in the government. It also generated interest >within academia and industry. We believe that everyone could benefit from >having Rya as an open source project. Due to its strong ties to Accumulo, >an Apache project, and due to the values of the Apache Foundation, we >believe that Apache incubator is the right place for Rya. > >== Documentation == >Two peer-reviewed publications [1,2] about Rya were published in 2012 and >2015. More documentation is available in the code. > >[1] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[ >http://www.usna.edu/Users/cs/adina/research/Rya%5FCloudI%32%30%31%32.pdf|R >ya: >A Scalable RDF Triple Store for the Clouds]]. Proceedings of the 1st >International Workshop on Cloud Intelligence, Pages 4:1-4:8, August 2012 > >[2] Roshan Punnoose, Adina Crainiceanu, David Rapp. [[ >http://www.usna.edu/Users/cs/adina/research/Rya_ISjournal2013.pdf|SPARQL >in >the Clouds Using Rya]]. Information Systems, Volume 48, Pages 181-195, >March 2015 (Available online 23 July 2013) > >== Initial Source == >The code is currently in a private Github repository, due to security and >IP review processes. We intend to open it up via transferring the code to >an ASF repository. > >== Source and Intellectual Property Submission Plan == >The source code has been released under the Apache License, Version 2. >Software grant, and CCLAs have been submitted. ICLAs for initial >committers >have been submitted or are in progress. > >== External Dependencies == > * [[http://rdf4j.org|OpenRDF Sesame]] (BSD license) > * [[http://www.geomesa.org/|GeoMesa]] (Apache License, Version 2.0) > * [[https://accumulo.apache.org/|Accumulo]] (Apache License, Version 2.0) > * [[https://hadoop.apache.org/|Hadoop]] (Apache License, Version 2.0) > * [[https://pig.apache.org/|Pig]] (Apache License, Version 2.0) > * [[http://tinkerpop.incubator.apache.org/|TinkerPop]] (Apache License, >Version 2.0) > >== Cryptography == >The proposal does not involve any cryptographic code. > >== Required Resources == >=== Mailing lists === > * priv...@rya.incubator.apache.org > * d...@rya.incubator.apache.org > * comm...@rya.incubator.apache.org > >=== Git Repository === >https://git-wip-us.apache.org/repos/asf/incubator-rya.git > >=== Issue Tracking === >JIRA Rya > >== Initial Committers == > * Roshan Punnoose, roshanp at gmail dot com > * David Rapp, dnrapp at ncsu dot edu > * Adina Crainiceanu, adinancr at gmail dot com > * Aaron Mihalik, aaron.mihalik at gmail dot com > * Puja Valiyil, pujav65 at gmail dot com > * Jennifer Brown, jennifer.brown at parsons dot com > * Steve Wagner, steve.r.wagner at gmail dot com > >== Affiliations == > * Roshan Punnoose, Enlighten IT Consulting > * David Rapp, North Carolina State University > * Adina Crainiceanu, US Naval Academy > * Aaron Mihalik, Parsons > * Puja Valiyil, Parsons > * Jennifer Brown, Parsons > * Steve Wagner, Enlighten IT Consulting > >== Sponsors == >=== Champion === > * Adam Fuchs, ASF Member, afuchs at apache dot org > >=== Nominated Mentors === > * Josh Elser josh dot elser at gmail dot com > * Edward J. Yoon edwardyoon at apache dot org > * Sean Busbey busbey at cloudera dot com > >We are seeking additional mentors > >=== Sponsoring Entity === >Apache Incubator --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org