date:20120720

[VOTE] Accept Blur into the Apache Incubator

2012-07-20 Thread Aaron McCurry

I would like to call a vote for accepting Blur for incubation in the
Apache Incubator. The full proposal is available below.

Please cast your vote:

[ ] +1, bring Blur into Incubator
[ ] +0, I don't care either way,
[ ] -1, do not bring Blur into Incubator, because...

This vote will be open for 72 hours and only votes from the Incubator
PMC are binding.

Thank you for your consideration!

Aaron

http://wiki.apache.org/incubator/BlurProposal

= Blur Proposal =

== Abstract ==
Blur is a search platform capable of searching massive amounts of data
in a cloud computing environment. Blur leverages several existing
Apache projects, including Apache Lucene, Apache Hadoop, Apache
!ZooKeeper and Apache Thrift.  Both bulk and near real time (NRT)
updates are possible with Blur.  Bulk updates are accomplished using
Hadoop Map/Reduce, and NRT updates are performed through direct Thrift calls.

== Proposal ==
Blur is an open source search platform capable of querying massive
amounts of data at incredible speeds. Rather than using the flat,
document-like data model used by most search solutions, Blur allows
you to build rich data models and search them in a semi-relational
manner similar to joins while querying a relational database. Using
Blur, you can get precise search results against terabytes of data at
Google-like speeds.  Blur leverages multiple open source projects
including Hadoop, Lucene, Thrift and !ZooKeeper to create an
environment where structured data can be transformed into an index
that runs on a Hadoop cluster.  Blur uses the power of Map/Reduce for
bulk indexing into Blur.  Server failures are handled automatically by
using !ZooKeeper for cluster state and HDFS for index storage.

== Background ==
Blur was created by Aaron !McCurry in 2010. Blur was developed to
solve the challenges in dealing with searching huge quantities of data
that the traditional RDBMS solutions could not cope with while still
providing JOIN-like capabilities to query the data.  Several other
open source projects have implemented aspects of this design including
elasticsearch, Katta and Apache Solr.

== Rationale ==
There is a need for a distributed search capability within the Hadoop
ecosystem. Currently, there are no other search solutions that
natively leverage HDFS and the failover features of Hadoop in the same
manner as the Blur project. The communities we expect to be most
interested in such a project are government, health care, and other
industries where scalability is a concern. We have made much progress
in developing this project over the past 2 years and believe both the
project and the interested communities would benefit from this work
being openly available and having open development.  In future
versions of Blur, the API will more closely follow the APIs provided
in Lucene so that systems that already use Lucene can more easily
scale with Blur. Blur can be viewed as a query execution engine that
Lucene-based solutions can utilize when scale becomes an issue.

== Initial Goals ==
The initial goals of the project are:
 * Migrate the Blur codebase, issue tracking and wiki from
github.com and integrate the project with the ASF infrastructure.
 * Add new committers to the project and grow the community in The Apache Way.

== Current Status ==

=== Meritocracy ===
Blur was initially developed by Aaron !McCurry in June 2010.  Since
then Blur has continued to evolve with the support of a small
development team at Near Infinity.  As a part of the Apache Software
Foundation, the Apache Blur team intends to strongly encourage the
community to help with and contribute to the project.  Apache Blur
will actively seek potential committers and help them become familiar
with the codebase.

=== Community ===
A small community has developed around Blur and several project teams
are currently using Blur for their big data search capability. The
source code is currently available on GitHub and there is a dedicated
website (blur.io) that provides an overview of the project. Blur has
been shared with several members of the Apache community and has been
presented at the Bay Area HUG (see
http://www.meetup.com/hadoop/events/20109471/).

=== Core Developers ===
The current developers are employed by Near Infinity Corporation, but
we anticipate interest developing among other companies.

=== Alignment ===
Blur is built on top of a number of Apache projects: Hadoop, Lucene,
!ZooKeeper, and Thrift. It builds with Maven.  During the course of
Blur development, a couple of patches have been committed back to the
Lucene project, including LUCENE-2205 and LUCENE-2215.  Due to the
strong relationship with the aforementioned Apache projects, the
Incubator is a good match for Blur.

== Known Risks ==

=== Orphaned Products ===
There is only a small risk of being orphaned. The customers that
currently use Blur are committed to improving the codebase of the
project due to its fulfilling needs not addressed by any other
software. In addition, one

Re: [VOTE] Accept Blur into the Apache Incubator

2012-07-20 Thread Dave Fisher

+1! - Binding.

On Jul 20, 2012, at 9:42 AM, Aaron McCurry wrote:

 I would like to call a vote for accepting Blur for incubation in the
 Apache Incubator. The full proposal is available below.
 

Re: [VOTE] Accept Blur into the Apache Incubator

2012-07-20 Thread Bill Graham

+1 (non-binding)

On Fri, Jul 20, 2012 at 9:48 AM, Dave Fisher dave2w...@comcast.net wrote:

 +1! - Binding.

 On Jul 20, 2012, at 9:42 AM, Aaron McCurry wrote:

  I would like to call a vote for accepting Blur for incubation in the
  Apache Incubator. The full proposal is available below.
 

Re: [VOTE] Accept Blur into the Apache Incubator

2012-07-20 Thread Tim Williams

+1 (binding) :)

On Friday, July 20, 2012, Aaron McCurry wrote:

 I would like to call a vote for accepting Blur for incubation in the
 Apache Incubator. The full proposal is available below.


Re: [VOTE] Accept Blur into the Apache Incubator

2012-07-20 Thread Jukka Zitting

Hi,

On Fri, Jul 20, 2012 at 7:42 PM, Aaron McCurry amccu...@gmail.com wrote:
 I would like to call a vote for accepting Blur for incubation in the
 Apache Incubator. The full proposal is available below.

  [x] +1, bring Blur into Incubator

BR,

Jukka Zitting

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org

Re: Openmeetings release approval by Incubator PMC Was: Incubation state transitions and stuck projects (Was: February report review)

2012-07-20 Thread Jukka Zitting

Hi,

On Thu, Jul 19, 2012 at 2:45 PM, Alexei Fedotov
alexei.fedo...@gmail.com wrote:
 The release vote has successfully passed and been forwarded to general@
 list [1]. What should be our next step? Have we got Incubator PMC
 approval? Can we proceed with wider distribution?

See http://incubator.apache.org/incubation/Incubation_Policy.html#Releases
for the Incubator policy on releases. You're almost done, just an
extra approval vote on general@ is still needed (and since you already
have two mentor +1s, you only need one more IPMC member to vote for
the release).

BR,

Jukka Zitting


[jira] [Created] (PODLINGNAMESEARCH-9) Establish whether Apache Any23 is a suitable name

2012-07-20 Thread Lewis John McGibbney (JIRA)

Lewis John McGibbney created PODLINGNAMESEARCH-9:


    Summary: Establish whether Apache Any23 is a suitable name
        Key: PODLINGNAMESEARCH-9
        URL: https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-9
    Project: Podling Suitable Names Search
 Issue Type: Suitable Name Search
   Reporter: Lewis John McGibbney


Apache Any23 (Anything to Triples) is a library, a web service and a command 
line tool that extracts structured data in RDF format from a variety of Web 
documents. 

The project is currently in the Incubator, and the PPMC has expressed its
intention to move towards graduation; however, this issue could potentially be a blocker.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (PODLINGNAMESEARCH-9) Establish whether Apache Any23 is a suitable name

2012-07-20 Thread Lewis John McGibbney (JIRA)


[ https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-9?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419591#comment-13419591 ]

Lewis John McGibbney commented on PODLINGNAMESEARCH-9:
--

First 9 hits on Google

1) Any23: Anything to Triples - developers.any23.org/Share
This relates to the old developers site and associated documentation prior to 
acceptance into the Apache Incubator

2) Any23: Anything To Triples - any23.org/
This relates to the online service, which will shortly be redirected to the
Apache-hosted service

3) any23 - Anything to Triples - Google Project Hosting - 
code.google.com/p/any23/
The old googlecode site prior to migration to ASF

4) Apache Any23 - Introduction - incubator.apache.org/any23/
The current ASF hosted official Any23 site

5) Apache Any23 - ASF JIRA - https://issues.apache.org/jira/browse/ANY23
The official ASF hosted Any23 JIRA

6) Any23 Incubation Status Template - Apache Incubator - 
incubator.apache.org/projects/any23.html
The official incubator Any23 podling status page
 
7) ANY 23 - Log and ANY - www.anycorp.com/any_issue.php?id=23
Relates to a magazine which was published in 1998 and is now sold out

8) Any23 - Semantic Web Standards - www.w3.org/2001/sw/wiki/Any23
Result of disseminating Any23 whilst in the ASF Incubator

9) any23-dev | Google Groups - groups.google.com/group/any23-dev
The old google group developers forum which now shows an explicit message that 
everything has shifted to ASF




Re: [VOTE] Accept Blur into the Apache Incubator

2012-07-20 Thread Patrick Hunt

+1, bring Blur into Incubator

On Fri, Jul 20, 2012 at 9:42 AM, Aaron McCurry amccu...@gmail.com wrote:
 I would like to call a vote for accepting Blur for incubation in the
 Apache Incubator. The full proposal is available below.


Re: [VOTE] Accept Blur into the Apache Incubator

2012-07-20 Thread Ashish

+1 (non-binding)

On Fri, Jul 20, 2012 at 10:12 PM, Aaron McCurry amccu...@gmail.com wrote:
 I would like to call a vote for accepting Blur for incubation in the
 Apache Incubator. The full proposal is available below.

 Please cast your vote:

 [ ] +1, bring Blur into Incubator
 [ ] +0, I don't care either way,
 [ ] -1, do not bring Blur into Incubator, because...

 This vote will be open for 72 hours and only votes from the Incubator
 PMC are binding.

 Thank you for your consideration!

 Aaron


Re: [VOTE] Release Apache Amber 0.22-incubating

2012-07-20 Thread Raymond Feng

+1 from me.

Thanks,
Raymond

On Jul 18, 2012, at 12:33 AM, Simone Tripodi wrote:

 Good morning IPMC,
 
 I'm writing today to submit to you the VOTE on the first-ever release of
 Apache Amber, 0.22, under the Incubator.
 
 The PPMC vote already passed on amber-dev@ (http://s.apache.org/Ulr), with
 1 binding IPMC vote already cast.
 
 We solved 28 issues:
 http://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311100&styleName=Html&version=12322240
 
 There are still 14 issues left in JIRA:
 http://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&pid=12311100&status=1
 
 SVN source tag (r1359173):
 https://svn.apache.org/repos/asf/incubator/amber/tags/amber-0.22-incubating/
 
 Staging repo:
 https://repository.apache.org/content/repositories/orgapacheamber-039/
 
 Staging binaries:
 http://people.apache.org/builds/incubator/amber/
 
 Staging site:
 http://incubator.apache.org/amber/0.22-incubating/
 
 PGP release keys (signed using 19FEA27D):
 http://www.apache.org/dist/incubator/amber/KEYS
 
 The vote will be open for 72 hours and will close around July 21st at 7:30am GMT
 
 [ ] +1, let's get it rmblee!!!
 [ ] +/-0, fine, but consider fixing a few issues before...
 [ ] -1, nope, because... (and please explain why)
 
 Many thanks in advance, all the best!
 -Simo
 
 http://people.apache.org/~simonetripodi/
 http://simonetripodi.livejournal.com/
 http://twitter.com/simonetripodi
 http://www.99soft.org/
 
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org
 


