[jira] [Comment Edited] (SOLR-7560) Parallel SQL Support

Joel Bernstein (JIRA) Sun, 14 Jun 2015 14:05:17 -0700

    [ 
https://issues.apache.org/jira/browse/SOLR-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14585271#comment-14585271
 ]


Joel Bernstein edited comment on SOLR-7560 at 6/14/15 9:04 PM:
---------------------------------------------------------------

There is still some more work to do before it's ready for a release but I 
believe this ticket is far enough along to commit to *trunk*.

The main goals have been accomplished:

Two SQL constructs are supported:

1) SELECT statements with complex nested WHERE clauses, ORDER BY and LIMIT.
2) GROUP BY statements with aggregate functions, complex nested WHERE and 
HAVING clauses, ORDER BY and LIMIT. The functions supported are SUM, COUNT, 
AVG, MIN, MAX.

The GROUP BY aggregations can be run in parallel across a SolrCloud worker 
collection, providing a true *Parallel SQL framework* for SolrCloud.

A new /sql request (SQLHandler.java) has been created to handle SQL requests 
and stream back the results. 

Initial test cases demonstrate the functionality described above.

The excellent *Presto SQL parser* has been integrated into the SQLHandler and 
all the licenses for new dependencies have been added to the project.

The JDBC driver has not been built as part of this ticket so the SQLHandler's 
HTTP interface will need to be accessed directly.






 


was (Author: joel.bernstein):
There is still some more work to do before it's ready for a release but I 
believe this ticket is far enough along to commit to *trunk*.

The main goals have been accomplished:

Two SQL constructs are supported:

1) SELECT statements with complex nested WHERE clauses, ORDER BY and LIMIT.
2) GROUP BY statements with aggregate functions, complex nested WHERE and 
HAVING clauses, ORDER BY and LIMIT. The functions supported are SUM, COUNT, 
AVG, MIN, MAX.

The GROUP BY aggregations can be run in parallel across a SolrCloud worker 
collection, providing true a *Parallel SQL framework* for SolrCloud.

A new /sql request (SQLHandler.java) has been created to handle SQL requests 
and stream back the results. 

Initial test cases demonstrate the functionality described above.

The excellent *Presto SQL parser* has been integrated into the SQLHandler and 
all the licenses for new dependencies have been added to the project.

The JDBC driver has not been built as part of this ticket so the SQLHandler's 
HTTP interface will need to be accessed directly.






 

> Parallel SQL Support
> --------------------
>
>                 Key: SOLR-7560
>                 URL: https://issues.apache.org/jira/browse/SOLR-7560
>             Project: Solr
>          Issue Type: New Feature
>          Components: clients - java, search
>            Reporter: Joel Bernstein
>             Fix For: 5.3
>
>         Attachments: SOLR-7560.patch, SOLR-7560.patch, SOLR-7560.patch, 
> SOLR-7560.patch
>
>
> This ticket provides support for executing *Parallel SQL* queries across 
> SolrCloud collections. The SQL engine will be built on top of the Streaming 
> API (SOLR-7082), which provides support for *parallel relational algebra* and 
> *real-time map-reduce*.
> Basic design:
> 1) A new SQLHandler will be added to process SQL requests. The SQL statements 
> will be compiled to live Streaming API objects for parallel execution across 
> SolrCloud worker nodes.
> 2) SolrCloud collections will be abstracted as *Relational Tables*. 
> 3) The Presto SQL parser will be used to parse the SQL statements.
> 4) A JDBC thin client will be added as a Solrj client.
> This ticket will focus on putting the framework in place and providing basic 
> SELECT support and GROUP BY aggregate support.
> Future releases will build on this framework to provide additional SQL 
> features.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Comment Edited] (SOLR-7560) Parallel SQL Support

Reply via email to