Yes, this is the right forum. Calcite makes this kind of data virtualization really easy. You just need to define a model file in JSON, with a schema for each data source. Then you can connect to Calcite via JDBC, and Calcite will route the queries to the right back-end.
Run through the tutorial, https://github.com/apache/incubator-calcite/blob/master/example/csv/TUTORIAL.md, to see how to connect from sqlline and edit models. Use a model similar to https://github.com/apache/incubator-calcite/blob/master/core/src/test/resources/mysql-foodmart-model.json, but give it two schemas. You can write queries across those schemas. For the sake of example, suppose you had { version: '1.0', defaultSchema: 'foodmart', schemas: [ { type: 'jdbc', name: 'foodmart', jdbcUser: 'foodmart', jdbcPassword: 'foodmart', jdbcUrl: 'jdbc:mysql://localhost', jdbcCatalog: 'foodmart', jdbcSchema: null }, { type: 'jdbc', name: ‘bardmart', jdbcUser: 'foodmart', jdbcPassword: 'foodmart', jdbcUrl: 'jdbc:mysql://localhost', jdbcCatalog: 'foodmart', jdbcSchema: null } ] } Then you could write a distributed query select count(*) from “foodmart”.”sales_fact_1997” as s join “bardmart”.”customer” as c on s.”customer_id” = c.”customer_id”; The two schemas have identical JDBC credentials in this example, but in real life they would be different JDBC databases, or instances of any schema adapter you like (e.g. CSV, Mongo). Julian On Nov 25, 2014, at 4:06 AM, Kishore Vajjala <[email protected]> wrote: > Hi, > > Apologies if this is not the right forum for my questions below. > > I have a few high level question using calcite as virtualization platform. > > My Scenario > > I have multiple data sources and would require a service which exposes a > unified JDBC interface to query the data sources. I was looking at some of > the presentations and found this to be a good fit initially. > > To the the questions now. > > 1. Does Calcite come any near to what I wanted like a jdbc driver and a > Server shell which parses and federates the query. > > 2. If it is not in your roadmap, I want to know if you have any ideas on > how I could achieve this. > > 3. Are there any code examples for implementing SQL over tables in mutiple > data sources (as seen in Julian's presentation using Splunk and MySQL) > > I really appreciate your responses. > > Thanks
