Hi,

It’s very interesting. +1 to create a Jira and prepare a PR for review.

Thanks !
Regards
JB

> Le 23 mars 2020 à 15:23, Katarzyna Kucharczyk <ka.kucharc...@gmail.com> a 
> écrit :
> 
> Hi all,
> 
> Me and my colleagues have developed a new Java connector for Snowflake that 
> we would like to add to Beam.
> 
> Snowflake is an analytic data warehouse provided as Software-as-a-Service 
> (SaaS). It uses a new SQL database engine with a unique architecture designed 
> for the cloud. To read more details please check [1] and [2].
> 
> Proposed Snowflake IOs use JDBC Snowflake library [3]. The IOs are batch 
> write and batch read that use the Snowflake COPY [4] operation underneath. In 
> both cases ParDo IOs load files on a stage and then they are inserted into 
> the Snowflake table of choice using the COPY API. The currently supported 
> stage is Google Cloud Storage[5].
> 
> The schema how Snowflake Read IO works (write operation works similarly but 
> in opposite direction):
> 
> 
> 
> Here is an Apache Beam fork [6] with current work of the Snowflake IO.
> 
> In the near future we would like to also add IO for writing streams which 
> will use SnowPipe - Snowflake mechanism for continuous loading[7]. Also, we 
> would like to use cross language to provide Python connectors as well.
> 
> We are open for all opinions and suggestions. In case of any 
> questions/comments please do not hesitate to post them.
> 
> In case of no objection I will create jira tickets and share them in this 
> thread.
> 
> Cheers,
> Kasia
> 
> [1] https://www.snowflake.com <https://www.snowflake.com/> 
> [2] https://docs.snowflake.net/manuals/user-guide/intro-key-concepts.html 
> <https://docs.snowflake.net/manuals/user-guide/intro-key-concepts.html> 
> [3] https://docs.snowflake.net/manuals/user-guide/jdbc.html 
> <https://docs.snowflake.net/manuals/user-guide/jdbc.html> 
> [4] https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html 
> <https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html> 
> [5] 
> https://github.com/PolideaInternal/beam/tree/snowflake-io/sdks/java/io/snowflake
>  
> <https://github.com/PolideaInternal/beam/tree/snowflake-io/sdks/java/io/snowflake>
>  
> [6] https://cloud.google.com/storage <https://cloud.google.com/storage> 
> [7] https://docs.snowflake.net/manuals/user-guide/data-load-snowpipe.html 
> <https://docs.snowflake.net/manuals/user-guide/data-load-snowpipe.html> 
> 

Reply via email to