RE: using spark to load a data warehouse in real time

2017-03-07 Thread Adaryl Wakefield
t.net<http://www.massstreet.net> www.linkedin.com/in/bobwakefieldmba<http://www.linkedin.com/in/bobwakefieldmba> Twitter: @BobLovesData From: Henry Tremblay [mailto:paulhtremb...@gmail.com] Sent: Tuesday, February 28, 2017 3:56 PM To: user@spark.apache.org Subject: Re: using spark to load a data

RE: using spark to load a data warehouse in real time

2017-03-04 Thread Adaryl Wakefield
/in/bobwakefieldmba> Twitter: @BobLovesData From: Sam Elamin [mailto:hussam.ela...@gmail.com] Sent: Wednesday, March 1, 2017 2:29 AM To: Adaryl Wakefield ; Jörn Franke Cc: user@spark.apache.org Subject: Re: using spark to load a data warehouse in real time Hi Adaryl Having come from a W

RE: using spark to load a data warehouse in real time

2017-03-04 Thread Adaryl Wakefield
yl Wakefield Cc: user@spark.apache.org Subject: Re: using spark to load a data warehouse in real time I am not sure that Spark Streaming is what you want to do. It is for streaming analytics not for loading in a DWH. You need also define what realtime means and what is needed there - it will differ

Re: using spark to load a data warehouse in real time

2017-03-01 Thread Sam Elamin
lto:donta...@gmail.com ] *Sent:* Tuesday, February 28, 2017 12:57 PM *To:* Adaryl Wakefield *Cc:* user@spark.apache.org *Subject:* Re: using spark to load a data warehouse in real time Hi Adaryl, You could definitely load data into a warehouse through Spark's JDBC support through DataFram

Re: using spark to load a data warehouse in real time

2017-02-28 Thread Jörn Franke
Mohammad Tariq [mailto:donta...@gmail.com] > Sent: Tuesday, February 28, 2017 12:57 PM > To: Adaryl Wakefield > Cc: user@spark.apache.org > Subject: Re: using spark to load a data warehouse in real time > > Hi Adaryl, > > You could definitely load data into a warehous

RE: using spark to load a data warehouse in real time

2017-02-28 Thread Adaryl Wakefield
ad Tariq [mailto:donta...@gmail.com] Sent: Tuesday, February 28, 2017 12:57 PM To: Adaryl Wakefield Cc: user@spark.apache.org Subject: Re: using spark to load a data warehouse in real time Hi Adaryl, You could definitely load data into a warehouse through Spark's JDBC support through DataFrame

Re: using spark to load a data warehouse in real time

2017-02-28 Thread Henry Tremblay
[mailto:femib...@gmail.com <mailto:femib...@gmail.com>] *Sent:* Tuesday, February 28, 2017 4:13 AM *To:* Adaryl Wakefield mailto:adaryl.wakefi...@hotmail.com>> *Cc:* user@spark.apache.org <mailto:user@spark.apache.org> *Subject:* Re: usi

Re: using spark to load a data warehouse in real time

2017-02-28 Thread Mohammad Tariq
SELECT statements, no INSERTS or MERGE >> statements. >> >> >> >> Adaryl "Bob" Wakefield, MBA >> Principal >> Mass Street Analytics, LLC >> 913.938.6685 >> >> www.massstreet.net >> >> www.linkedin.com/in/bobwakefieldmba >

Re: using spark to load a data warehouse in real time

2017-02-28 Thread Mohammad Tariq
bwakefieldmba > Twitter: @BobLovesData > > > > *From:* Femi Anthony [mailto:femib...@gmail.com] > *Sent:* Tuesday, February 28, 2017 4:13 AM > *To:* Adaryl Wakefield > *Cc:* user@spark.apache.org > *Subject:* Re: using spark to load a data warehouse in real

RE: using spark to load a data warehouse in real time

2017-02-28 Thread Adaryl Wakefield
ail.com] Sent: Tuesday, February 28, 2017 4:13 AM To: Adaryl Wakefield Cc: user@spark.apache.org Subject: Re: using spark to load a data warehouse in real time Have you checked to see if there are any drivers to enable you to write to Greenplum directly from Spark ? You can also take a loo

Re: using spark to load a data warehouse in real time

2017-02-28 Thread Femi Anthony
Have you checked to see if there are any drivers to enable you to write to Greenplum directly from Spark ? You can also take a look at this link: https://groups.google.com/a/greenplum.org/forum/m/#!topic/gpdb-users/lnm0Z7WBW6Q Apparently GPDB is based on Postgres so maybe that approach may work

using spark to load a data warehouse in real time

2017-02-27 Thread Adaryl Wakefield
Is anybody using Spark streaming/SQL to load a relational data warehouse in real time? There isn't a lot of information on this use case out there. When I google real time data warehouse load, nothing I find is up to date. It's all turn of the century stuff and doesn't take into account advancem