Hello,

In short, Ignite persistence primary advantages are:

   - Enables multi-tiered storage across RAM and disk - 100% of data
   persisted to disk ("warm" and "cold" data sets) while "hot" data always
   stays in RAM. You lay out the data the way you need. Applications just run
   queries and Ignite internally goes either to RAM or disk, transparently.
   This property is crucial for real-time analytics and data warehousing
   offloading.
   - Instantaneous restarts and advanced high-availability - in case of
   full cluster restarts, the cluster becomes fully operational as soon as the
   nodes are interconnected. No need to preload anything from disk to RAM
   (based on the advantage above).

Now, talking about Ignite and data warehousing. Ignite is used for
real-time analytics and Hadoop offloading. But don't treat it as a Hadoop
replacement or a solution for data warehousing. Ignite is used together
with Hadoop but deployed separately. Here are things to consider:

   - Use Ignite for business operations/computations that require
   low-latency response time (seconds or milliseconds) and high-throughput.
   Preload data to Ignite cluster needed for this computations. Enabled Ignite
   persistence for the sake of durability.
   - Keep using Hadoop for high-latency workloads (minutes and hours) and
   batch processing.
   - APIs: modify your applications to ensure that Ignite APIs are used for
   Ignite cluster access (SQL, compute grid, ML). Spark can be used as a
   generic API that can connect to both Hadoop and Ignite and run joins across
   2 storages (use DataFrames).
   - Tooling: data preloading from Hadoop, bi-directional synchronization,
   advanced Spark integration, etc. - reach out GridGain, they have been
   working on a special Hadoop pack.

Hope it helps.

-
Denis


On Tue, May 14, 2019 at 3:20 PM m4mmr <rmam...@gmail.com> wrote:

> Hi,
>
> I am in a project where we are building a new database with strong
> normalisation requirements - very much like a relational data warehouse.
> We
> source the data from HDFS. And the maintenance team requires the data
> movement to be implemented through SQL APIs.
>
> Main Question: Would it be a viable use case to use Ignite with native
> persistence store as storage when building a relational data warehouse?  So
> that we do not have to manage both a RDBMS and Ignite.
>
> I don’t fully see  a clear picture of what the limitations would be of
> using
> native persistence instead of RDBMS for persistence - but I have not seen a
> single use case where someone use native for relational data warehouses
> either.
>
> Again - I think I am missing some basic understanding of the native
> persistence here - even after reading through the docs I could find. So
> would be happy if someone could shed some light on it.
>
> Would be very thankful for all type of help/assistance!
>
>
>
> --
> Sent from: http://apache-ignite-users.70518.x6.nabble.com/
>

Reply via email to