Hi Vladimir,
Did you try to use SQL command 'COPY FROM <csv_file>' via thin JDBC?
This command uses 'IgniteDataStreamer' to write data into cluster and
parse CSV on the server node.
PS. AFAIK IgniteDataStreamer is one of the fastest ways to load data.
Hi Denis,
Data space is 3.7Gb according to MSSQL table properries
Vladimir
9:47, 19 февраля 2021 г., Denis Magda <[email protected]>:
Hello Vladimir,
Good to hear from you! How much is that in gigabytes?
-
Denis
On Thu, Feb 18, 2021 at 10:06 PM <[email protected]
<mailto:[email protected]>> wrote:
Sep 2020 I've published the paper about Loading Large Datasets
into Apache Ignite by Using a Key-Value API (English [1] and
Russian [2] version). The approach described works in
production, but shows inacceptable perfomance for very large
tables.
The story continues, and yesterday I've finished the proof of
concept for very fast loading of very big table. The
partitioned MSSQL table about 295 million rows was loaded by
the 4-node Ignite cluster in 3 min 35 sec. Each node had
executed its own SQL queries in parallel and then distributed
the loaded values across the other cluster nodes.
Probably that result will be of interest for the community.
Regards,
Vladimir Chernyi
[1]
https://www.gridgain.com/resources/blog/how-fast-load-large-datasets-apache-ignite-using-key-value-api
<https://www.gridgain.com/resources/blog/how-fast-load-large-datasets-apache-ignite-using-key-value-api>
[2] https://m.habr.com/ru/post/526708/
<https://m.habr.com/ru/post/526708/>
--
Отправлено из мобильного приложения Яндекс.Почты
--
Taras Ledkov
Mail-To: [email protected]