Re: very fast loading of very big table

Taras Ledkov Fri, 19 Feb 2021 04:51:11 -0800

Hi Vladimir,

Did you try to use SQL command 'COPY FROM <csv_file>' via thin JDBC?

This command uses 'IgniteDataStreamer' to write data into cluster andparse CSV on the server node.


PS. AFAIK IgniteDataStreamer is one of the fastest ways to load data.

Hi Denis,

Data space is 3.7Gb according to MSSQL table properries

Vladimir

9:47, 19 февраля 2021 г., Denis Magda <[email protected]>:

    Hello Vladimir,

    Good to hear from you! How much is that in gigabytes?

    -
    Denis


    On Thu, Feb 18, 2021 at 10:06 PM <[email protected]
    <mailto:[email protected]>> wrote:

        Sep 2020 I've published the paper about Loading Large Datasets
        into Apache Ignite by Using a Key-Value API (English [1] and
        Russian [2] version). The approach described works in
        production, but shows inacceptable perfomance for very large
        tables.

        The story continues, and yesterday I've finished the proof of
        concept for very fast loading of very big table. The
        partitioned MSSQL table about 295 million rows was loaded by
        the 4-node Ignite cluster in 3 min 35 sec. Each node had
        executed its own SQL queries in parallel and then distributed
        the loaded values across the other cluster nodes.

        Probably that result will be of interest for the community.

        Regards,
        Vladimir Chernyi

        [1]
        
https://www.gridgain.com/resources/blog/how-fast-load-large-datasets-apache-ignite-using-key-value-api
        
<https://www.gridgain.com/resources/blog/how-fast-load-large-datasets-apache-ignite-using-key-value-api>
        [2] https://m.habr.com/ru/post/526708/
        <https://m.habr.com/ru/post/526708/>



--
Отправлено из мобильного приложения Яндекс.Почты


--
Taras Ledkov
Mail-To: [email protected]

Re: very fast loading of very big table

Reply via email to