Hi,

depending on what you're trying to achieve `RDD.toLocalIterator()` might
help you.

Best

Eike


2017-03-29 21:00 GMT+02:00 szep.laszlo.it <szep.laszlo...@gmail.com>:

> Hi,
>
> after I created a dataset
>
> Dataset<Row> df = sqlContext.sql("query");
>
> I need to have a result values and I call a method: collectAsList()
>
> List<Row> list = df.collectAsList();
>
> But it's very slow, if I work with large datasets (20-30 million records).
> I
> know, that the result isn't presented in driver app, that's why it takes
> long time, because collectAsList() collect all data from worker nodes.
>
> But then what is the right way to get result values? Is there an other
> solution to iterate over a result dataset rows, or get values? Can anyone
> post a small & working example?
>
> Thanks & Regards,
> Laszlo Szep
>

Reply via email to