spark git commit: [SPARK-22355][SQL] Dataset.collect is not threadsafe

2017-10-26 Thread lixiao
Repository: spark
Updated Branches: refs/heads/branch-2.2 a607ddc52 -> 2839280ad

[SPARK-22355][SQL] Dataset.collect is not threadsafe

It's possible that users create a `Dataset`, and call `collect` of this `Dataset` in many threads at the same time. Currently `Dataset#collect` just call

spark git commit: [SPARK-22355][SQL] Dataset.collect is not threadsafe

2017-10-26 Thread lixiao
Repository: spark
Updated Branches: refs/heads/master 9b262f6a0 -> 5c3a1f3fa

[SPARK-22355][SQL] Dataset.collect is not threadsafe

## What changes were proposed in this pull request?

It's possible that users create a `Dataset`, and call `collect` of this `Dataset` in many threads at the same
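
The scenario described in the commit message (one `Dataset` shared by several threads that all call `collect` at once) can be sketched roughly as follows. This is only an illustration of the usage pattern, not code from the pull request: the object name `ConcurrentCollectSketch`, the helper `collectConcurrently`, the thread count, and the sample data are all hypothetical, and it assumes a local Spark installation on the classpath.

```scala
import java.util.concurrent.Executors

import scala.concurrent.{Await, ExecutionContext, Future}
import scala.concurrent.duration._

import org.apache.spark.sql.SparkSession

// Hypothetical sketch: share one Dataset across threads and call collect() concurrently.
object ConcurrentCollectSketch {

  // Returns true if every concurrent collect() produced the same rows
  // as a single-threaded collect() on the same Dataset.
  def collectConcurrently(threads: Int): Boolean = {
    val spark = SparkSession.builder()
      .master("local[4]")                 // assumption: local mode, for illustration only
      .appName("concurrent-collect-sketch")
      .getOrCreate()
    import spark.implicits._

    // One Dataset instance, created once and shared by all threads.
    val ds = (1 to 1000).map(i => (i, i.toString)).toDS()

    // Baseline result, collected on the calling thread before any concurrency.
    val expected = ds.collect().toSeq

    val pool = Executors.newFixedThreadPool(threads)
    implicit val ec: ExecutionContext = ExecutionContext.fromExecutorService(pool)

    try {
      // Each Future calls collect() on the same shared Dataset.
      val futures = (1 to threads).map(_ => Future(ds.collect().toSeq))
      val results = Await.result(Future.sequence(futures), 2.minutes)
      results.forall(_ == expected)
    } finally {
      pool.shutdown()
      spark.stop()
    }
  }

  def main(args: Array[String]): Unit = {
    val consistent = collectConcurrently(threads = 8)
    println(s"all threads saw the same rows: $consistent")
  }
}
```

Because the problem is a race, a single run on an affected build may still happen to report consistent results; the sketch shows the call pattern rather than a reliable reproduction.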