No matches were found for subject:"\[pyspark 2.3\+\] count distinct returns different value every time it is run on the same dataset"