Hi Guys -

I'm having trouble figuring out the semantics for using the alias function on
the final sum and count aggregations. How do I give the aggregated columns in
the output below readable names?

>>> cool_summary = reviews.select(reviews.user_id,
...     cool_cnt("votes.cool").alias("cool_cnt")).groupBy("user_id").agg(
...     {"cool_cnt": "sum", "*": "count"})

>>> cool_summary

DataFrame[user_id: string, SUM(cool_cnt#725): double, COUNT(1): bigint]
