nack , Sean Owen
Cc: user
Subject: [EXTERNAL] 回复:Re: 回复:Re: calculate correlation
between_multiple_columns_and_one_specific_column_after_groupby_the_spark_data_frame
CAUTION: This email originated from outside of the organization. Do not click
links or open attachments unless you can confirm
Thanks, Enrico.I just found that I need to group the data frame then calculate
the correlation. So I will get a list of dataframe, not columns. So I used
following solution:use following codes to create a mutable data frame df_all. I
used the first datacol to calculate correlation.