Re: 回复:Re: 回复:Re: calculate correlation between_multiple_columns_and_one_specific_column_after_groupby_the_spark_data_frame

2022-03-16 Thread Lalwani, Jayesh
nack , Sean Owen Cc: user Subject: [EXTERNAL] 回复:Re: 回复:Re: calculate correlation between_multiple_columns_and_one_specific_column_after_groupby_the_spark_data_frame CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you can confirm

回复:Re: 回复:Re: calculate correlation between_multiple_columns_and_one_specific_column_after_groupby_the_spark_data_frame

2022-03-16 Thread ckgppl_yan
Thanks, Enrico.I just found that I need to group the data frame then calculate the correlation. So I will get a list of dataframe, not columns. So I used following solution:use following codes to create a mutable data frame df_all. I used the first datacol to calculate correlation.