Re: [spark-graphframes]: Generating incorrect edges

2024-05-11 Thread Nijland, J.G.W. (Jelle, Student M-CS)
t(PREFIX_ORG), psf.sha2(df.descr, 256))) return df Hope this email finds someone running into a similar issue in the future. Kind regards, Jelle From: Mich Talebzadeh Sent: Wednesday, May 1, 2024 11:56 AM To: Stephen Coy Cc: Nijland, J.G.W. (Jelle, Stude

Re: [spark-graphframes]: Generating incorrect edges

2024-04-25 Thread Nijland, J.G.W. (Jelle, Student M-CS)
; ).set("spark.driver.host", "127.0.0.1" # ).set("spark.driver.port", "0" ).set("spark.ui.port", "4041" ).set("spark.executor.instances", "1" ).set("spark.executor.cores", "50" ).set("spark.executor.memory",

Re: [spark-graphframes]: Generating incorrect edges

2024-04-24 Thread Nijland, J.G.W. (Jelle, Student M-CS)
___ From: Mich Talebzadeh Sent: Wednesday, April 24, 2024 4:40 PM To: Nijland, J.G.W. (Jelle, Student M-CS) Cc: user@spark.apache.org Subject: Re: [spark-graphframes]: Generating incorrect edges OK few observations 1) ID Generation Method: How are you generating unique IDs (UUIDs, seque

[spark-graphframes]: Generating incorrect edges

2024-04-24 Thread Nijland, J.G.W. (Jelle, Student M-CS)
tags: pyspark,spark-graphframes Hello, I am running pyspark in a podman container and I have issues with incorrect edges when I build my graph. I start with loading a source dataframe from a parquet directory on my server. The source dataframe has the following columns: