Re: [spark-graphframes]: Generating incorrect edges

2024-05-11 Thread Nijland, J.G.W. (Jelle, Student M-CS)
nt M-CS) ; user@spark.apache.org Subject: Re: [spark-graphframes]: Generating incorrect edges Hi Steve, Thanks for your statement. I tend to use uuid myself to avoid collisions. This built-in function generates random IDs that are highly likely to be unique across systems. My concerns are

Re: [spark-graphframes]: Generating incorrect edges

2024-05-01 Thread Mich Talebzadeh
Hi Steve, Thanks for your statement. I tend to use uuid myself to avoid collisions. This built-in function generates random IDs that are highly likely to be unique across systems. My concerns are on edge so to speak. If the Spark application runs for a very long time or encounters restarts, the

Re: [spark-graphframes]: Generating incorrect edges

2024-04-30 Thread Stephen Coy
Hi Mich, I was just reading random questions on the user list when I noticed that you said: On 25 Apr 2024, at 2:12 AM, Mich Talebzadeh wrote: 1) You are using monotonically_increasing_id(), which is not collision-resistant in distributed environments like Spark. Multiple hosts can

Re: [spark-graphframes]: Generating incorrect edges

2024-04-25 Thread Nijland, J.G.W. (Jelle, Student M-CS)
"128G" ).set("spark.executor.memoryOverhead", "32G" ).set("spark.driver.cores", "16" ).set("spark.driver.memory", "64G" ) I dont think b) applies as its a single machine. Kind regards, Jelle Fr

Re: [spark-graphframes]: Generating incorrect edges

2024-04-24 Thread Mich Talebzadeh
o 100K records. > Once I go past that amount of records the results become inconsistent and > incorrect. > > Kind regards, > Jelle Nijland > > > -- > *From:* Mich Talebzadeh > *Sent:* Wednesday, April 24, 2024 4:40 PM > *To:* Nijland, J.G.W. (Jelle, Student M-CS) < > j.g.

Re: [spark-graphframes]: Generating incorrect edges

2024-04-24 Thread Nijland, J.G.W. (Jelle, Student M-CS)
___ From: Mich Talebzadeh Sent: Wednesday, April 24, 2024 4:40 PM To: Nijland, J.G.W. (Jelle, Student M-CS) Cc: user@spark.apache.org Subject: Re: [spark-graphframes]: Generating incorrect edges OK few observations 1) ID Generation Method: How are you generating unique IDs (UUIDs, seque

Re: [spark-graphframes]: Generating incorrect edges

2024-04-24 Thread Mich Talebzadeh
jl...@student.utwente.nl> wrote: > tags: pyspark,spark-graphframes > > Hello, > > I am running pyspark in a podman container and I have issues with > incorrect edges when I build my graph. > I start with loading a source dataframe from a parquet directory on my &

[spark-graphframes]: Generating incorrect edges

2024-04-24 Thread Nijland, J.G.W. (Jelle, Student M-CS)
tags: pyspark,spark-graphframes Hello, I am running pyspark in a podman container and I have issues with incorrect edges when I build my graph. I start with loading a source dataframe from a parquet directory on my server. The source dataframe has the following columns

spark-graphframes

2017-07-10 Thread Dennis Grinwald
Hello GraphFrame-community, our company is very interested in using GraphFrames for large enterprise tools. Therefore I would like to ask a few questions regarding the architecture of GraphFrames: 1. In GraphFrames Quick-Start Guide it says that it's built on top of SparkSQL. Is the

Re: Graph Analytics on HBase with HGraphDB and Spark GraphFrames

2017-04-03 Thread Weiqing Yang
il.com> > wrote: > >> Interesting! >> >> -- >> *From:* Robert Yokota <rayok...@gmail.com> >> *Sent:* Sunday, April 2, 2017 9:40:07 AM >> *To:* user@spark.apache.org >> *Subject:* Graph Analytics on HBase with HGraphDB and Spark GraphFrames >

Re: Graph Analytics on HBase with HGraphDB and Spark GraphFrames

2017-04-02 Thread Irving Duran
t; *To:* user@spark.apache.org > *Subject:* Graph Analytics on HBase with HGraphDB and Spark GraphFrames > > Hi, > > In case anyone is interested in analyzing graphs in HBase with Apache > Spark GraphFrames, this might be helpful: > > https://yokota.blog/2017/04/02/graph-analytics-on-hbase-with > -hgraphdb-and-spark-graphframes/ >

Re: Graph Analytics on HBase with HGraphDB and Spark GraphFrames

2017-04-02 Thread Felix Cheung
Interesting! From: Robert Yokota <rayok...@gmail.com> Sent: Sunday, April 2, 2017 9:40:07 AM To: user@spark.apache.org Subject: Graph Analytics on HBase with HGraphDB and Spark GraphFrames Hi, In case anyone is interested in analyzing graphs in HBase with

Graph Analytics on HBase with HGraphDB and Spark GraphFrames

2017-04-02 Thread Robert Yokota
Hi, In case anyone is interested in analyzing graphs in HBase with Apache Spark GraphFrames, this might be helpful: https://yokota.blog/2017/04/02/graph-analytics-on-hbase- with-hgraphdb-and-spark-graphframes/

Spark GraphFrames generic question

2016-12-16 Thread Ankur Srivastava
Hi I am working on two different use cases where the basic problem is same but scale is very different. In case 1 we have two entities that can have many to many relation and we would want to identify all subgraphs in the full graph and then further prune the graph to find the best relation.

Re: Spark GraphFrames

2016-08-02 Thread Denny Lee
To:Divya Gehlot <divya.htco...@gmail.com> > Cc:"user @spark" <user@spark.apache.org> > Date:2016/08/02 17:06 > Subject:Re: Spark GraphFrames > -- > > > > Hi, > Kay wrot

Re: Spark GraphFrames

2016-08-02 Thread Kazuaki Ishizaki
rk.apache.org> Date: 2016/08/02 17:06 Subject: Re: Spark GraphFrames Hi, Kay wrote a procedure to use GraphFrames with Spark. https://gist.github.com/kayousterhout/7008a8ebf2babeedc7ce6f8723fd1bf4 Kazuaki Ishizaki From:Divya Gehlot <divya.htco...@gmail.com> To

Re: Spark GraphFrames

2016-08-02 Thread Kazuaki Ishizaki
Hi, Kay wrote a procedure to use GraphFrames with Spark. https://gist.github.com/kayousterhout/7008a8ebf2babeedc7ce6f8723fd1bf4 Kazuaki Ishizaki From: Divya Gehlot <divya.htco...@gmail.com> To: "user @spark" <user@spark.apache.org> Date: 2016/08/02 14:52

Spark GraphFrames

2016-08-01 Thread Divya Gehlot
Hi, Has anybody has worked with GraphFrames. Pls let me know as I need to know the real case scenarios where It can used . Thanks, Divya