[jira] [Comment Edited] (SPARK-1153) Generalize VertexId in GraphX so that UUIDs can be used as vertex IDs.

2016-11-09 Thread Nicholas Tietz (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15651453#comment-15651453
 ] 

Nicholas Tietz edited comment on SPARK-1153 at 11/9/16 5:04 PM:


The decision we eventually made was to migrate as much of our code out of 
GraphX as we could (moving to writing more directly in Spark). We were running 
into other potential performance issues with GraphX and we could not do the 
kind of checkpointing we wanted to, so it was a workable solution for us. We 
wound up with minimal GraphX code (and when I left the company we were close to 
being able to remove ALL of it).

At the end, we just dealt with the pain of managing consistent IDs ourselves 
and joining them in. It was not ideal, but it worked and the performance hit 
was made up for in other areas where we were able to migrate off of GraphX.


was (Author: ntietz):
The decision we eventually made was to migrate as much of our code out of 
GraphX as we could (moving to writing more directly in Spark). We were running 
into other potential performance issues with GraphX and we could not do the 
kind of checkpointing we wanted to, so it was a workable solution for us.

At the end, we just dealt with the pain of managing consistent IDs ourselves 
and joining them in. It was not ideal, but it worked and the performance hit 
was made up for in other areas where we were able to migrate off of GraphX.

> Generalize VertexId in GraphX so that UUIDs can be used as vertex IDs.
> --
>
> Key: SPARK-1153
> URL: https://issues.apache.org/jira/browse/SPARK-1153
> Project: Spark
>  Issue Type: Improvement
>  Components: GraphX
>Affects Versions: 0.9.0
>Reporter: Deepak Nulu
>
> Currently, {{VertexId}} is a type-synonym for {{Long}}. I would like to be 
> able to use {{UUID}} as the vertex ID type because the data I want to process 
> with GraphX uses that type for its primay-keys. Others might have a different 
> type for their primary-keys. Generalizing {{VertexId}} (with a type class) 
> will help in such cases.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Comment Edited] (SPARK-1153) Generalize VertexId in GraphX so that UUIDs can be used as vertex IDs.

2014-05-28 Thread npanj (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14010846#comment-14010846
 ] 

npanj edited comment on SPARK-1153 at 5/28/14 6:48 AM:
---

An alternative approach, that I have been using: 
1 Use a preprocessing step that maps UUID to an Long.
2. Build graph based on Longs

For Mapping in step 1:
- Rank your uuids.
- some kind of has function?

For 1, graphx can provide a tool to generate map.

I will like to hear how others are building graphs out of non-Long node types.





was (Author: npanj):
An alternative approach, that I have been using: 
1 Use a preprocessing step that maps UUID to an Long.
2. Build graph based on Longs

For Mapping in step 1:
- Rank your uuids.
- some kind of has function?

For 1, graphx can provide a tool to generate map.

I will like to hear how others are building graphs out of non-Long node types




 Generalize VertexId in GraphX so that UUIDs can be used as vertex IDs.
 --

 Key: SPARK-1153
 URL: https://issues.apache.org/jira/browse/SPARK-1153
 Project: Spark
  Issue Type: Improvement
  Components: GraphX
Affects Versions: 0.9.0
Reporter: Deepak Nulu

 Currently, {{VertexId}} is a type-synonym for {{Long}}. I would like to be 
 able to use {{UUID}} as the vertex ID type because the data I want to process 
 with GraphX uses that type for its primay-keys. Others might have a different 
 type for their primary-keys. Generalizing {{VertexId}} (with a type class) 
 will help in such cases.



--
This message was sent by Atlassian JIRA
(v6.2#6252)