RE: Machine Learning on GraphX

2015-06-18 Thread Evo Eftimov
What is GraphX:

 

-  It can be viewed as a kind of Distributed, Parallel, Graph Database

-  It can be viewed as Graph Data Structure (Data Structures 101 from 
your CS course)

-  It features some off the shelve algos for Graph Processing and 
Navigation  (Algos and Data Structures 101) and the implementation of these 
takes advantage of the distributed parallel nature of GrapphX

 

Any of the MLib algos can be applied to ANY data structure from time series to 
graph to matrix/tabular etc – it is up to your needs and imagination 

 

As an example – Clustering – you can apply it to Graph Data Structure BUT you 
may also leverage the Graph inherent connection/clustering properties and Graph 
algos taking advantage of that Instead of e.g. the run of the mill K-Means 
which is ok for te.g. time series, matrix etc data structures

 

From: Timothée Rebours [mailto:t.rebo...@gmail.com] 
Sent: Thursday, June 18, 2015 10:44 AM
To: Akhil Das
Cc: user@spark.apache.org
Subject: Re: Machine Learning on GraphX

 

Thanks for the quick answer.
I've already followed this tutorial but it doesn't use GraphX at all. My goal 
would be to work directly on the graph, and not extracting edges and vertices 
from the graph as standard RDDs and then work on that with the standard MLlib's 
ALS, which has no interest. That's why I tried with the other implementation, 
but it's not optimized at all.

I might have gone in the wrong direction with the ALS, but I'd like to see 
what's possible to do with MLlib on GraphX. Any idea ?

 

2015-06-18 11:19 GMT+02:00 Akhil Das :

This might give you a good start 
http://ampcamp.berkeley.edu/big-data-mini-course/movie-recommendation-with-mllib.html
 its a bit old though.




Thanks

Best Regards

 

On Thu, Jun 18, 2015 at 2:33 PM, texol  wrote:

Hi,

I'm new to GraphX and I'd like to use Machine Learning algorithms on top of
it. I wanted to write a simple program implementing MLlib's ALS on a
bipartite graph (a simple movie recommendation), but didn't succeed. I found
an implementation on Spark 1.1.x
(https://github.com/ankurdave/spark/blob/GraphXALS/graphx/src/main/scala/org/apache/spark/graphx/lib/ALS.scala)
of ALS on GraphX, but it is painfully slow compared to the standard
implementation, and uses the deprecated (in the current version)
PregelVertex class.
Do we expect a new implementation ? Is there a smarter solution to do so ?

Thanks,
Regards,
Timothée Rebours.




--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Machine-Learning-on-GraphX-tp23388.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

 





 

-- 

Timothée Rebours
13, rue Georges Bizet
78380 BOUGIVAL



Re: Machine Learning on GraphX

2015-06-18 Thread andy petrella
I guess that belief propagation could help here (at least, I find the ideas
enough similar), thus this article might be a good start :
http://arxiv.org/pdf/1004.1003.pdf
(it's on my todo list, hence cannot really help further ^^)

On Thu, Jun 18, 2015 at 11:44 AM Timothée Rebours 
wrote:

> Thanks for the quick answer.
> I've already followed this tutorial but it doesn't use GraphX at all. My
> goal would be to work directly on the graph, and not extracting edges and
> vertices from the graph as standard RDDs and then work on that with the
> standard MLlib's ALS, which has no interest. That's why I tried with the
> other implementation, but it's not optimized at all.
>
> I might have gone in the wrong direction with the ALS, but I'd like to see
> what's possible to do with MLlib on GraphX. Any idea ?
>
> 2015-06-18 11:19 GMT+02:00 Akhil Das :
>
>> This might give you a good start
>> http://ampcamp.berkeley.edu/big-data-mini-course/movie-recommendation-with-mllib.html
>> its a bit old though.
>>
>> Thanks
>> Best Regards
>>
>> On Thu, Jun 18, 2015 at 2:33 PM, texol  wrote:
>>
>>> Hi,
>>>
>>> I'm new to GraphX and I'd like to use Machine Learning algorithms on top
>>> of
>>> it. I wanted to write a simple program implementing MLlib's ALS on a
>>> bipartite graph (a simple movie recommendation), but didn't succeed. I
>>> found
>>> an implementation on Spark 1.1.x
>>> (
>>> https://github.com/ankurdave/spark/blob/GraphXALS/graphx/src/main/scala/org/apache/spark/graphx/lib/ALS.scala
>>> )
>>> of ALS on GraphX, but it is painfully slow compared to the standard
>>> implementation, and uses the deprecated (in the current version)
>>> PregelVertex class.
>>> Do we expect a new implementation ? Is there a smarter solution to do so
>>> ?
>>>
>>> Thanks,
>>> Regards,
>>> Timothée Rebours.
>>>
>>>
>>>
>>>
>>> --
>>> View this message in context:
>>> http://apache-spark-user-list.1001560.n3.nabble.com/Machine-Learning-on-GraphX-tp23388.html
>>> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>>>
>>> -
>>> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
>>> For additional commands, e-mail: user-h...@spark.apache.org
>>>
>>>
>>
>
>
> --
> Timothée Rebours
> 13, rue Georges Bizet
> 78380 BOUGIVAL
>


Re: Machine Learning on GraphX

2015-06-18 Thread Timothée Rebours
Thanks for the quick answer.
I've already followed this tutorial but it doesn't use GraphX at all. My
goal would be to work directly on the graph, and not extracting edges and
vertices from the graph as standard RDDs and then work on that with the
standard MLlib's ALS, which has no interest. That's why I tried with the
other implementation, but it's not optimized at all.

I might have gone in the wrong direction with the ALS, but I'd like to see
what's possible to do with MLlib on GraphX. Any idea ?

2015-06-18 11:19 GMT+02:00 Akhil Das :

> This might give you a good start
> http://ampcamp.berkeley.edu/big-data-mini-course/movie-recommendation-with-mllib.html
> its a bit old though.
>
> Thanks
> Best Regards
>
> On Thu, Jun 18, 2015 at 2:33 PM, texol  wrote:
>
>> Hi,
>>
>> I'm new to GraphX and I'd like to use Machine Learning algorithms on top
>> of
>> it. I wanted to write a simple program implementing MLlib's ALS on a
>> bipartite graph (a simple movie recommendation), but didn't succeed. I
>> found
>> an implementation on Spark 1.1.x
>> (
>> https://github.com/ankurdave/spark/blob/GraphXALS/graphx/src/main/scala/org/apache/spark/graphx/lib/ALS.scala
>> )
>> of ALS on GraphX, but it is painfully slow compared to the standard
>> implementation, and uses the deprecated (in the current version)
>> PregelVertex class.
>> Do we expect a new implementation ? Is there a smarter solution to do so ?
>>
>> Thanks,
>> Regards,
>> Timothée Rebours.
>>
>>
>>
>>
>> --
>> View this message in context:
>> http://apache-spark-user-list.1001560.n3.nabble.com/Machine-Learning-on-GraphX-tp23388.html
>> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>>
>> -
>> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
>> For additional commands, e-mail: user-h...@spark.apache.org
>>
>>
>


-- 
Timothée Rebours
13, rue Georges Bizet
78380 BOUGIVAL


Re: Machine Learning on GraphX

2015-06-18 Thread Akhil Das
This might give you a good start
http://ampcamp.berkeley.edu/big-data-mini-course/movie-recommendation-with-mllib.html
its a bit old though.

Thanks
Best Regards

On Thu, Jun 18, 2015 at 2:33 PM, texol  wrote:

> Hi,
>
> I'm new to GraphX and I'd like to use Machine Learning algorithms on top of
> it. I wanted to write a simple program implementing MLlib's ALS on a
> bipartite graph (a simple movie recommendation), but didn't succeed. I
> found
> an implementation on Spark 1.1.x
> (
> https://github.com/ankurdave/spark/blob/GraphXALS/graphx/src/main/scala/org/apache/spark/graphx/lib/ALS.scala
> )
> of ALS on GraphX, but it is painfully slow compared to the standard
> implementation, and uses the deprecated (in the current version)
> PregelVertex class.
> Do we expect a new implementation ? Is there a smarter solution to do so ?
>
> Thanks,
> Regards,
> Timothée Rebours.
>
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/Machine-Learning-on-GraphX-tp23388.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> -
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
>
>