No, the API for Access to data in the EventServer does not require dataframes 
and so does not use them but you can easily convert into one if you need it. As 
to SparkML, use whatever you need in your algorithm. There are no restrictions 
as long as you build PIO for Spark 2 and include whatever libs you need in your 
Template’s build.sbt. 

I maintain The Universal Recommender, which uses Mahout on Spark, not MLlib. It 
also does not use Spark for deployed query serving, which is typical of many 
Templates. So there is room to use your own architecture as long as it fits the 
general patterns.

https://github.com/actionml/universal-recommender 
<https://github.com/actionml/universal-recommender>


On Apr 19, 2017, at 11:28 PM, Fangzhou Yang <fangzhou.y...@hotmail.com> wrote:

Thanks for the reply. 

As I understand, the template algorithm uses PAlgorithm interface from PIO, 
which are using RDD instead of DataFrame. Can I also implement a template 
algorithm with SparkML and DataFrame? Is there any guide online? 

@Pat Ferrel <mailto:p...@occamsmachete.com> Is the template that you 
maintaining on the github? If yes, could you provide the link?

Many Thanks,
Fangzhou 
From: Pat Ferrel <p...@occamsmachete.com>
Sent: Wednesday, April 19, 2017 10:37:08 PM
To: user@predictionio.incubator.apache.org
Subject: Re: Dose v.011 support Spark ML, DataFrame and Pipeline
 
There is no restriction in templates for what they use of Spark. The ones you 
are looking at simply don’t need those interfaces. If you need them and are 
writing templates you can use them. In fact I maintain a template that does not 
use Spark for the Algorithm, only for IO.

If you think some new API should be in the default PIO API which would that be?


On Apr 19, 2017, at 12:06 AM, Fangzhou Yang <fangzhou.y...@hotmail.com 
<mailto:fangzhou.y...@hotmail.com>> wrote:

Hi all,


I'm new to predictionio. I just noticed that v0.11 can already support Spark 
2.x, can it also currently support Spark ML, DataFrame and Pipeline. It seems 
the Algorithm interfaces support only Spark RDD. If SparkML is not supported 
for now, will it be on the roadmap? Are there anyone already work on it?  

Many Thanks,
Fangzhou

Reply via email to