[ 
https://issues.apache.org/jira/browse/PIO-38?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15525227#comment-15525227
 ] 

Kenneth Chan commented on PIO-38:
---------------------------------

Assuming you don't need the Event Server's features, the easiest approach is 
simply not to run the PIO Event Server. You can then write Spark code in the 
template's DataSource.scala component to query your own data store, and remove 
all references to the Event Server from the template code.
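
As a rough illustration of that approach, here is a minimal sketch of a 
DataSource.scala that reads Parquet files directly with Spark instead of 
going through the Event Server. It is not taken from any official template. 
The parquetPath parameter, the ViewEvent and Query types, and the "user" and 
"item" column names are all hypothetical. It also assumes a Spark 2.x build 
(for SparkSession) and the org.apache.predictionio package name; older 
releases used io.prediction and would need SQLContext instead.

```scala
import org.apache.predictionio.controller.{EmptyActualResult, EmptyEvaluationInfo, PDataSource, Params}
import org.apache.spark.SparkContext
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.SparkSession

// Hypothetical parameter: location of the Parquet files, supplied via engine.json.
case class DataSourceParams(parquetPath: String) extends Params

// Hypothetical record and query types, for illustration only.
case class ViewEvent(user: String, item: String)
case class Query(user: String, num: Int)

class TrainingData(val viewEvents: RDD[ViewEvent]) extends Serializable

class DataSource(val dsp: DataSourceParams)
  extends PDataSource[TrainingData, EmptyEvaluationInfo, Query, EmptyActualResult] {

  override def readTraining(sc: SparkContext): TrainingData = {
    // Reuse the SparkContext that PredictionIO passes in (assumes Spark 2.x).
    val spark = SparkSession.builder().config(sc.getConf).getOrCreate()

    // Read the Parquet data directly; no Event Server or HBase involved.
    val df = spark.read.parquet(dsp.parquetPath)

    // Assumed schema: string columns named "user" and "item".
    val viewEvents = df.select("user", "item").rdd
      .map(row => ViewEvent(row.getString(0), row.getString(1)))

    new TrainingData(viewEvents)
  }
}
```

With something like this in place, the template no longer needs an appName 
or any event store calls in its DataSource, so "pio train" can run without 
the Event Server as long as the Parquet path is readable from Spark.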



> add Apache Parquet as a data source
> -----------------------------------
>
>                 Key: PIO-38
>                 URL: https://issues.apache.org/jira/browse/PIO-38
>             Project: PredictionIO
>          Issue Type: New Feature
>            Reporter: Wojciech Indyk
>              Labels: features
>
> Apache Parquet (https://parquet.apache.org/) is a columnar storage format with 
> native support in Apache Spark, and it is well suited to storing batch data as 
> input for a PredictionIO engine.
> Parquet is widely used to archive clickstream data, so supporting it would let 
> users run PredictionIO without additionally importing (and duplicating) the 
> data into HBase.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
