Weihua Jiang created FLINK-11096:
------------------------------------

             Summary: Create a new table based flink ML package
                 Key: FLINK-11096
                 URL: https://issues.apache.org/jira/browse/FLINK-11096
             Project: Flink
          Issue Type: Sub-task
          Components: Machine Learning Library, Table API & SQL
            Reporter: Weihua Jiang


Currently,  the DataSet based ML library is under org.apache._flink.ml_ scala 
package and under _flink-libraries/flink-ml directory._

 

There are two questions related to packaging:
 # Shall we create a new scala/java package, e.g. org.apache.flink.table.ml? Or 
still stay in org.apache.flink.ml?
 # Shall we still put new code in flink-libraries/flink-ml directory or create 
a new one, e.g. flink-libraries/flink-table-ml and corresponding maven package?

 

I implemented a prototype for the design and found that the new design is very 
hard to fit into existing flink.ml codebase. The existing flink.ml code is 
tightly coupled with DataSet API. Thus, I have to rewrite almost all parts of 
flink.ml to get some sample case to work. The only reusable code from flink.ml 
are the base math classes under _org.apache.flink.ml.math_ and 
_org.apache.flink.ml.metrics.distance_ packages. 

Considering this fact, I will prefer to create a new package 
org.apache.flink.table.ml and a new maven package flink-table-ml.

 

Please feel free to give your feedbacks. 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to