Hi,
I have been working on integrating WEKA into Drill to support building and 
scoring classification models. I have been successful in supporting all WEKA 
classifiers and making them run in a distributed fashion over Drill 1.2. The 
classifier accuracy is not affected by running in a distributed fashion and the 
training and scoring times are getting a huge boost using Drill. A paper on 
this has been published in the IEEE symposium on Big Data in June 2016 
[available: 
http://cs.queensu.ca/~khalifa/qdrill/QDrill_20160212IEEE_CameraReady.pdf] and 
we are now in the process of publishing another paper in which QDrill supports 
all WEKA algorithms. FYI, this can be easily extended to support clustering and 
other types of WEKA algorithms. The architecture also allows supporting other 
data mining libraries.
The QDrill project website is  http://cs.queensu.ca/~khalifa/qdrill, the 
project downloadable version on it is little bit old but I'm planning to upload 
a more updated stable version within the next 10 days. I'm also using an SVN 
repository and planning to move the project to GitHub to make it easier to get 
the latest Drill versions and to may be integrate with Drill at some point. 
Unfortunately, I have another meeting tomorrow at the same time of the hangout, 
but I would love to know your opinion and to discuss the process of evaluating 
this extension and may be integrating it with Drill at some point. 
Regards
Shadi KhalifaPhD CandidateSchool of Computing Queen's University Canada
I'm just a neuron in the society collective brain

01001001 00100000 01101100 01101111 01110110 01100101 00100000 01000101 
01100111 01111001 01110000 01110100 
P Please consider your environmental responsibility before printing this e-mail

 

    On Monday, October 3, 2016 10:52 PM, Laurent Goujon <[email protected]> 
wrote:
 

 Hi,

I'm currently working on improving metadata support for both the JDBC
driver and the C++ connector, more specifically the following JIRAs:

DRILL-4853: Update C++ protobuf source files
DRILL-4420: Server-side metadata and prepared-statement support for C++
connector
DRILL-4880: Support JDBC driver registration using ServiceLoader
DRILL-4925: Add tableType filter to GetTables metadata query
DRILL-4730: Update JDBC DatabaseMetaData implementation to use new Metadata
APIs

I  already opened multiple pull requests for those (the list is available
at https://github.com/apache/drill/pulls/laurentgo)

I'm planning to join tomorrow hangout in case people have questions about
those.

Cheers,

Laurent

On Mon, Oct 3, 2016 at 10:28 AM, Subbu Srinivasan <[email protected]>
wrote:

> Can we close on https://github.com/apache/drill/pull/518 ?
>
> On Mon, Oct 3, 2016 at 10:27 AM, Sudheesh Katkam <[email protected]>
> wrote:
>
> > Hi drillers,
> >
> > Our bi-weekly hangout is tomorrow (10/04/16, 10 AM PT). If you have any
> > suggestions for hangout topics, you can add them to this thread. We will
> > also ask around at the beginning of the hangout for topics.
> >
> > Thank you,
> > Sudheesh
> >
>


   

Reply via email to