Re: Apex DataFrame

2016-03-25 Thread Herger, Brendan
David, You bring up a great point. This hits on both short-term and long-term goals. For version 1.0 I’ve purposely not included a DAG, because the DAG Planner trivially translates each operation (basically each line in the example) into one operator. Additionally, every operator has one input port fo

Re: Apex DataFrame

2016-03-23 Thread David Yan
Hi Brendan, Can you draw the resulting DAG for the DataFrame pseudo-code in the document? I'd assume there will be a file input operator, a CSV parser operator, a filter operator (for price != null and datetime > 1992/1/24), an enricher (for normalized price and price max 30 days), a custom non-DataF

Re: Apex DataFrame

2016-03-23 Thread Devendra Tagare
at 6:27 PM, Ganelin, Ilya <ilya.gane...@capitalone.com> wrote: > Would love the community's input on this idea and thoughts on technical challenges we may run into when implementing this within Apex.

Re: Apex DataFrame

2016-03-23 Thread Ashwin Chandra Putta
s within Apex. > From: Herger, Brendan > Sent: Tuesday, March 22, 2016 7:49:51 PM > To: dev@apex.incubator.apache.org > Subject: Apex DataFrame

Re: Apex DataFrame

2016-03-23 Thread Amol Kekre
mplementing this within Apex. > From: Herger, Brendan > Sent: Tuesday, March 22, 2016 7:49:51 PM > To: dev@apex.incubator.apache.org > Subject: Apex DataFrame > Hey all, > I and the

RE: Apex DataFrame

2016-03-22 Thread Ganelin, Ilya
: dev@apex.incubator.apache.org Subject: Apex DataFrame Hey all, The Data Innovation Lab team at Capital One (including Ilya Ganelin, Brennon York, Dongming Liang and Rikin Shah) and I would like to propose a DataFrame for Apache Apex. We feel that a DataFrame object would solve Apex’s need for a standardized int

Apex DataFrame

2016-03-22 Thread Herger, Brendan
Hey all, The Data Innovation Lab team at Capital One (including Ilya Ganelin, Brennon York, Dongming Liang and Rikin Shah) and I would like to propose a DataFrame for Apache Apex. We feel that a DataFrame object would solve Apex’s need for a standardized internal data format (schema aware o