Re: [DISCUSS] CarbonData incubation proposal

2016-05-26 Thread Gangumalla, Uma
+1 (binding) Regards, Uma On 5/18/16, 8:52 PM, "Jean-Baptiste Onofré" wrote: >Hi all, > >We would like to discuss about a new proposal for the incubator: >CarbonData. > >CarbonData is a new Apache Hadoop native file format for faster >interactive query using advanced

Re: [DISCUSS] CarbonData incubation proposal

2016-05-25 Thread Julien Le Dem
Yes i believe references to asf projects when needed is sufficient. Julien > On May 23, 2016, at 16:19, Henry Saputra wrote: > > I thought the concern had been addressed? > > For Julian concern about Mondrian, the code was inspired by Mondrian but do > not have

Re: [DISCUSS] CarbonData incubation proposal

2016-05-24 Thread Liang Chen
.996316.n3.nabble.com/DISCUSS-CarbonData-incubation-proposal-tp49643p49794.html Sent from the Apache Incubator - General mailing list archive at Nabble.com. - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org

Re: [DISCUSS] CarbonData incubation proposal

2016-05-24 Thread Nick Burch
On Thu, 19 May 2016, Jean-Baptiste Onofré wrote: The proposal is included below and also available on the wiki: https://wiki.apache.org/incubator/CarbonDataProposal Comparing the Initial Committers list with the Github contributors list, there look to be a few people currently quite involved

Re: [DISCUSS] CarbonData incubation proposal

2016-05-24 Thread JihongMa
.nabble.com/DISCUSS-CarbonData-incubation-proposal-tp49643p49756.html Sent from the Apache Incubator - General mailing list archive at Nabble.com. - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Jean-Baptiste Onofré
Hi, +1, I asked to the current contributors to review and check all code in order to cleanup and identify "used/inspired/derived" code. On the other hand, I also said to Liang that a SGA will be required. I think we can start a vote and address the code cleanup in the mean time. Regards JB

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Gangumalla, Uma
Since the Parquet is ASF project, referencing may make sense to me,Yes, I think Parquet project guys can comment on this point whether is it make sense for them. >With the usual precautions of a BD scan on the incoming IP and ongoing diligence by the PPMC we'll be fine. Thanks Julian for this

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Henry Saputra
+1 to that, Julian. You were absolutely right, I apologize I did not make it clear. Really appreciate another set of eyes reviewing it. - Henry On Mon, May 23, 2016 at 5:15 PM, Julian Hyde wrote: > For the record, at the time that I reviewed the github repo, there was > code

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Julian Hyde
For the record, at the time that I reviewed the github repo, there was code that was not merely *inspired* by Mondrian code, but *derived* from Mondrian code. But that code has since been removed, so the issue is resolved. With the usual precautions of a BD scan on the incoming IP and ongoing

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Henry Saputra
I thought the concern had been addressed? For Julian concern about Mondrian, the code was inspired by Mondrian but do not have direct derivatives of the code. According to Jacky, the old code is no longer used. As for Julien concern about Parquet, the design seemed to be inspired by Parquet and

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Roman Shaposhnik
On Mon, May 23, 2016 at 3:44 PM, Marvin Humphrey wrote: > On Sun, May 22, 2016 at 10:57 PM, Jean-Baptiste Onofré > wrote: >> Hi Luke, >> >> I fully agree with you. The committers are already involved to clean-up the >> repo (PRs have been created). >>

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Marvin Humphrey
On Sun, May 22, 2016 at 10:57 PM, Jean-Baptiste Onofré wrote: > Hi Luke, > > I fully agree with you. The committers are already involved to clean-up the > repo (PRs have been created). > > IMHO, this step is decoupled from the proposal vote itself: the only > requirement is to

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Jean-Baptiste Onofré
s are no longer needed but still present in the repo, so we are planning to clean up the code base soon. Definitely, you are right, we will make sure all source code is under Apache License only. Regards, Jacky Li -- View this message in context: http://apache-incubator-general.996316.n3.nabble.

Re: [DISCUSS] CarbonData incubation proposal

2016-05-23 Thread Luke Han
packages are no longer needed but still present in the repo, >>> so >>> we are planning to clean up the code base soon. >>> >>> Definitely, you are right, we will make sure all source code is

Re: [DISCUSS] CarbonData incubation proposal

2016-05-22 Thread Jean-Baptiste Onofré
Definitely, you are right, we will make sure all source code is under Apache License only. Regards, Jacky Li -- View this message in context: http://apache-incubator-general.996316.n3.nabble.com/DISCUSS-CarbonData-incubation-proposal-tp49643p49678.html Sent from the Apache Incubator - Gene

Re: [DISCUSS] CarbonData incubation proposal

2016-05-21 Thread Luke Han
ng to clean up the code base soon. > > Definitely, you are right, we will make sure all source code is under > Apache > License only. > > Regards, > Jacky Li > > > > > -- > View this message in context: > http://apache-incubator-general.996316.n3.nabble.com

Re: [DISCUSS] CarbonData incubation proposal

2016-05-20 Thread Jacky Li
make sure all source code is under Apache License only. Regards, Jacky Li -- View this message in context: http://apache-incubator-general.996316.n3.nabble.com/DISCUSS-CarbonData-incubation-proposal-tp49643p49678.html Sent from the Apache Incubator - General mailing list archive at

Re: [DISCUSS] CarbonData incubation proposal

2016-05-20 Thread Jacky Li
. Please check whether still have issues. Regards, Jacky Li -- View this message in context: http://apache-incubator-general.996316.n3.nabble.com/DISCUSS-CarbonData-incubation-proposal-tp49643p49676.html Sent from the Apache Incubator - General mailing list archive at Nabble.com

Re: [DISCUSS] CarbonData incubation proposal

2016-05-19 Thread Julien Le Dem
rmat : like > > interactive OLAP-style query, Sequential Access (big scan), Random Access > > (narrow scan). > > > > Please kindly let me know if the above info answer your questions. > > > > Regards > > Liang > > > > > > > &

Re: [DISCUSS] CarbonData incubation proposal

2016-05-19 Thread Julian Hyde
ess > (narrow scan). > > Please kindly let me know if the above info answer your questions. > > Regards > Liang > > > > > > > -- > View this message in context: >

Re: [DISCUSS] CarbonData incubation proposal

2016-05-19 Thread Liang Chen
ry, Sequential Access (big scan), Random Access (narrow scan). Please kindly let me know if the above info answer your questions. Regards Liang -- View this message in context: http://apache-incubator-general.996316.n3.nabble.com/DISCUSS-CarbonData-incubation-proposal-tp49643p49652.html Sent f

Re: [DISCUSS] CarbonData incubation proposal

2016-05-19 Thread Lars Francke
Hi Jean-Baptiste, can you - or anyone else for that matter - comment on how it relates to Parquet and ORC? The Github page says "The CaronData file format provides a highly efficient way to store structured data,it was designed to overcome limitations of the other Hadoop file formats." so it'd

RE: [DISCUSS] CarbonData incubation proposal

2016-05-19 Thread Zheng, Kai
: [DISCUSS] CarbonData incubation proposal Hi all, We would like to discuss about a new proposal for the incubator: CarbonData. CarbonData is a new Apache Hadoop native file format for faster interactive query using advanced columnar storage, index, compression and encoding techniques to improve

[DISCUSS] CarbonData incubation proposal

2016-05-18 Thread Jean-Baptiste Onofré
Hi all, We would like to discuss about a new proposal for the incubator: CarbonData. CarbonData is a new Apache Hadoop native file format for faster interactive query using advanced columnar storage, index, compression and encoding techniques to improve computing efficiency, in turn it will