Re: Iceberg table format

2017-12-11 Thread Atri Sharma
that we can start thinking about extending it (specifically around deletes?) Regards, Atri On Tue, Dec 12, 2017 at 12:55 AM, Ryan Blue wrote: > On Sat, Dec 9, 2017 at 3:38 AM, Atri Sharma wrote: >> >> Thanks for the specification. >> >> A couple of questions: >>

Re: Iceberg table format

2017-12-09 Thread Atri Sharma
Thanks for the specification. A couple of questions: 1) what does this to parquet and not to any underlying store? 2) If above is not true, can we expose an interface to install any underlying file format? 3) if we are defining snapshots, can we allow MVCC on top of the snapshots? To elaborate o

Re: Regarding PARQUET-1155

2017-12-07 Thread Atri Sharma
Dec 5, 2017 at 8:54 AM, Eric Owhadi wrote: > > > One thing to account for is the row count stats spread all over the > > various level of stats. If a record is logical deleted, then rowcount = > > rowcount -1. > > So when using any level of stats to compute row count,

Re: Regarding PARQUET-1155

2017-12-05 Thread Atri Sharma
at 9:17 PM, lukas nalezenec wrote: > Hi, > I think that delete marker is good idea. > I was in basic GDPR training and i think that it meets EU law requirements > > Lukas > > 2017-12-05 11:37 GMT+01:00 Atri Sharma : > >> Agreed. >> >> I have come up with

Re: Regarding PARQUET-1155

2017-12-05 Thread Atri Sharma
d, I am not sure this would meet the requirements > of the EU law, since the "deleted" data could still be read by an adversary > even if a Parquet implementation like parquet-mr did not permit it > > - Wes > > On Mon, Dec 4, 2017 at 11:55 AM, Atri Sharma wrote: >

Re: Regarding PARQUET-1155

2017-12-04 Thread Atri Sharma
encoding > scheme) would furthermore be compressed. Editing records in general > would change the size of the compressed and encoded data stream, so > you could not edit the page without rewriting the file. > > - Wes > > On Mon, Dec 4, 2017 at 11:46 AM, Atri Sharma wrote: >

Re: Regarding PARQUET-1155

2017-12-04 Thread Atri Sharma
t; > On Mon, Dec 4, 2017 at 11:30 AM, Atri Sharma wrote: >> Hi Folks, >> >> Any update? >> >> On Fri, Dec 1, 2017 at 9:23 AM, Atri Sharma wrote: >>> https://issues.apache.org/jira/browse/PARQUET-1155 >>> >>> Anybody working on it? Can I take it up? >> >> >> >> -- >> Regards, >> >> Atri >> l'apprenant -- Regards, Atri l'apprenant

Re: Regarding PARQUET-1155

2017-12-04 Thread Atri Sharma
Hi Folks, Any update? On Fri, Dec 1, 2017 at 9:23 AM, Atri Sharma wrote: > https://issues.apache.org/jira/browse/PARQUET-1155 > > Anybody working on it? Can I take it up? -- Regards, Atri l'apprenant

Regarding PARQUET-1155

2017-11-30 Thread Atri Sharma
https://issues.apache.org/jira/browse/PARQUET-1155 Anybody working on it? Can I take it up?

[jira] [Commented] (PARQUET-1155) Support for GDPR erase requirements

2017-11-30 Thread Atri Sharma (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16273057#comment-16273057 ] Atri Sharma commented on PARQUET-1155: -- Is this issue being actively worked o