This is a good scope. And given there are multiple choices, an advanced
student can expand scope to do both.

Kenn

On Thu, Feb 21, 2019 at 5:36 PM Austin Bennett <whatwouldausti...@gmail.com>
wrote:

> Hi Pablo,
>
> Agree on the usefulness.
>
> Some thoughts embedded:
>
>
> On Thu, Feb 21, 2019 at 3:19 PM Pablo Estrada <pabl...@google.com> wrote:
>
>> Hello all,
>> I was thinking that a filesystem with support for s3 would be great to
>> have in the Python SDK. If I am not wrong, it would simply involve
>> implementing the filesystem classes
>> <https://github.com/apache/beam/blob/master/sdks/python/apache_beam/io/filesystem.py>
>>  with
>> s3, right?
>>
>
> If talking about extending filesytems to alternate clouds -- would Azure
> Blob storage also be sensible (not something I need, but imagine could be
> valuable and pretty easy as I (naively) think these (gcs/azure-blob/s3 are
> largely interchangeable).
>
>
>> I am not familiar enough with s3, nor with filesystems, nor with AWS in
>> general - but I have some outstanding questions:
>>
>>    - Does this mean that we probably would need an extra [s3] target for
>>    installing apache_beam, like we do with [gcp]?
>>       - Not strictly necessary, but probably desirable...
>>
>> To think about:  would there be [aws] to encompass more potential/future
> aws things, to make it akin to gcp, or equivalently, would we want a
> [gcs/gs] target to narrow down what gets loaded?
>
>
>>    - How do we handle KMS in GCS filesystem?
>>    - Would the filesystem encapsulation make KMS support in an s3
>>    filesystem difficult?
>>    - Or even more... is the KMS support in AWS very different than in
>>    GCP?
>>       - I'd love comments from anyone informed around this : )
>>
>> I use KMS with AWS, the tricky part is custom managed keys.  I haven't
> dug in enough to see how similar/different GCS implementation is (I thought
> I only saw keys managed by GCP, so potentially easier, though AWS does have
> that option).
>
>
>
>>
>>    - Is this project of an appropriate size for a GSoC student?
>>
>> Can't speak to appropriate size; was this listed as a project?  Did we
> have sufficiently vague proposals?  I thought applications had been turned
> in?
>
>
>
>
>> Thoughts?
>> Best
>> -P.
>>
>

Reply via email to