Thanks Jarek and Shubham for the clarifications.

Looking forward to this one!

Thanks & Regards,
Amogh Desai

On Sat, Mar 16, 2024 at 10:10 AM Mehta, Shubham <shu...@amazon.com.invalid>
wrote:

> Jarek - I totally agree. We had a similar conversation in Dec 2022, and
> Filip K. from Google suggested [1] calling them "workspaces." But I think
> most of our users and contributors are used to the word "tenant." Trying a
> new term like "workspaces" just seems to make things more confusing. For
> example, when I tried using it with a couple of developers at AWS, who were
> somewhat familiar with Airflow, it immediately prompted questions about its
> relation to "tenants."
>
> I also liked how you explained it in the AIP response. It's like when
> Kubernetes talks about "multi-tenant" [2] and they mean it could be for
> different customers or different teams. What we’re doing with Airflow is
> for teams, not really different customers. But it's simpler to keep calling
> it "multi-tenant," just like Kubernetes does, and make sure we explain that
> we're talking about different teams.
>
> Reference:
> 1.
> https://docs.google.com/document/d/1n23h26p4_8F5-Cd0JGLPEnF3gumJ5hw3EpwUljz7HcE/edit?disco=AAAAlo3bv6Q
> 2. https://kubernetes.io/docs/concepts/security/multi-tenancy/
>
> Shubham
>
> On 2024-03-15, 2:50 AM, "Jarek Potiuk" <ja...@potiuk.com <mailto:
> ja...@potiuk.com>> wrote:
>
>
> CAUTION: This email originated from outside of the organization. Do not
> click links or open attachments unless you can confirm the sender and know
> the content is safe.
>
>
>
>
>
>
> AVERTISSEMENT: Ce courrier électronique provient d’un expéditeur externe.
> Ne cliquez sur aucun lien et n’ouvrez aucune pièce jointe si vous ne pouvez
> pas confirmer l’identité de l’expéditeur et si vous n’êtes pas certain que
> le contenu ne présente aucun risque.
>
>
>
>
>
>
> Thaks Shubham - that's a really nice and succinct description of the vision
> behind tht 'Multi-tenant deployment' AIP. The case you described is spot-on
> where I see there is enough value for the organisation that would like to
> create such multi-tenant deployment of Airflow.
>
>
> And I think one comment fro TP in the AIP that summaries it well - this is
> not really `Making Airflow multi-tenant'. It's merely enabling airflow
> components to be put together in a single multi-tenant deployment -
> consisting with a number of smaller sub-deployments that achieve a nice
> IMHO way of being able to decompose monolithic Airflow installation Ina a
> set of loosely cooperating 'sub-deployments' - achieving some (interesting)
> centralisation property - while giving enough freedom for tenants to manage
> their part separately
>
>
> In a way it might be a response to the claims that airflow is this old
> monolithic style application and it's not the new, cool micro-service
> /serverless world. Which I think trades some problems with a set of other
> problems mainly around managing all those micro-sevices to cooperate with
> each other and introduces a lot of performance and service management
> problems.
>
>
> But with the architecture described here it is a bit connecting best things
> if both worlds - splitting Airflow into 'midi-services' - where the
> architecture of your deploynent nicely follows Conway's Law and
> decomposition is done at the right boundaries - boundaries where Tenant
> Deployment Managers might claim as 'theirs' while the 'Airflow Platform
> Team' keeps the whole Airflow deployment under control because there are
> clear boundaries what the Platform team is responsible for, and where each
> Tenant has their own 'kingdom'.
>
>
> I think once we get this one approved and some deployments out there in the
> wild - we will be able to claim that the architecture is actually the next
> gen after monolithic, micro-sevices and serverless - bit done better,
> taking into account some real-life user scenarios and Making it relatively
> easy to adopt such deployment because of Conway's Law limitations.
>
>
> I thought a lot about Conway's Law playing a role in this whole design and
> I think your Business Case, Shubham, is a perfect reflection of how well
> the proposes architecture fits in the Business structures of organisation
> that would like to deploy it.
>
>
> J.
>
>
> czw., 14 mar 2024, 20:21 użytkownik Mehta, Shubham
> <shu...@amazon.com.inva <mailto:shu...@amazon.com.inva>lid> napisał:
>
>
> > Hi folks,
> >
> > Firstly, thanks Jarek for putting together such a thorough and
> > well-thought-out proposal.
> >
> >
> >
> > I am very much in support of the multi-tenancy proposal. Having discussed
> > this with over 30 customers (AWS and non-AWS), there's a clear desire to
> > shift focus from the complex management of multiple Airflow environments
> to
> > enhancing their capabilities, such as enabling data quality checks and
> > lineage. This proposal is a significant step towards achieving that goal.
> >
> > Acknowledging that not every Airflow user has enough time to thoroughly
> > review the AIP, I have drafted a user scenario that encapsulates what's
> > possible with the implementation of multi-tenancy support:
> >
> > *---- Scenario: Multi-Tenancy in Apache Airflow at [Rocket] ----*
> >
> > [Rocket], a leading [mobile gaming platform], has adeptly structured its
> > cloud operations using Apache Airflow to provide an efficient and secure
> > multi-tenant environment for orchestrating their complex workflows. This
> > approach caters to the diverse needs of their three main user groups: the
> > Data Engineering team, the Data Science team, and the Data Analytics
> team.
> >
> > All teams share basic Airflow components like the Scheduler and
> Webserver,
> > providing centralized management with shared cost. Each team has its own
> > distinct tenant cluster, offering self-sufficiency, flexibility, and
> > isolation. The Data Engineering team builds ETL/ELT pipelines and
> produces
> > user profile, telemetry, and marketing data. The Analytics team works
> with
> > marketing data and user information to build comprehensive dashboards.
> The
> > Data Science team uses Kubernetes as their execution environment for
> > heavy-duty machine learning tasks, producing a churn prediction dataset.
> >
> > Members of each team can only see and work with their own workflows.
> > However, Data engineers are granted access to all tenants, enabling them
> to
> > assist with DAG troubleshooting and optimization across all teams. Upon
> > logging in, users are presented with a tenant-specific view, displaying
> > only the relevant DAGs and artifacts. For those with multi-tenant access,
> > seamless navigation between different tenant views is available without
> the
> > need for re-authentication.
> >
> >
> > This setup lets each team work independently with their own tools and
> > data, while also getting help from data engineers when needed. It's
> secure,
> > efficient, and user-friendly.
> >
> > *Image:* https://imgur.com/gallery/uQNqiVc <
> https://imgur.com/gallery/uQNqiVc> (highly recommend reviewing
> > the image to understand the underlying setup)
> >
> >
> *-----------------------------------------------------------------------------------*
> >
> > I’d suggest that interested Airflow users review the scenario and share
> > your support or concerns on this concept in this thread or AIP. For those
> > interested in diving deeper into the details, the AIP is available here -
> >
> https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-67+Multi-tenant+deployment+of+Airflow+components
> <
> https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-67+Multi-tenant+deployment+of+Airflow+components
> >
> >
> >
> > Thanks
> > Shubham
> > Product Manager - Amazon MWAA
> >
> >
> >
> > *From: *Jarek Potiuk <ja...@potiuk.com <mailto:ja...@potiuk.com>>
> > *Reply-To: *"us...@airflow.apache.org <mailto:us...@airflow.apache.org>"
> <us...@airflow.apache.org <mailto:us...@airflow.apache.org>>
> > *Date: *Monday, March 11, 2024 at 4:05 PM
> > *To: *"dev@airflow.apache.org <mailto:dev@airflow.apache.org>" <
> dev@airflow.apache.org <mailto:dev@airflow.apache.org>>, "
> > us...@airflow.apache.org <mailto:us...@airflow.apache.org>" <
> us...@airflow.apache.org <mailto:us...@airflow.apache.org>>
> > *Subject: *RE: [EXTERNAL] [COURRIEL EXTERNE] [DISCUSS] DRAFT AIP-67
> > Multi-tenant deployment of Airflow components
> >
> >
> >
> > *CAUTION*: This email originated from outside of the organization. Do not
> > click links or open attachments unless you can confirm the sender and
> know
> > the content is safe.
> >
> >
> >
> > *AVERTISSEMENT*: Ce courrier électronique provient d’un expéditeur
> > externe. Ne cliquez sur aucun lien et n’ouvrez aucune pièce jointe si
> vous
> > ne pouvez pas confirmer l’identité de l’expéditeur et si vous n’êtes pas
> > certain que le contenu ne présente aucun risque.
> >
> >
> >
> > I have iterated and already got a LOT of comments from a LOT of people
> > (Thanks everyone who spent time on it ). I'd say the document is out of
> > draft already, it very much describes the idea of multi-tenancy that I
> hope
> > we will be voting on some time in the future.
> >
> >
> >
> > Taking into account that ~ 30% of people in our survey said they want
> > "mutl-tenancy" - what I am REALLY interested in is to get honest feedback
> > about the proposal. Manly:
> >
> >
> >
> > **"Is this the multi-tenancy you were looking for?" *
> >
> >
> >
> > Or were you looking for different droids (err, tenants) maybe?.
> >
> >
> >
> > I do not want to exercise my Jedi skills to influence your opinion,
> that's
> > why the document is there (and some people say it's nice, readable and
> > pretty complete) so that you can judge yourself and give feedback.
> >
> >
> >
> > The document is here:
> >
> https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-67+Multi-tenant+deployment+of+Airflow+components
> <
> https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-67+Multi-tenant+deployment+of+Airflow+components
> >
> >
> >
> >
> >
> > Feel free to comment here, or in the document. I would love to hear more
> > voices, and have some ideas what to do next to validate the idea, so
> please
> > - engage for now - but also expect some follow-ups.
> >
> >
> >
> > J.
> >
> >
> >
> >
> >
> > On Wed, Mar 6, 2024 at 9:16 AM Jarek Potiuk <ja...@potiuk.com <mailto:
> ja...@potiuk.com>> wrote:
> >
> > Sooo.. Seems that it's an AIP time :D I've just published a Draft of
> > AIP-67:
> >
> > Multi-tenant deployment of Airflow components
> >
> >
> >
> https://cwiki.apache.org/confluence/display/AIRFLOW/%5BDRAFT%5D+AIP-67+Multi-tenant+deployment+of+Airflow+components
> <
> https://cwiki.apache.org/confluence/display/AIRFLOW/%5BDRAFT%5D+AIP-67+Multi-tenant+deployment+of+Airflow+components
> >
> >
> > This AIP is a bit lighter in detail than the others you could see
> > from Jed , Nikolas and Maciej. This is really a DRAFT / High Level
> > idea of Multi-Tenancy that could be implemented as the follow-up after
> > previous steps of Multi-Tenancy implemented (or being implemented)
> > right now.
> >
> > I decided to - rather than describe all the details now - focus on
> > the concept of Multitenancy that I wanted to propose. Most of all
> > explaining the concept, comparing it to current ways of achieving some
> > forms of multi-tenancy and showing benefits and drawbacks of the
> > solution and connected costs (i.e. what complexity we need to add to
> > achieve it).
> >
> > When thinking about Multi-tenancy, I realized few things:
> >
> > * everyone might understand multi-tenancy differently
> > * some forms of multi-tenancy are achievable even today
> > * but - most of all - I started to question myself "Is this what we
> > can do, enough for some, sufficiently numerous groups of users to call
> > it a useful feature for them".
> >
> > So before we get into more details - my aim is to make sure we are all
> > at the same page on what we CAN do as a multi-tenancy, and eventually
> > to decide whether we SHOULD do it.
> >
> > Have fun. Bring in comments and feedback.
> >
> > More about all the currently active AIPs at today's Town Hall
> >
> > BTW. Do you think it's a surprise that 5 AIPS were announced just
> > before the Town Hall? I think not :D
> >
> > J.
> >
> >
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@airflow.apache.org
> For additional commands, e-mail: dev-h...@airflow.apache.org
>

Reply via email to