[GitHub] [kafka-site] miguno commented on a change in pull request #334: KAFKA-12393: Document multi-tenancy considerations

2021-03-02 Thread GitBox


miguno commented on a change in pull request #334:
URL: https://github.com/apache/kafka-site/pull/334#discussion_r585572446



##
File path: 27/ops.html
##
@@ -1090,7 +1090,157 @@ 
 
 
-  6.4 Kafka Configuration
+  6.4 Multi-Tenancy
+
+  Multi-Tenancy 
Overview
+
+  
+As a highly scalable event streaming platform, Kafka is used by many users 
as their central nervous system, connecting in real-time a wide range of 
different systems and applications from various teams and lines of businesses. 
Such multi-tenant cluster environments command proper control and management to 
ensure the peaceful coexistence of these different needs. This section 
highlights features and best practices to set up such shared environments, 
which should help you operate clusters that meet SLAs/OLAs and that minimize 
potential collateral damage caused by "noisy neighbors".
+  
+
+  
+Multi-tenancy is a many-sided subject, including but not limited to:
+  
+
+  
+Creating user spaces for tenants (sometimes called namespaces)
+Configuring topics with data retention policies and more
+Securing topics and clusters with encryption, authentication, and 
authorization
+Isolating tenants with quotas and rate limits
+Monitoring and metering
+Inter-cluster data sharing (cf. geo-replication)
+  
+
+  Creating User 
Spaces (Namespaces) For Tenants With Topic Naming
+
+  
+Kafka administrators operating a multi-tenant cluster typically need to 
define user spaces for each tenant. For the purpose of this section, "user 
spaces" are a collection of topics, which are grouped together under the 
management of a single entity or user.
+  
+
+  
+In Kafka, the main unit of data is the topic. Users can create and name 
each topic. They can also delete them, but it is not possible to rename a topic 
directly. Instead, to rename a topic, the user must create a new topic, move 
the messages from the original topic to the new, and then delete the original. 
With this in mind, it is recommended to define logical spaces, based on an 
hierarchical topic naming structure. This setup can then be combined with 
security features, such as prefixed ACLs, to isolate different spaces and 
tenants, while also minimizing the administrative overhead for securing the 
data in the cluster.
+  
+
+  
+These logical user spaces can be grouped in different ways, and the 
concrete choice depends on how your organization prefers to use your Kafka 
clusters. The most common groupings are as follows.
+  
+
+  
+By team or organizational unit: Here, the team is the main 
aggregator. In an organization where teams are the main user of the Kafka 
infrastructure, this might be the best grouping.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+
...(e.g., "acme.infosec.telemetry.logins")
+  
+
+  
+By project or product: Here, a team manages more than one 
project. Their credentials will be different for each project, so all the 
controls and settings will always be project related.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+..(e.g., "mobility.payments.suspicious")
+  
+
+  
+Certain information should normally not be put in a topic name, such as 
information that is likely to change over time (e.g., the name of the intended 
consumer) or that is a technical detail or metadata that is available elsewhere 
(e.g., the topic's partition count and other configuration settings).
+  
+
+  
+  To enforce a topic naming structure, it is useful to disable the Kafka 
feature to auto-create topics on demand by setting 
auto.create.topics.enable=false in the broker configuration. This 
stops users and applications from deliberately or inadvertently creating topics 
with arbitrary names, thus violating the naming structure. Then, you may want 
to put in place your own organizational process for controlled, yet automated 
creation of topics according to your naming convention, using scripting or your 
favorite automation toolkit.
+  
+
+  Configuring 
Topics: Data Retention And More
+
+  
+Kafka's configuration is very flexible due to its fine granularity, and it 
supports a plethora of per-topic configuration 
settings to help administrators set up multi-tenant clusters. For example, 
administrators often need to define data retention policies to control how much 
and/or for how long data will be stored in a topic, with settings such as retention.bytes (size) and retention.ms (time). This limits storage consumption 
within the cluster, and helps complying with legal requirements such as GDPR.
+  
+
+  Securing Clusters and 
Topics: Authentication, Authorization, Encryption
+
+  
+  Because the documentation has a dedicated chapter on security that applies to any Kafka deployment, this 
section focuses on additional considerations for multi-tenant environments.
+  
+
+  
+Security settings for Kafka fall into three

[GitHub] [kafka-site] miguno commented on a change in pull request #334: KAFKA-12393: Document multi-tenancy considerations

2021-03-02 Thread GitBox


miguno commented on a change in pull request #334:
URL: https://github.com/apache/kafka-site/pull/334#discussion_r585640800



##
File path: 27/ops.html
##
@@ -1090,7 +1090,157 @@ 
 
 
-  6.4 Kafka Configuration
+  6.4 Multi-Tenancy
+
+  Multi-Tenancy 
Overview
+
+  
+As a highly scalable event streaming platform, Kafka is used by many users 
as their central nervous system, connecting in real-time a wide range of 
different systems and applications from various teams and lines of businesses. 
Such multi-tenant cluster environments command proper control and management to 
ensure the peaceful coexistence of these different needs. This section 
highlights features and best practices to set up such shared environments, 
which should help you operate clusters that meet SLAs/OLAs and that minimize 
potential collateral damage caused by "noisy neighbors".
+  
+
+  
+Multi-tenancy is a many-sided subject, including but not limited to:
+  
+
+  
+Creating user spaces for tenants (sometimes called namespaces)
+Configuring topics with data retention policies and more
+Securing topics and clusters with encryption, authentication, and 
authorization
+Isolating tenants with quotas and rate limits
+Monitoring and metering
+Inter-cluster data sharing (cf. geo-replication)
+  
+
+  Creating User 
Spaces (Namespaces) For Tenants With Topic Naming
+
+  
+Kafka administrators operating a multi-tenant cluster typically need to 
define user spaces for each tenant. For the purpose of this section, "user 
spaces" are a collection of topics, which are grouped together under the 
management of a single entity or user.
+  
+
+  
+In Kafka, the main unit of data is the topic. Users can create and name 
each topic. They can also delete them, but it is not possible to rename a topic 
directly. Instead, to rename a topic, the user must create a new topic, move 
the messages from the original topic to the new, and then delete the original. 
With this in mind, it is recommended to define logical spaces, based on an 
hierarchical topic naming structure. This setup can then be combined with 
security features, such as prefixed ACLs, to isolate different spaces and 
tenants, while also minimizing the administrative overhead for securing the 
data in the cluster.
+  
+
+  
+These logical user spaces can be grouped in different ways, and the 
concrete choice depends on how your organization prefers to use your Kafka 
clusters. The most common groupings are as follows.
+  
+
+  
+By team or organizational unit: Here, the team is the main 
aggregator. In an organization where teams are the main user of the Kafka 
infrastructure, this might be the best grouping.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+
...(e.g., "acme.infosec.telemetry.logins")
+  
+
+  
+By project or product: Here, a team manages more than one 
project. Their credentials will be different for each project, so all the 
controls and settings will always be project related.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+..(e.g., "mobility.payments.suspicious")
+  
+
+  
+Certain information should normally not be put in a topic name, such as 
information that is likely to change over time (e.g., the name of the intended 
consumer) or that is a technical detail or metadata that is available elsewhere 
(e.g., the topic's partition count and other configuration settings).
+  
+
+  
+  To enforce a topic naming structure, it is useful to disable the Kafka 
feature to auto-create topics on demand by setting 
auto.create.topics.enable=false in the broker configuration. This 
stops users and applications from deliberately or inadvertently creating topics 
with arbitrary names, thus violating the naming structure. Then, you may want 
to put in place your own organizational process for controlled, yet automated 
creation of topics according to your naming convention, using scripting or your 
favorite automation toolkit.

Review comment:
   Ack and updated.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [kafka-site] miguno commented on a change in pull request #334: KAFKA-12393: Document multi-tenancy considerations

2021-03-02 Thread GitBox


miguno commented on a change in pull request #334:
URL: https://github.com/apache/kafka-site/pull/334#discussion_r585575690



##
File path: 27/ops.html
##
@@ -1090,7 +1090,157 @@ 
 
 
-  6.4 Kafka Configuration
+  6.4 Multi-Tenancy
+
+  Multi-Tenancy 
Overview
+
+  
+As a highly scalable event streaming platform, Kafka is used by many users 
as their central nervous system, connecting in real-time a wide range of 
different systems and applications from various teams and lines of businesses. 
Such multi-tenant cluster environments command proper control and management to 
ensure the peaceful coexistence of these different needs. This section 
highlights features and best practices to set up such shared environments, 
which should help you operate clusters that meet SLAs/OLAs and that minimize 
potential collateral damage caused by "noisy neighbors".
+  
+
+  
+Multi-tenancy is a many-sided subject, including but not limited to:
+  
+
+  
+Creating user spaces for tenants (sometimes called namespaces)
+Configuring topics with data retention policies and more
+Securing topics and clusters with encryption, authentication, and 
authorization
+Isolating tenants with quotas and rate limits
+Monitoring and metering
+Inter-cluster data sharing (cf. geo-replication)
+  
+
+  Creating User 
Spaces (Namespaces) For Tenants With Topic Naming
+
+  
+Kafka administrators operating a multi-tenant cluster typically need to 
define user spaces for each tenant. For the purpose of this section, "user 
spaces" are a collection of topics, which are grouped together under the 
management of a single entity or user.
+  
+
+  
+In Kafka, the main unit of data is the topic. Users can create and name 
each topic. They can also delete them, but it is not possible to rename a topic 
directly. Instead, to rename a topic, the user must create a new topic, move 
the messages from the original topic to the new, and then delete the original. 
With this in mind, it is recommended to define logical spaces, based on an 
hierarchical topic naming structure. This setup can then be combined with 
security features, such as prefixed ACLs, to isolate different spaces and 
tenants, while also minimizing the administrative overhead for securing the 
data in the cluster.
+  
+
+  
+These logical user spaces can be grouped in different ways, and the 
concrete choice depends on how your organization prefers to use your Kafka 
clusters. The most common groupings are as follows.
+  
+
+  
+By team or organizational unit: Here, the team is the main 
aggregator. In an organization where teams are the main user of the Kafka 
infrastructure, this might be the best grouping.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+
...(e.g., "acme.infosec.telemetry.logins")
+  
+
+  
+By project or product: Here, a team manages more than one 
project. Their credentials will be different for each project, so all the 
controls and settings will always be project related.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+..(e.g., "mobility.payments.suspicious")
+  
+
+  
+Certain information should normally not be put in a topic name, such as 
information that is likely to change over time (e.g., the name of the intended 
consumer) or that is a technical detail or metadata that is available elsewhere 
(e.g., the topic's partition count and other configuration settings).
+  
+
+  
+  To enforce a topic naming structure, it is useful to disable the Kafka 
feature to auto-create topics on demand by setting 
auto.create.topics.enable=false in the broker configuration. This 
stops users and applications from deliberately or inadvertently creating topics 
with arbitrary names, thus violating the naming structure. Then, you may want 
to put in place your own organizational process for controlled, yet automated 
creation of topics according to your naming convention, using scripting or your 
favorite automation toolkit.
+  
+
+  Configuring 
Topics: Data Retention And More
+
+  
+Kafka's configuration is very flexible due to its fine granularity, and it 
supports a plethora of per-topic configuration 
settings to help administrators set up multi-tenant clusters. For example, 
administrators often need to define data retention policies to control how much 
and/or for how long data will be stored in a topic, with settings such as retention.bytes (size) and retention.ms (time). This limits storage consumption 
within the cluster, and helps complying with legal requirements such as GDPR.
+  
+
+  Securing Clusters and 
Topics: Authentication, Authorization, Encryption
+
+  
+  Because the documentation has a dedicated chapter on security that applies to any Kafka deployment, this 
section focuses on additional considerations for multi-tenant environments.
+  
+
+  
+Security settings for Kafka fall into three

[GitHub] [kafka-site] miguno commented on a change in pull request #334: KAFKA-12393: Document multi-tenancy considerations

2021-03-02 Thread GitBox


miguno commented on a change in pull request #334:
URL: https://github.com/apache/kafka-site/pull/334#discussion_r585572446



##
File path: 27/ops.html
##
@@ -1090,7 +1090,157 @@ 
 
 
-  6.4 Kafka Configuration
+  6.4 Multi-Tenancy
+
+  Multi-Tenancy 
Overview
+
+  
+As a highly scalable event streaming platform, Kafka is used by many users 
as their central nervous system, connecting in real-time a wide range of 
different systems and applications from various teams and lines of businesses. 
Such multi-tenant cluster environments command proper control and management to 
ensure the peaceful coexistence of these different needs. This section 
highlights features and best practices to set up such shared environments, 
which should help you operate clusters that meet SLAs/OLAs and that minimize 
potential collateral damage caused by "noisy neighbors".
+  
+
+  
+Multi-tenancy is a many-sided subject, including but not limited to:
+  
+
+  
+Creating user spaces for tenants (sometimes called namespaces)
+Configuring topics with data retention policies and more
+Securing topics and clusters with encryption, authentication, and 
authorization
+Isolating tenants with quotas and rate limits
+Monitoring and metering
+Inter-cluster data sharing (cf. geo-replication)
+  
+
+  Creating User 
Spaces (Namespaces) For Tenants With Topic Naming
+
+  
+Kafka administrators operating a multi-tenant cluster typically need to 
define user spaces for each tenant. For the purpose of this section, "user 
spaces" are a collection of topics, which are grouped together under the 
management of a single entity or user.
+  
+
+  
+In Kafka, the main unit of data is the topic. Users can create and name 
each topic. They can also delete them, but it is not possible to rename a topic 
directly. Instead, to rename a topic, the user must create a new topic, move 
the messages from the original topic to the new, and then delete the original. 
With this in mind, it is recommended to define logical spaces, based on an 
hierarchical topic naming structure. This setup can then be combined with 
security features, such as prefixed ACLs, to isolate different spaces and 
tenants, while also minimizing the administrative overhead for securing the 
data in the cluster.
+  
+
+  
+These logical user spaces can be grouped in different ways, and the 
concrete choice depends on how your organization prefers to use your Kafka 
clusters. The most common groupings are as follows.
+  
+
+  
+By team or organizational unit: Here, the team is the main 
aggregator. In an organization where teams are the main user of the Kafka 
infrastructure, this might be the best grouping.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+
...(e.g., "acme.infosec.telemetry.logins")
+  
+
+  
+By project or product: Here, a team manages more than one 
project. Their credentials will be different for each project, so all the 
controls and settings will always be project related.
+  
+
+  
+Example topic naming structure:
+  
+
+  
+..(e.g., "mobility.payments.suspicious")
+  
+
+  
+Certain information should normally not be put in a topic name, such as 
information that is likely to change over time (e.g., the name of the intended 
consumer) or that is a technical detail or metadata that is available elsewhere 
(e.g., the topic's partition count and other configuration settings).
+  
+
+  
+  To enforce a topic naming structure, it is useful to disable the Kafka 
feature to auto-create topics on demand by setting 
auto.create.topics.enable=false in the broker configuration. This 
stops users and applications from deliberately or inadvertently creating topics 
with arbitrary names, thus violating the naming structure. Then, you may want 
to put in place your own organizational process for controlled, yet automated 
creation of topics according to your naming convention, using scripting or your 
favorite automation toolkit.
+  
+
+  Configuring 
Topics: Data Retention And More
+
+  
+Kafka's configuration is very flexible due to its fine granularity, and it 
supports a plethora of per-topic configuration 
settings to help administrators set up multi-tenant clusters. For example, 
administrators often need to define data retention policies to control how much 
and/or for how long data will be stored in a topic, with settings such as retention.bytes (size) and retention.ms (time). This limits storage consumption 
within the cluster, and helps complying with legal requirements such as GDPR.
+  
+
+  Securing Clusters and 
Topics: Authentication, Authorization, Encryption
+
+  
+  Because the documentation has a dedicated chapter on security that applies to any Kafka deployment, this 
section focuses on additional considerations for multi-tenant environments.
+  
+
+  
+Security settings for Kafka fall into three