Technical guide covering implementation details and usage of the
standalone crypto kernel module feature.

Signed-off-by: Jay Wang <[email protected]>
---
 crypto/fips140/README | 404 ++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 404 insertions(+)
 create mode 100644 crypto/fips140/README

diff --git a/crypto/fips140/README b/crypto/fips140/README
new file mode 100644
index 0000000000000..52488d5726d25
--- /dev/null
+++ b/crypto/fips140/README
@@ -0,0 +1,404 @@
+## 1. Introduction
+
+Amazon Linux is releasing a new kernel feature that converts the previously 
built-in kernel crypto subsystem into a standalone kernel module. This module 
becomes the carrier of the kernel crypto subsystem and can be loaded at early 
boot to provide the same functionality as the original built-in crypto. The 
primary motivation for this modularization is to streamline Federal Information 
Processing Standards (FIPS) validation, a critical cryptographic certification 
for cloud computing users doing business with the U.S. government.
+ 
+In a bit more detail, previously, FIPS certification was tied to the entire 
kernel image, meaning non-crypto updates could potentially invalidate 
certification. With this feature, FIPS certification is tied only to the crypto 
module. Therefore, once the module is certified, loading this certified module 
on newer kernels automatically makes those kernels FIPS-certified. As a result, 
this approach can save re-certification costs and 12-18 months of waiting time 
by reducing the need for repeated FIPS re-certification cycles.
+
+This document provides technical details on how this feature is designed and 
implemented for users or developers who are interested in developing upon it, 
and is organized as follows:
+- Section 2 - Getting Started: Quick start on how to enable the feature
+- Section 3 - Workflow Overview: Changes this feature brings to build and 
runtime
+- Section 4 - Design Implementation Details: Technical deep-dive into each 
component
+- Section 5 - Customizing and Extending Crypto Module: How to select crypto to 
be included and extend to new crypto/architectures
+- Section 6 - Reusing a Certified Module in Practice: Different reuse and 
maintenance strategies and their tradeoffs
+
+## 2. Getting Started
+
+This section provides a quick start guide for developers on how to enable, 
compile and use the standalone cryptography module feature.
+
+### 2.1 Basic Configuration
+
+The feature is controlled by a single configuration option:
+```
+CONFIG_CRYPTO_FIPS140_EXTMOD=y
+```
+What it does: When enabled, automatically redirects a set of cryptographic 
algorithms from the main kernel into a standalone module 
`crypto/fips140/fips140.ko`. The cryptographic algorithms that are redirected 
need to satisfy all the following conditions, otherwise the cryptography will 
remain in its original form:
+1. Must be configured as built-in (i.e., `CONFIG_CRYPTO_*=y`). This means 
cryptography already configured as modular (i.e., `CONFIG_CRYPTO_*=m`) are not 
redirected as they are already modularized.
+2. Must be among a list, which can be customized by developers as described in 
Section 5.
+
+When disabled, the kernel behaves as before.
+
+### 2.2 Build Process
+
+Once `CONFIG_CRYPTO_FIPS140_EXTMOD=y` is set, no additional steps are 
required. The standalone module will be built automatically as part of the 
standard kernel build process:
+```
+make -j$(nproc)
+# or
+make vmlinux
+```
+**What happens automatically (No user action required):**
+1. Build the module as `crypto/fips140/fips140.ko`
+2. The cryptography module will be loaded at boot time
+3. All kernel cryptographic services will provide the same functionality as 
before (i.e., prior to introducing this new feature) once boot completes.
+
+### 2.3 Advanced Configuration Options
+
+**Using External Cryptography Module:**
+```
+CONFIG_CRYPTO_FIPS140_EXTMOD_SOURCE=y
+```
+By default, `CONFIG_CRYPTO_FIPS140_EXTMOD_SOURCE` is not set, meaning the 
freshly built cryptography module is used. Otherwise, the pre-built standalone 
cryptography module from `fips140_build/crypto/fips140/fips140.ko` and modular 
cryptography such as `fips140_build/crypto/aes.ko` (need to manually place 
pre-built modules in these locations before the build) are included in kernel 
packaging (e.g., during `make modules_install`) and are used at later boot time.
+
+**Dual Version Support:**
+```
+CONFIG_CRYPTO_FIPS140_DUAL_VERSION=y
+```
+Encapsulate two versions of `fips140.ko` into kernel: one is freshly built for 
non-FIPS mode usage, another is pre-built specified by 
`fips140_build/crypto/fips140/fips140.ko` for FIPS mode usage. The appropriate 
version is selected and loaded at boot time based on boot time FIPS mode status.
+
+### 2.4 Verification
+
+To verify the feature is working, after install and boot with the new kernel:
+```
+# Check if fips140.ko module is loaded
+lsmod | grep fips140
+# Check if crypto algorithms are served by the fips140 module
+cat /proc/crypto | grep module | grep fips140
+```
+
+## 3. Workflow Overview
+
+This section provides an overview without delving into deep technical details 
of the changes the standalone cryptography module feature introduces. When this 
feature is enabled, it introduces changes to both the kernel build and booting 
process. 
+
+3.1 Build-Time Changes
+
+Kernel cryptography subsystem consists of both cryptography management 
infrastructure (e.g., `crypto/api.c`, `crypto/algapi.c`, etc), along with 
hundreds of different cryptography algorithms (e.g., `crypto/arc4.c`).
+
+**Traditional Build Process:**
+Traditionally, cryptography management infrastructure are always built-in to 
the kernel, while cryptographic algorithms can be configured to be built either 
as built-in (`CONFIG_CRYPTO_*=y`) or as separate modular (`CONFIG_CRYPTO_*=m`) 
`.ko` file depending on kernel configuration:
+As a result, the builtin cryptography management infrastructure and 
cryptographic algorithms are statically linked into the kernel binary:
+```
+cryptographic algorithms source files → compiled as .o objfiles →  linked into 
vmlinux → single kernel binary
+```
+**With Standalone Cryptography Module:**
+This feature automatically transforms the builtin cryptographic components 
into a standalone cryptography module, `fips140.ko`. To do so, it develops a 
new kernel build rule `crypto-objs-$(CONFIG_CRYPTO_*)` such that, once this 
build rule is applied to a cryptographic algorithm, such cryptographic 
algorithm will be automatically collected into the cryptography module if it is 
configured as built-in (i.e, `CONFIG_CRYPTO_*=y`), for example:
+```
+// in crypto/asymmetric_keys/Makefile
+- obj-$(CONFIG_ASYMMETRIC_KEY_TYPE) += asymmetric_keys.o
++ crypto-objs-$(CONFIG_ASYMMETRIC_KEY_TYPE) += asymmetric_keys.o
+```
+Such build change allows the modularization transformation to only affect 
selected cryptographic algorithms (i.e, where the 
`crypto-objs-$(CONFIG_CRYPTO_*`) is applied).
+
+Then, after the `fips140.ko` is generated, it will be embedded back into main 
kernel vmlinux as a replacement part. The purpose of this embedding, instead of 
traditionally putting the `fips140.ko` into filesystem, is a preparation to 
allow the module to be loaded early enough even before the filesystem is ready.
+
+The new build process is illustrated below.
+```
+cryptographic algorithms source files → compiled as .o objfiles → 
automatically collected and linked into fips140.ko → embedded fips140.ko into 
vmlinux as a replaceable binary
+```
+
+### 3.2 Runtime Changes
+
+**Traditional Boot Process:**
+The kernel initializes the cryptographic subsystem early during boot, 
executing each cryptographic initialization routine accordingly. These 
initialization routines may depend on other cryptographic components or other 
kernel subsystems, so their invocation follows a well-defined execution order 
to ensure they are initialized before their first use.
+```
+kernel starts → cryptography subsystem initialization → cryptography subsystem 
available → other components use cryptography
+```
+**With Standalone Cryptography Module:**
+At the start of kernel boot, compared to a regular kernel, the first major 
change introduced by this feature is that no cryptography services are 
initially available — since the entire cryptography subsystem has been 
decoupled from the main kernel.
+To ensure that the cryptography subsystem becomes available early enough 
(before the first kernel component that requires cryptography services), the 
standalone cryptography kernel module must be loaded at a very early stage, 
even before the filesystem becomes available.
+
+However, the regular module loading mechanism relies on placing kernel modules 
in the filesystem and loading them from there, which creates a chicken-and-egg 
problem — the cryptography module cannot be loaded until the filesystem is 
ready, yet some kernel components may require cryptography services even before 
that point.
+
+To address this, the second change introduced by this feature is that the 
cryptography kernel module is loaded directly from memory, leveraging the 
earlier compilation changes that embed the module binary into the main kernel 
image. Afterward, the feature includes a “plug-in” mechanism that connects the 
decoupled cryptography subsystem back to the main kernel, ensuring that kernel 
cryptography users can correctly locate and invoke the cryptography routine 
entry points.
+
+Finally, to ensure proper initialization, the feature guarantees that all 
cryptography algorithms and the cryptography management infra execute their 
initialization routines in the exact same order as they would if they were 
built-in.
+
+The process described above is illustrated below.
+```
+kernel starts → no cryptography available → load fips140.ko from memory → plug 
cryptography back to kernel → module initialization → cryptographic services 
available → other components use cryptography
+```
+
+## 4. Design Implementation Details
+
+While the earlier sections provide a holistic view of how this feature shapes 
the kernel, this section provides deeper design details on how these 
functionalities are realized. There are three key design components:
+1. A specialized compile rule that automatically compiles and collects all 
built-in cryptographic algorithm object files to generate the final module 
binary under arbitrary kernel configurations, and then embeds the generated 
binary into the main kernel image for early loading.
+2. A mechanism to convert interactions between the cryptography subsystem and 
the main kernel into a pluggable interface.
+3. A module loading and initialization process that ensures the cryptography 
subsystem is properly initialized as if it were built-in.
+
+### 4.1. Specialized Compilation System
+
+**Automatic Collection and Linking of Built-in Cryptographic Algorithm 
Objects:**
+The first step in generating the `fips140.ko` module is to compile and collect 
built-in cryptographic components (i.e., those specified by 
`CONFIG_CRYPTO_*=y`).
+Traditionally, the existing module build process requires all module 
components (e.g., source files) to reside in a single directory. However, this 
approach is not suitable for our case, where hundreds of cryptographic 
algorithm source files are scattered across multiple directories.
+
+A naïve approach would be to create a separate Makefile that duplicates the 
original build rules with adjusted paths.
+However, this method is not scalable due to the large number of cryptographic 
build rules, many of which are highly customized and can vary under different 
Kconfig settings, making such a separate Makefile even more complex.
+Moreover, this approach cannot ensure that built-in cryptographic algorithms 
are completely removed from the main kernel, which would result in redundant 
cryptographic code being included in both the kernel and the module.
+
+To tackle this challenge, we automated the object collection and linking 
process by introducing special build logic for the kernel cryptography 
subsystem.
+Specifically, to automatically collect cryptography object files while 
preserving their original compilation settings (such as flags, headers, and 
paths), we introduced a new compilation rule:
+```
+crypto-objs-y += *.o
+```
+This replaces the original `obj-y += *.o` rule in cryptography Makefiles 
later, for example:
+```
+// in crypto/asymmetric_keys/Makefile
+- obj-$(CONFIG_ASYMMETRIC_KEY_TYPE) += asymmetric_keys.o
++ crypto-objs-$(CONFIG_ASYMMETRIC_KEY_TYPE) += asymmetric_keys.o
+asymmetric_keys-y := \
+    asymmetric_type.o \
+    restrict.o \
+    signature.o
+```
+in the cryptography subsystem Makefiles, allowing most of the existing 
Makefile logic to be reused.
+As a result, when the standalone cryptography module feature is enabled, any 
cryptographic algorithm configured as built-in (for example, 
`crypto-objs-$(CONFIG_ASYMMETRIC_KEY_TYPE) += asymmetric_keys.o` where 
`CONFIG_ASYMMETRIC_KEY_TYPE=y`) will be automatically collected and linked into 
a single final object binary, `fips140.o`.
+During this process, a special compilation flag (`-DFIPS_MODULE=1`) is applied 
to instruct each object file to be compiled in a module-specific manner. This 
flag will later be used to generate the pluggable interface on both the main 
kernel side and the module side from the same source code.
+
+The implementation details are as follows: it follows a similar methodology 
used by the `obj-y` collection process for building `vmlinux.o`. The 
`crypto-objs-y` rule is placed in `scripts/Makefile.build`, which is executed 
by each directory Makefile to collect the corresponding crypto object files. 
Each directory then creates a `crypto-module.a` archive that contains all 
`crypto-objs-y += <object>.o` files under that directory. In the parent 
directories, these `crypto-module.a` archives are recursively included into the 
parent’s own `crypto-module.a`, and this process continues upward until the 
final `fips140.o` is generated.
+
+**A Separate Module Generation Pipeline for Building the Final Kernel Module 
from Linked Cryptographic Algorithm Object:**
+With the linked cryptographic algorithm object (i.e., `fips140.o`), the next 
step is to generate the final kernel module, `fips140.ko`.
+
+A direct approach would be to inject the `fips140.ko` module build into the 
existing modules generation pipeline (i.e., `make modules`) by providing our 
pre-generated `fips140.o`. However, we choose not to do this because it would 
create a circular make rule dependency (which is invalid in Makefiles and 
causes build failures), resulting in mutual dependencies between the modules 
and vmlinux targets (i.e., `modules:vmlinux` and `vmlinux:modules` at the same 
time).
+This happens for the following reasons:
+1. Since we will later embed `fips140.ko` into the final kernel image (as 
described in the next section), we must make vmlinux depend on `fips140.ko`. In 
other words: `vmlinux: fips140.ko`.
+2. When the kernel is built with `CONFIG_DEBUG_INFO_BTF_MODULES=y`, it 
requires: modules: vmlinux. This is because `CONFIG_DEBUG_INFO_BTF_MODULES=y` 
takes vmlinux as input to generate BTF info for the module, and inserts such 
info into the `.ko` module by default.
+3. If we choose to inject `fips140.ko` into make modules, this would create a 
make rule dependency: `fips140.ko: modules`. Combined with items 1 and 2, this 
eventually creates an invalid circular dependency between vmlinux and modules.
+
+Due to these reasons, the design choice is to use a separate make pipeline 
(defined as `fips140-ready` in the Makefile). This new pipeline reuses the same 
module generation scripts used by make modules but adds additional logic in 
`scripts/Makefile.{modfinal|modinst|modpost}` and `scripts/mod/modpost.c` to 
handle module symbol generation and verification correctly. 
+
+**A Seamless Process That Embeds the Generated Binary Into the Main Kernel 
Image for Early Loading:**
+As mentioned earlier, in order to load the standalone cryptography module 
early in the boot process—before the filesystem is ready—the module binary must 
be embedded into the final kernel image (i.e., vmlinux) so that it can be 
loaded directly from memory.
+We intend for this embedding process to be completely seamless and 
automatically triggered whenever vmlinux is built (i.e., during `make vmlinux`).
+
+To achieve this, the feature adds a Make dependency rule so that vmlinux 
depends on `fips140.ko`.
+It also modifies the vmlinux link rules (i.e., 
`arch/<arch>/kernel/vmlinux.lds.S`, `scripts/Makefile.vmlinux`, and 
`scripts/link-vmlinux.sh`) so that the generated module binary is finally 
combined with `vmlinux.o`.
+
+In addition, we allow multiple cryptography module binary versions (for 
example, a certified cryptography binary and a latest, up-to-date but 
uncertified one) to be embedded into the main kernel image to serve different 
user needs. This design allows regular (non-FIPS) users to benefit from the 
latest cryptographic updates, while FIPS-mode users continue to use the 
certified cryptography module.
+
+To support this, we introduce an optional configuration, 
`CONFIG_CRYPTO_FIPS140_DUAL_VERSION`. When enabled, this option allows two 
cryptography module versions to be embedded within a single kernel build and 
ensures that the appropriate module is selected and loaded at boot time based 
on the system’s FIPS mode status.
+
+### 4.2. Pluggable Interface Between the Built-in Cryptography Subsystem and 
the Main Kernel
+
+Although the module binary (`fips140.ko`) has been embedded into the final 
kernel image (`vmlinux`) as described in the previous section, it is not linked 
to the kernel in any way. This is because `fips140.ko` is embedded in a 
data-only manner, so the main kernel cannot directly call any functions or 
access any data defined in the module binary. Since the main kernel and modules 
can only interact through exported symbols (i.e., via `EXPORT_SYMBOL()`), this 
also applies to the crypto kernel module — the main kernel can only interact 
with the crypto functions and variables defined in the crypto module through 
exported symbols, meaning these functions and variables must also have their 
symbols exported in the module after they are moved from the main kernel to the 
module.
+
+However, simply making these crypto symbols symbol-exported in the module 
without additional handling would cause the kernel to fail to compile. This is 
because the existing kernel module symbol resolution mechanism is essentially 
one-way: it supports symbols defined in the main kernel and referenced by 
kernel modules. However, it does not support the reverse case — symbols defined 
in a kernel module but used by the main kernel — which is exactly the crypto 
module case, as there are many crypto users still residing in the main kernel. 
The reason is that compilation of the main kernel requires all symbol addresses 
to be known to achieve a successful linking phase.
+
+To address this, we introduce a pluggable interface to support this 
reverse-direction symbol resolution between crypto symbols defined in the 
module and referenced by the main kernel, by placing **address placeholders** 
at all crypto usage points in the main kernel. These address placeholders are 
initially set to NULL during compilation to provide a concrete address that 
satisfies the linking phase. Then, during runtime, once the cryptography kernel 
module is loaded, these placeholders are updated to the correct addresses 
before their first use in the main kernel. In the rest of this section, we 
first introduce this pluggable interface mechanism, and then explain how to 
apply it to the built-in cryptographic algorithms and variables.
+
+**The Pluggable Interface Mechanism:**
+There are two types of address holders used to achieve this pluggable 
interface:
+- Function addresses (the majority): We use a trampoline to redirect the 
original jump instruction to another location whose target destination is held 
by the value of a function pointer. To avoid additional security concerns, such 
as the function pointer being arbitrarily modified, these function pointers are 
made `__ro_after_init` to ensure they cannot be modified after kernel init. We 
implement this function-address placeholder as the 
`DEFINE_CRYPTO_FN_REDIRECT()` wrapper.
+- Variable addresses (the remaining smaller portion): For these, we use a 
pointer of the corresponding data type. We implement this address placeholder 
as the `DECLARE_CRYPTO_VAR()` and `DEFINE_CRYPTO_API_STUB()` wrappers:
+
+These wrappers are applied to each symbol-exported (i.e., `EXPORT_SYMBOL()`) 
cryptographic function and variable (details on how to apply them are described 
later). Once applied, the wrappers are compiled differently for the main kernel 
and for the built-in cryptographic algorithm source code—acting as the “outlet” 
and the “plug,” respectively—using different compilation flags 
(`-DFIPS_MODULE`) introduced by our customized build rules described earlier.
+
+As a result, the kernel can successfully compile even when the built-in 
cryptographic algorithms are removed, thanks to these address placeholders. At 
boot time, the placeholders initially hold NULL, but since no cryptography 
users exist at that stage, the kernel can still start booting correctly. After 
the cryptography module is loaded, the placeholders are dynamically updated to 
the correct addresses later (by `do_crypto_var()` and `do_crypto_fn()`, 
described in a later section).
+
+**Applying the Pluggable Interface Mechanism to Cryptographic Algorithms:**
+
+To apply these pluggable interface wrappers to a cryptographic algorithm and 
make them take effect, we follow the steps below (using 
`crypto/asymmetric_keys/asymmetric_type.c` as an example):
+1. **Apply `crypto-objs-y` compile rule to the cryptographic algorithm:**
+```
+// in crypto/asymmetric_keys/Makefile
+- obj-$(CONFIG_ASYMMETRIC_KEY_TYPE) += asymmetric_keys.o
++ crypto-objs-$(CONFIG_ASYMMETRIC_KEY_TYPE) += asymmetric_keys.o
+asymmetric_keys-y := \
+    asymmetric_type.o \
+    restrict.o \
+    signature.o
+```
+2. **Locate the communication point between the cryptographic algorithm and 
the main kernel:**
+
+The cryptography subsystem is designed such that most interactions between the 
main kernel and cryptographic algorithms occur through exported symbols using 
`EXPORT_SYMBOL()` wrappers.
+This kernel design exists because most cryptographic algorithm implementations 
must support both built-in and modular modes. 
+
+Consequently, the cryptographic functions and variables exported by 
`EXPORT_SYMBOL()` are a well-defined and identifiable interface between the 
cryptography subsystem and the main kernel: 
+```
+// in crypto/asymmetric_keys/asymmetric_type.c 
+//Exported cryptographic function:
+bool asymmetric_key_id_same(const struct asymmetric_key_id *kid1,
+                const struct asymmetric_key_id *kid2) {...}
+EXPORT_SYMBOL_GPL(asymmetric_key_id_same); 
+//Exported cryptographic variable:
+struct key_type key_type_asymmetric = {...};
+EXPORT_SYMBOL_GPL(key_type_asymmetric); 
+```
+
+3. **Redirect crypto symbol references in the main kernel to address 
placeholders:**
+
+With the placeholders in place, the remaining problem is directing the main 
kernel to use them rather than the original symbols. Since all crypto users 
must include the corresponding header files to obtain function and variable 
declarations, the headers are a natural place to perform this redirection. Each 
declaration is transformed using a macro that hooks it to the corresponding 
placeholder.
+
+For exported variable symbols (a small number, ~10 symbols), their declaration 
in the header file is replaced with the `DECLARE_CRYPTO_VAR()` wrapper to 
redirect variable access from a concrete address to a placeholder:
+```
+// in include/keys/asymmetric-type.h
+// for exported cryptographic variables:
+- struct key_type key_type_asymmetric;
++ DECLARE_CRYPTO_VAR(CONFIG_ASYMMETRIC_KEY_TYPE, key_type_asymmetric, struct 
key_type, );
++ #if defined(CONFIG_CRYPTO_FIPS140_EXTMOD) && !defined(FIPS_MODULE) && 
IS_BUILTIN(CONFIG_ASYMMETRIC_KEY_TYPE)
++ #define key_type_asymmetric (*((struct 
key_type*)CRYPTO_VAR_NAME(key_type_asymmetric)))
++ #endif 
+```
+By doing so, we can automatically force all cryptography users to go through 
the placeholders, because those users already include the same header file.
+The wrapper also takes the cryptographic algorithm Kconfig symbol as a 
parameter, so that when a cryptographic algorithm is built as a module (for 
example, `CONFIG_ASYMMETRIC_KEY_TYPE=m`), the original function declarations 
remain unchanged and are not affected.
+
+For exported function symbols (the majority, ~hundreds), a similar approach 
could be taken, but instead we use an automated method to redirect function 
address usage to placeholders during the kernel compilation process. This makes 
the crypto module implementation less intrusive to the kernel source tree, as 
no header file modifications are needed. To achieve this, a linker option 
`--wrap=<symbols-to-redirect>` is leveraged to rename all uses of crypto 
functions in the main kernel to dedicated trampolines that act as address 
placeholders. As a consequence, all references to crypto function symbols are 
automatically redirected to the address placeholders, avoiding mass intrusive 
changes to the mainline kernel source tree.
+
+4. **Add the address-placeholder definition wrappers into a dedicated file 
`fips140-var-redirect.c`:**
+
+After redirecting crypto users to use address placeholders, we also need to 
add the definitions of those address placeholders.
+
+For exported variable symbols (a small number, ~10 symbols), add the 
placeholder definition wrapper `DEFINE_CRYPTO_VAR_STUB` to a dedicated file 
`fips140-var-redirect.c`.
+```
+// in crypto/fips140/fips140-var-redirect.c
+// for exported cryptographic variables:
++ #undef key_type_asymmetric
++ DEFINE_CRYPTO_VAR_STUB(key_type_asymmetric);
++ #endif
+```
+This file will be compiled separately and acts as both the “outlet” and the 
“plug” for the main kernel and the cryptography module, respectively.
+
+For exported function symbols (the majority, ~hundreds), a similar wrapper 
`DEFINE_CRYPTO_FN_REDIRECT()` is used, but again, the application of this 
wrapper is automated, so there is no need to manually apply it. Instead, it is 
generated automatically by the script 
`crypto/fips140/gen-fips140-fn-redirect.sh` on every kernel build.
+
+We apply the above steps to both architecture-independent and 
architecture-specific cryptographic algorithms.
+
+### 4.3. Initialization Synchronization
+
+To ensure the embedded `fips140.ko` module binary provides the same 
cryptography functionality as the regular kernel, the kernel needs:
+1. A module loader to load the module binary directly from memory,
+2. A mechanism to plug the module back into the kernel by updating the address 
placeholders, and
+3. Correct cryptography subsystem initialization, as if the cryptographic 
algorithms were still built-in.
+
+**Directly Load Module Binary from Memory:**
+Regular modules are loaded from the filesystem and undergo signature 
verification on the module binary, which relies on cryptographic operations. 
However, since we have already fully decoupled the cryptography subsystem, we 
must skip this step for this `fips140.ko` module.
+To achieve this, we add a new loader function `load_crypto_module_mem()` that 
can load the module binary directly from memory at the designed address without 
checking the signature. Since the module binary is embedded into main kernel in 
an ELF section, as specified in the linker script:
+```
+// in arch/<arch>/kernel/vmlinux.lds.S
+    .fips140_embedded : AT(ADDR(.fips140_embedded) - LOAD_OFFSET) {
+        . = ALIGN(8);
+        _binary_fips140_ko_start = .;
+        KEEP(*(.fips140_module_data))
+        _binary_fips140_ko_end = .;
+    }
+```
+Therefore, the runtime memory address of the module can be accessed directly 
by the module loader to invoke the new loader function 
`load_crypto_module_mem()`.
+
+**Plug Back the Module by Updating Address Placeholder Values:**
+To update the address placeholders in the main kernel to the correct addresses 
matching the loaded module, after compilation the placeholder values are placed 
into dedicated key-value data structures, which reside in ELF sections 
`__crypto_fn_keys` and `__crypto_var_keys`.
+This can be seen from the definition of the placeholder's key-value data 
structure:
+```
+#define __CRYPTO_FN_KEY(sym)                                   \
+       extern void *__fips140_fn_ptr_##sym;                    \
+       static struct _crypto_fn_key __##sym##_fn_key           \
+               __used                                          \
+               __section("__crypto_fn_keys")                   \ // Place in a 
dedicated ELF Section
+               __aligned(__alignof__(struct _crypto_fn_key)) = {       \
+               .ptr = (void **)&__fips140_fn_ptr_##sym,                \
+               .func = (void *)&sym,                           \
+       };
+
+#define DEFINE_CRYPTO_VAR_STUB(name) \
+    static struct crypto_var_key __crypto_##name##_var_key \
+        __used \
+        __section("__crypto_var_keys") \ // Place in a dedicated ELF Section
+        __aligned(__alignof__(struct crypto_var_key)) = \
+    { \
+        .ptr = &CRYPTO_VAR_NAME(name), \
+        .var = (void*)&name, \
+    };
+```
+The purpose of doing this is to allow the main kernel to quickly locate the 
placeholders and update them to the correct addresses. The update functions are 
defined as `do_crypto_var()` and `do_crypto_fn()`, which are executed at module 
load.
+
+As a result, all cryptography users in the main kernel can now call the 
cryptographic functions as if they were built-in.
+
+**Initialize Cryptography Subsystem as if it Were Built-in:**
+Cryptographic components must be properly initialized before use, and this 
initialization is typically achieved through dedicated initialization functions 
(e.g., `module_init(crypto_init_func)` or `late_initcall(crypto_init_func)`). 
Traditionally, these init functions are executed automatically as part of the 
kernel boot phase. However, now that they are moved to a crypto module, there 
needs to be a way to collect and execute them.
+
+To collect these init functions, the init wrappers (e.g., `module_init()` and 
`late_initcall`) are modified to automatically place the wrapped crypto init 
function into a dedicated list in the crypto module, for example:
+```
+// in include/linux/module.h
+#define subsys_initcall(fn) \
+       static initcall_t __used __section(".fips_initcall0") \ // a dedicated 
list
+               __fips_##fn = fn;
+
+#define module_init(initfn) \
+       static initcall_t __used __section(".fips_initcall1") \ // a dedicated 
list
+               __fips_##initfn = initfn;
+```
+By doing so, all init functions are now aggregated for execution by 
`run_initcalls()` at module load.
+
+Besides collecting these crypto init functions, rather than simply executing 
them, another key consideration is that their execution order often has strict 
requirements. In other words, for these collected crypto init functions, we 
must ensure that their initialization order is preserved as before because 
failure to follow the correct order can result in kernel panic.
+
+To address this, we introduce a synchronization mechanism between the main 
kernel and the module to ensure all cryptographic algorithms are executed in 
the correct kernel boot phase. In more details, we spawn the module 
initialization process `fips_loader_init()` as an async thread 
`fips140_sync_thread()`, in which we call `run_initcalls()` to execute the 
initialization calls of each cryptographic algorithm.
+Then, we introduce synchronization helpers such as 
`wait_until_fips140_level_sync(int level)` to ensure the initialization order 
of all cryptographic algorithms is synchronized with the main kernel.
+
+## 5. Customization and Extension of Cryptography Module
+
+This section describes how developers can customize which cryptographic 
algorithms are included in the standalone cryptography module, as well as 
extend this feature to other cryptographic algorithms or hardware architectures.
+
+### 5.1. Cryptography Selection Mechanism
+
+The feature automatically includes cryptographic algorithms that meet specific 
criteria:
+1. **Built-in Configuration**: Only cryptographic algorithms configured as 
`CONFIG_CRYPTO_*=y` are candidates for inclusion
+2. **Explicit Inclusion**: Cryptographic algorithms must be explicitly 
converted using the `crypto-objs-$(CONFIG__CRYPTO_*`) build rule
+
+### 5.2. Extend Support to New Cryptographic Algorithms
+
+To extend support to a new cryptographic algorithm in the standalone module, 
follow these steps:
+
+**Step 1: Update the Makefile**
+```
+# in crypto/[algorithm]/Makefile
+- obj-$(CONFIG_CRYPTO_ALGORITHM) += algorithm.o
++ crypto-objs-$(CONFIG_CRYPTO_ALGORITHM) += algorithm.o
+```
+For Architecture-Specific Cryptographic Algorithms:
+- Apply the `crypto-objs-` rule in the appropriate `arch/*/crypto/Makefile`
+
+**Step 2: Add Pluggable Interface Support**
+If the cryptographic algorithm has symbol-exported variables via 
`EXPORT_SYMBOL()`, add the pluggable interface wrappers. There is no need to 
manually apply wrappers for symbol-exported functions:
+```
+// Example: in include/keys/asymmetric-type.h
+// for exported cryptographic variables:
+- struct key_type key_type_asymmetric;
++ DECLARE_CRYPTO_VAR(CONFIG_ASYMMETRIC_KEY_TYPE, key_type_asymmetric, struct 
key_type, );
++ #if defined(CONFIG_CRYPTO_FIPS140_EXTMOD) && !defined(FIPS_MODULE) && 
IS_BUILTIN(CONFIG_ASYMMETRIC_KEY_TYPE)
++ #define key_type_asymmetric (*((struct 
key_type*)CRYPTO_VAR_NAME(key_type_asymmetric)))
++ #endif 
+```
+Then, add the corresponding stubs in `crypto/fips140/fips140-var-redirect.c`:
+```
++ #undef key_type_asymmetric
++ DEFINE_CRYPTO_VAR_STUB(key_type_asymmetric);
++ #endif
+```
+For Architecture-Specific Cryptographic Algorithms:
+- Include architecture-specific stubs in 
`arch/*/crypto/fips140/fips140-var-redirect.c`:
+
+### 5.3. Architecture-Specific Extensions
+
+**Extending to New Architectures:**
+Currently supported architectures are x86_64 and ARM64. To extend this feature 
to additional architectures:
+1. **Update Linker Scripts**: Add ELF sections in 
`arch/[new-arch]/kernel/vmlinux.lds.S`:
+```
+.fips140_embedded : AT(ADDR(.fips140_embedded) - LOAD_OFFSET) {
+    . = ALIGN(8);
+    _binary_fips140_ko_start = .;
+    KEEP(*(.fips140_module_data))
+    _binary_fips140_ko_end = .;
+}
+```
+2. **Create Architecture-Specific Files**: Set up 
`arch/[new-arch]/crypto/fips140/` directory with Makefile and 
`fips140-var-redirect.c` following the pattern used in x86_64 and ARM64.
+
+## 6. Reusing a certified module in practice
+
+With the crypto subsystem restructured as a loadable module, a previously 
certified module can be reused across kernel updates. Crypto development does 
not freeze in the meantime — updated modules can always be submitted for fresh 
certification. The reuse of an already-certified module is intended to simply 
bridge the gap until the new certification arrives, that is, using the old one 
before the new one gets certified. How much of the certified module to reuse, 
however, involves a tradeoff between crypto feature availability, certification 
turnaround time, and engineering effort.
+
+The most conservative option is no reuse at all: abandon the previous 
certification and submit the updated crypto module for a full FIPS 
certification. This allows the crypto subsystem to benefit from the latest 
upstream changes, but at the cost of the full 12-to-18-month waiting period, 
according to experiences from downstream distributions.
+
+At the opposite end, distributions can choose to reuse the certified module 
binary entirely on a newer kernel. This enables the new kernel to be validated 
on day one. The tradeoff is clear: besides forgoing crypto updates, engineers 
must ensure ABI compatibility between the updated kernel and the module.
+
+A middle ground is to reuse only the source code of the certified module, 
freezing it and recompiling against the updated main kernel. Since the source 
code remains unchanged, the new module can go through a Non-Security Relevant 
([NSRL](https://csrc.nist.gov/csrc/media/Projects/cryptographic-module-validation-program/documents/fips%20140-3/FIPS-140-3-CMVP%20Management%20Manual.pdf))
 process — a simpler FIPS re-certification that typically reduces the waiting 
time from 12–18 months down to 3–4 months. Compared to reusing the binary 
entirely, this option requires less engineering effort, since engineers need 
only maintain source-level API compatibility (i.e., by patching the main kernel 
source code) rather than binary-level ABI compatibility between the crypto 
module and the main kernel.
+
+In summary, converting the kernel crypto subsystem into a loadable module 
enables reuse of a certified module across kernel updates. Whether through 
binary reuse, source-code reuse, or fresh certification, different choices 
represent different tradeoffs, and distributions can balance crypto feature 
availability, certification turnaround time, and engineering effort according 
to their needs.
+
+---
+Written by Jay Wang <[email protected]> <[email protected]>, Amazon 
Linux
\ No newline at end of file
-- 
2.47.3


Reply via email to