** Changed in: linux-aws (Ubuntu Noble)
Status: In Progress => Fix Committed
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-aws in Ubuntu.
https://bugs.launchpad.net/bugs/2101185
Title:
Backport patches to support NVIDIA GB200
Status in linux-aws package in Ubuntu:
New
Status in linux-aws source package in Noble:
Fix Committed
Status in linux-aws source package in Oracular:
In Progress
Bug description:
[Impact]
AWS requested that the following patchsets, which add GB200 support, be
backported to 6.8 and newer kernels:
#1: Update SMMUv3 to the modern iommu API (part 2b/3)
(https://lore.kernel.org/linux-iommu/[email protected]/)
#2: Add Tegra241 (Grace) CMDQV Support (part 1/2)
(https://lore.kernel.org/linux-iommu/[email protected]/)
#3: iommu/tegra241-cmdqv: Fix alignment failure at max_n_shift
(https://lore.kernel.org/all/[email protected]/)
[Fix]
Oracular:
- Patchset #1 is already included in the o:aws tree.
- All patches from patchset #2 are clean cherry-picks. Follow-up fixes to
commit 918eb5c856f6 ("iommu/arm-smmu-v3: Add in-kernel support for NVIDIA
Tegra241 (Grace) CMDQV"), which belongs to patchset #2, were also added;
these were clean cherry-picks as well.
- Patchset #3's only patch and one fix to it were clean cherry-picks.
Noble:
Backporting patchset #1 required backporting the patchsets it was built on
top of:
- All patches from "Update SMMUv3 to the modern iommu API (part 1/3)"
(https://lore.kernel.org/all/[email protected]/),
except for "iommu/arm-smmu-v3: Hold arm_smmu_asid_lock during all of
attach_dev", which is already in the tree. Also needed:
* "iommu/arm-smmu-v3: Add cpu_to_le64() around STRTAB_STE_0_V", as a fix
to "iommu/arm-smmu-v3: Make STE programming independent of the callers".
- Parts of "Update SMMUv3 to the modern iommu API (part 2/3)"
(https://lore.kernel.org/all/[email protected]/).
- All patches from "Make the SMMUv3 CD logic match the new STE design"
(part 2a/3)
(https://lore.kernel.org/all/[email protected]/).
Also needed:
* "iommu: Introduce iommu_group_mutex_assert()", as a pre-req for
"iommu/arm-smmu-v3: Move the CD generation for SVA into a function".
* "iommu/arm-smmu-v3: Fix access for STE.SHCFG", as a pre-req for
"iommu/arm-smmu-v3: Add unit tests for arm_smmu_write_entry".
* "iommu/arm-smmu-v3: Avoid uninitialized asid in case of error", as a
fix for "iommu/arm-smmu-v3: Build the whole CD in arm_smmu_make_s1_cd()".
Other pre-reqs for patchset #1 included:
- "iommu: Pass domain to remove_dev_pasid() op", as a pre-req for
"iommu/arm-smmu-v3: Put the SVA mmu notifier in the smmu_domain".
- A backport of "iommu: Add ops->domain_alloc_sva()", as a pre-req for
"iommu/arm-smmu-v3: Convert to domain_alloc_sva()".
Patchset #2 consisted of clean cherry-picks, with one exception:
"iommu/arm-smmu-v3: Add acpi_smmu_iort_probe_model for impl".
Additionally, these were needed:
* "iommu/arm-smmu-v3: Make the kunit into a module" and "iommu/arm-smmu-v3:
Use *-y instead of *-objs in Makefile", so that "iommu/arm-smmu-v3: Add
in-kernel support for NVIDIA Tegra241 (Grace) CMDQV" cherry-picks cleanly.
* "iommu/arm-smmu-v3: add missing MODULE_DESCRIPTION() macro", as a fix for
"iommu/arm-smmu-v3: Make the kunit into a module".
* Other `Fixes` to commit 918eb5c856f6 ("iommu/arm-smmu-v3: Add in-kernel
support for NVIDIA Tegra241 (Grace) CMDQV"), which were clean cherry-picks.
Patchset #3's only patch and one fix to it were clean cherry-picks.
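The follow-up fixes mentioned above are the kind of commit that carries a
`Fixes:` trailer naming the commit it repairs. As an illustration only, here
is a minimal Python sketch of how such fixes could be located. It assumes the
commit messages have already been collected as (sha, message) pairs, for
example from `git log`; the helper names `refers_to` and `find_fixes` are
hypothetical, and this is not necessarily how the fixes for this bug were
found.

```python
import re

# Kernel-style trailer, e.g.:
#   Fixes: 918eb5c856f6 ("iommu/arm-smmu-v3: Add in-kernel support ...")
FIXES_RE = re.compile(r"^\s*Fixes:\s*([0-9a-f]{7,40})\b", re.MULTILINE)


def refers_to(fixes_sha, target):
    """True if two (possibly abbreviated) hashes name the same commit.

    Trailers usually carry a 12-character abbreviation, so accept a
    prefix match in either direction.
    """
    a, b = fixes_sha.lower(), target.lower()
    return a.startswith(b) or b.startswith(a)


def find_fixes(commits, target):
    """Return the shas of commits whose Fixes: trailer points at `target`.

    `commits` is an iterable of (sha, full_commit_message) pairs
    (hypothetical input shape chosen for this sketch).
    """
    hits = []
    for sha, message in commits:
        if any(refers_to(m, target) for m in FIXES_RE.findall(message)):
            hits.append(sha)
    return hits
```

Calling `find_fixes(commits, "918eb5c856f6")` on the upstream history would
return the candidate fixes to review; each one still has to be checked for
applicability to the target tree before it is cherry-picked.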
[Test Case]
Compile and boot tested.
Tested by AWS.
[Where problems could occur]
These patches introduce significant refactoring and new features to the
SMMUv3 driver. Additionally, support for Tegra241's CMDQ-Virtualization
(CMDQV) is being added.
[Other info]
SF #00404773