On 2021-01-20 11:40, Chunyan Zhang wrote:
[...]
+     pgt_base_iova = dom->pgt_va +
+             ((iova - mdata->iova_start) >> SPRD_IOMMU_PAGE_SHIFT);
+
+     spin_lock_irqsave(&dom->pgtlock, flags);
+     for (i = 0; i < page_num; i++) {
+             pgt_base_iova[i] = pabase >> SPRD_IOMMU_PAGE_SHIFT;

Out of curiosity, is the pagetable walker cache-coherent, or is this
currently managing to work by pure chance and natural cache churn?

->iotlb_sync_map() was implemented in this driver, I guess that has
done what you say here?

No, sync_map only ensures that the previous (invalid) PTE isn't held in the IOMMU's TLB. If pgt_va is a regular page allocation then you're writing the new PTE to normal kernel memory, with nothing to guarantee that write goes any further than the CPU's L1 cache. Thus either the IOMMU has capable of snooping the CPU caches in order to see the updated PTE value (rather than refetching the stale value from DRAM), or you're just incredibly lucky that by the time the IOMMU *does* go to fetch the PTE for that address, that updated cache line has already been evicted out to DRAM naturally.

This is not an issue if you use the proper DMA allocator, since that will ensure you get a non-cacheable buffer if you need one.

Robin.

Reply via email to