On 11/08/18 14:33, Laszlo Ersek wrote: > On 11/08/18 03:58, Eric Dong wrote: >> In current implementation, core level semaphore use same container >> with package level semaphore. This design will let the core level >> semaphore not works as expected in below case: >> 1. Feature A has CPU_FEATURE_CORE_BEFORE dependence with Feature B. >> 2. Feature C has CPU_FEATURE_PACKAGE_AFTER dependence with Feature B. >> in this case an core level semaphore will be add between A and B, and >> an package level semaphore will be add between B and C. >> >> For a CPU has one package, two cores and 4 threads. Execute like below: >> >> Thread 1 Thread 2 ..... Thread 4 >> ReleaseSemaph(1,2) -| >> WaitForSemaph(1(2)) -|<-----------------------These two are Core Semaph >> ReleaseSemaph(1,2) -| >> WaitForSemaph(2) -| <--- Core Semaph >> >> ReleaseSemaph (1,2,3,4) -| >> WaitForSemaph (1(4)) -| <---------------- Package Semaph >> >> ReleaseSemaph(3,4) >> WaitForSemaph(4(2)) <- Core Semaph >> >> In above case, for thread 4, when it executes a core semaphore, i will >> found WaitForSemaph(4(2)) is met because Thread 1 has execute a package >> semaphore and ReleaseSemaph(4) for it before. This is not an expect >> behavior. Thread 4 should wait for thread 3 to do this. >> >> Fix this issue by separate the semaphore container for core level and >> package level. >> >> Cc: Laszlo Ersek <ler...@redhat.com> >> Cc: Ruiyu Ni <ruiyu...@intel.com> >> Contributed-under: TianoCore Contribution Agreement 1.1 >> Signed-off-by: Eric Dong <eric.d...@intel.com> >> --- >> UefiCpuPkg/PiSmmCpuDxeSmm/CpuS3.c | 21 ++++++++++++++------- >> 1 file changed, 14 insertions(+), 7 deletions(-) >> >> diff --git a/UefiCpuPkg/PiSmmCpuDxeSmm/CpuS3.c >> b/UefiCpuPkg/PiSmmCpuDxeSmm/CpuS3.c >> index a45e2dd3d7..65461485a4 100644 >> --- a/UefiCpuPkg/PiSmmCpuDxeSmm/CpuS3.c >> +++ b/UefiCpuPkg/PiSmmCpuDxeSmm/CpuS3.c >> @@ -41,9 +41,10 @@ typedef struct { >> // Flags used when program the register. >> // >> typedef struct { >> - volatile UINTN ConsoleLogLock; // Spinlock used to >> control console. >> - volatile UINTN MemoryMappedLock; // Spinlock used to >> program mmio >> - volatile UINT32 *SemaphoreCount; // Semaphore used to >> program semaphore. >> + volatile UINTN ConsoleLogLock; // Spinlock used to >> control console. >> + volatile UINTN MemoryMappedLock; // Spinlock used to >> program mmio >> + volatile UINT32 *CoreSemaphoreCount; // Semaphore used to >> program semaphore. >> + volatile UINT32 *PackageSemaphoreCount; // Semaphore used to >> program semaphore. >> } PROGRAM_CPU_REGISTER_FLAGS; >> >> // >> @@ -348,11 +349,12 @@ ProgramProcessorRegister ( >> ASSERT ( >> (ApLocation != NULL) && >> (CpuStatus->ValidCoreCountPerPackage != 0) && >> - (CpuFlags->SemaphoreCount) != NULL >> + (CpuFlags->CoreSemaphoreCount != NULL) && >> + (CpuFlags->PackageSemaphoreCount != NULL) >> ); >> - SemaphorePtr = CpuFlags->SemaphoreCount; >> switch (RegisterTableEntry->Value) { >> case CoreDepType: >> + SemaphorePtr = CpuFlags->CoreSemaphoreCount; >> // >> // Get Offset info for the first thread in the core which current >> thread belongs to. >> // >> @@ -373,6 +375,7 @@ ProgramProcessorRegister ( >> break; >> >> case PackageDepType: >> + SemaphorePtr = CpuFlags->PackageSemaphoreCount; >> ValidCoreCountPerPackage = (UINT32 >> *)(UINTN)CpuStatus->ValidCoreCountPerPackage; >> // >> // Get Offset info for the first thread in the package which >> current thread belongs to. >> @@ -1037,10 +1040,14 @@ GetAcpiCpuData ( >> ASSERT (mAcpiCpuData.ApLocation != 0); >> } >> if (CpuStatus->PackageCount != 0) { >> - mCpuFlags.SemaphoreCount = AllocateZeroPool ( >> + mCpuFlags.CoreSemaphoreCount = AllocateZeroPool ( >> sizeof (UINT32) * CpuStatus->PackageCount * >> CpuStatus->MaxCoreCount * >> CpuStatus->MaxThreadCount); >> - ASSERT (mCpuFlags.SemaphoreCount != NULL); >> + ASSERT (mCpuFlags.CoreSemaphoreCount != NULL); >> + mCpuFlags.PackageSemaphoreCount = AllocateZeroPool ( >> + sizeof (UINT32) * CpuStatus->PackageCount * >> + CpuStatus->MaxCoreCount * >> CpuStatus->MaxThreadCount); >> + ASSERT (mCpuFlags.PackageSemaphoreCount != NULL); >> } >> InitializeSpinLock((SPIN_LOCK*) &mCpuFlags.MemoryMappedLock); >> InitializeSpinLock((SPIN_LOCK*) &mCpuFlags.ConsoleLogLock); >> > > The patch looks OK, superficially speaking. > > Also this looks like a bugfix to a new feature already committed in this > development cycle, so I think it may go in during the hard feature freeze. > > I have some requests (no need to repost): > > (1) Please make sure there is a TianoCore BZ for this issue. > > (2) Please reference said BZ in the commit message. > > (For example, commit c60d36b4d1, for > <https://bugzilla.tianocore.org/show_bug.cgi?id=1307> missed the > reference to BZ#1307.) > > (3) Before pushing, please fix up the indentation of the > AllocateZeroPool() arguments (both calls). > > (4) Can you please file the bugzilla now about unifying the > implementation between RegisterCpuFeaturesLib and PiSmmCpuDxeSmm? > > (That's for the next development cycle, but we should report the BZ for > it at this point, I believe.)
Oh, sorry, I thought I had added my ACK, but it turns out I forgot about it. So, with the above points, Acked-by: Laszlo Ersek <ler...@redhat.com> Thanks Laszlo _______________________________________________ edk2-devel mailing list edk2-devel@lists.01.org https://lists.01.org/mailman/listinfo/edk2-devel