Hi Bob, Thanks.
I am a little bit confused by the "customized deepcopy" and "cache for uni file parser" data. These look like they are slowing down rather than speeding up the build. Regards, Leif On Wed, Dec 05, 2018 at 02:51:58AM +0000, Feng, Bob C wrote: > Hi, > > I have added the performance data in BZ > https://bugzilla.tianocore.org/show_bug.cgi?id=1288 . Please have a > review. > > Thanks, > Bob > > > -----Original Message----- > From: edk2-devel [mailto:edk2-devel-boun...@lists.01.org] On Behalf Of Feng, > Bob C > Sent: Sunday, November 11, 2018 8:41 AM > To: Leif Lindholm <leif.lindh...@linaro.org> > Cc: Carsey, Jaben <jaben.car...@intel.com>; edk2-devel@lists.01.org; Gao, > Liming <liming....@intel.com> > Subject: Re: [edk2] [Patch] BaseTools: Optimize string concatenation > > Hi Leif, > > I use my desktop to do the benchmark. > CPU: Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz > Memory: 16G > OS: Win10 > > I'll add the performance detailed data to BZ. > > -Bob > > -----Original Message----- > From: Leif Lindholm [mailto:leif.lindh...@linaro.org] > Sent: Friday, November 9, 2018 7:49 PM > To: Feng, Bob C <bob.c.f...@intel.com> > Cc: edk2-devel@lists.01.org; Carsey, Jaben <jaben.car...@intel.com>; Gao, > Liming <liming....@intel.com> > Subject: Re: [edk2] [Patch] BaseTools: Optimize string concatenation > > Hi Bob, > > On Fri, Nov 09, 2018 at 03:25:04AM +0000, Feng, Bob C wrote: > > Yes. I should show the data. > > > > My unites scripts is as below. The parameter lines is a string list > > which size is 43395. The test result is > > > > ''.join(String list) time: 0.042262 > > String += String time : 3.822699 > > > > def TestPlus(lines): > > str_target = "" > > > > for line in lines: > > str_target += line > > > > return str_target > > > > def TestJoin(lines): > > str_target = [] > > > > for line in lines: > > str_target.append(line) > > > > return "".join(str_target) > > > > def CompareStrCat(): > > lines = GetStrings() > > print (len(lines)) > > > > begin = time.perf_counter() > > for _ in range(10): > > TestJoin(lines) > > end = time.perf_counter() - begin > > print ("''.join(String list) time: %f" % end) > > > > begin = time.perf_counter() > > for _ in range(10): > > TestPlus(lines) > > end = time.perf_counter() - begin > > print ("String += String time: %f" % end) > > > > For build OvmfX64, it's not very effective, it saves 2~3 second in > > Parse/AutoGen phase, because OvmfX64 is relatively simple. It does not > > enable much features such as Multiple SKU and structure PCD by default > > and there is no big size Autogen.c/Autogen.h/Makefile generated > > either. but for the complex platform, this patch will be much > > effective. The unites above simulates a real case that there is a > > 43395 lines of Autogen.c generated. > > I beg to differ. Shaving 2-3 seconds off the autogen phase is a substantial > improvement. > > However, on my wimpy 24-core Cortex-A53 system, the effect is not noticeable > (fluctuates between 1:56-1:58 whether with or without this patch). > > And even on my x86 workstation, I see no measurable difference (12-13s). > What is the hardware you are benchmarking on? > > It does not appear to be detrimental to performance on any of my platforms, > but I would like to see some more measurements, and I would like to see that > logged with some more detail in bugzilla. > > Regards, > > Leif > > > Since this patch mostly effect the Parser/AutoGen phase, I just use > > "build genmake" to show the improvement data. > > The final result for clean build is: > > Current code: 17 seconds > > After patch: 15 seconds > > > > Details: > > Current data: > > > > d:\edk2 (master -> origin) > > λ build genmake -p OvmfPkg\OvmfPkgIa32X64.dsc -a IA32 -a X64 -t > > VS2015x86 Build environment: Windows-10-10.0.10240 Build start time: > > 10:12:32, Nov.09 2018 > > > > WORKSPACE = d:\edk2 > > ECP_SOURCE = d:\edk2\edkcompatibilitypkg > > EDK_SOURCE = d:\edk2\edkcompatibilitypkg > > EFI_SOURCE = d:\edk2\edkcompatibilitypkg > > EDK_TOOLS_PATH = d:\edk2\basetools > > EDK_TOOLS_BIN = d:\edk2\basetools\bin\win32 > > CONF_PATH = d:\edk2\conf > > > > Architecture(s) = IA32 X64 > > Build target = DEBUG > > Toolchain = VS2015x86 > > > > Active Platform = d:\edk2\OvmfPkg\OvmfPkgIa32X64.dsc > > Flash Image Definition = d:\edk2\OvmfPkg\OvmfPkgIa32X64.fdf > > > > Processing meta-data ....... done! > > Generating code . done! > > Generating makefile . done! > > Generating code .. done! > > Generating makefile ...... done! > > > > - Done - > > Build end time: 10:12:49, Nov.09 2018 > > Build total time: 00:00:17 > > > > After applying this patch: > > > > d:\edk2 (master -> origin) > > λ build genmake -p OvmfPkg\OvmfPkgIa32X64.dsc -a IA32 -a X64 - > > Build environment: Windows-10-10.0.10240 > > Build start time: 10:11:41, Nov.09 2018 > > > > WORKSPACE = d:\edk2 > > ECP_SOURCE = d:\edk2\edkcompatibilitypkg > > EDK_SOURCE = d:\edk2\edkcompatibilitypkg > > EFI_SOURCE = d:\edk2\edkcompatibilitypkg > > EDK_TOOLS_PATH = d:\edk2\basetools > > EDK_TOOLS_BIN = d:\edk2\basetools\bin\win32 > > CONF_PATH = d:\edk2\conf > > > > > > Architecture(s) = IA32 X64 > > Build target = DEBUG > > Toolchain = VS2015x86 > > > > Active Platform = d:\edk2\OvmfPkg\OvmfPkgIa32X64.dsc > > Flash Image Definition = d:\edk2\OvmfPkg\OvmfPkgIa32X64.fdf > > > > Processing meta-data ..... done! > > Generating code . done! > > Generating makefile . done! > > Generating code .. done! > > Generating makefile ...... done! > > > > - Done - > > Build end time: 10:11:56, Nov.09 2018 > > Build total time: 00:00:15 > > > > > > Thanks, > > Bob > > > > > > -----Original Message----- > > From: Leif Lindholm [mailto:leif.lindh...@linaro.org] > > Sent: Friday, November 9, 2018 12:53 AM > > To: Feng, Bob C <bob.c.f...@intel.com> > > Cc: edk2-devel@lists.01.org; Carsey, Jaben <jaben.car...@intel.com>; > > Gao, Liming <liming....@intel.com> > > Subject: Re: [edk2] [Patch] BaseTools: Optimize string concatenation > > > > On Thu, Nov 08, 2018 at 06:16:25PM +0800, BobCF wrote: > > > https://bugzilla.tianocore.org/show_bug.cgi?id=1288 > > > > > > This patch is one of build tool performance improvement series > > > patches. > > > > > > This patch is going to use join function instead of string += > > > string2 statement. > > > > > > Current code use string += string2 in a loop to combine a string. > > > while creating a string list in a loop and using > > > "".join(stringlist) after the loop will be much faster. > > > > Do you have any numbers on the level of improvement seen? > > > > Either for the individual scripts when called identically, or (if > > measurable) on the build of an entire platform (say OvmfX64?). > > > > Regards, > > > > Leif > > > > > Contributed-under: TianoCore Contribution Agreement 1.1 > > > Signed-off-by: BobCF <bob.c.f...@intel.com> > > > Cc: Liming Gao <liming....@intel.com> > > > Cc: Jaben Carsey <jaben.car...@intel.com> > > > --- > > > BaseTools/Source/Python/AutoGen/StrGather.py | 39 +++++++++++++------ > > > BaseTools/Source/Python/Common/Misc.py | 21 +++++----- > > > .../Source/Python/Workspace/InfBuildData.py | 4 +- > > > .../Python/Workspace/WorkspaceCommon.py | 11 ++---- > > > 4 files changed, 44 insertions(+), 31 deletions(-) > > > > > > diff --git a/BaseTools/Source/Python/AutoGen/StrGather.py > > > b/BaseTools/Source/Python/AutoGen/StrGather.py > > > index 361d499076..d34a9e9447 100644 > > > --- a/BaseTools/Source/Python/AutoGen/StrGather.py > > > +++ b/BaseTools/Source/Python/AutoGen/StrGather.py > > > @@ -135,11 +135,11 @@ def AscToHexList(Ascii): > > > # @param UniGenCFlag UniString is generated into AutoGen C file > > > when it is set to True > > > # > > > # @retval Str: A string of .h file content > > > # > > > def CreateHFileContent(BaseName, UniObjectClass, IsCompatibleMode, > > > UniGenCFlag): > > > - Str = '' > > > + Str = [] > > > ValueStartPtr = 60 > > > Line = COMMENT_DEFINE_STR + ' ' + LANGUAGE_NAME_STRING_NAME + ' ' * > > > (ValueStartPtr - len(DEFINE_STR + LANGUAGE_NAME_STRING_NAME)) + > > > DecToHexStr(0, 4) + COMMENT_NOT_REFERENCED > > > Str = WriteLine(Str, Line) > > > Line = COMMENT_DEFINE_STR + ' ' + > > > PRINTABLE_LANGUAGE_NAME_STRING_NAME + ' ' * (ValueStartPtr - > > > len(DEFINE_STR + PRINTABLE_LANGUAGE_NAME_STRING_NAME)) + DecToHexStr(1, > > > 4) + COMMENT_NOT_REFERENCED > > > Str = WriteLine(Str, Line) > > > @@ -164,16 +164,16 @@ def CreateHFileContent(BaseName, UniObjectClass, > > > IsCompatibleMode, UniGenCFlag): > > > Line = COMMENT_DEFINE_STR + ' ' + Name + ' ' + > > > DecToHexStr(Token, 4) + COMMENT_NOT_REFERENCED > > > else: > > > Line = COMMENT_DEFINE_STR + ' ' + Name + ' ' * > > > (ValueStartPtr - len(DEFINE_STR + Name)) + DecToHexStr(Token, 4) + > > > COMMENT_NOT_REFERENCED > > > UnusedStr = WriteLine(UnusedStr, Line) > > > > > > - Str = ''.join([Str, UnusedStr]) > > > + Str.extend( UnusedStr) > > > > > > Str = WriteLine(Str, '') > > > if IsCompatibleMode or UniGenCFlag: > > > Str = WriteLine(Str, 'extern unsigned char ' + BaseName + > > > 'Strings[];') > > > - return Str > > > + return "".join(Str) > > > > > > ## Create a complete .h file > > > # > > > # Create a complet .h file with file header and file content # @@ > > > -185,11 +185,11 @@ def CreateHFileContent(BaseName, UniObjectClass, > > > IsCompatibleMode, UniGenCFlag): > > > # @retval Str: A string of complete .h file > > > # > > > def CreateHFile(BaseName, UniObjectClass, IsCompatibleMode, UniGenCFlag): > > > HFile = WriteLine('', CreateHFileContent(BaseName, > > > UniObjectClass, IsCompatibleMode, UniGenCFlag)) > > > > > > - return HFile > > > + return "".join(HFile) > > > > > > ## Create a buffer to store all items in an array # > > > # @param BinBuffer Buffer to contain Binary data. > > > # @param Array: The array need to be formatted > > > @@ -209,11 +209,11 @@ def CreateBinBuffer(BinBuffer, Array): > > > # > > > def CreateArrayItem(Array, Width = 16): > > > MaxLength = Width > > > Index = 0 > > > Line = ' ' > > > - ArrayItem = '' > > > + ArrayItem = [] > > > > > > for Item in Array: > > > if Index < MaxLength: > > > Line = Line + Item + ', ' > > > Index = Index + 1 > > > @@ -221,11 +221,11 @@ def CreateArrayItem(Array, Width = 16): > > > ArrayItem = WriteLine(ArrayItem, Line) > > > Line = ' ' + Item + ', ' > > > Index = 1 > > > ArrayItem = Write(ArrayItem, Line.rstrip()) > > > > > > - return ArrayItem > > > + return "".join(ArrayItem) > > > > > > ## CreateCFileStringValue > > > # > > > # Create a line with string value > > > # > > > @@ -236,11 +236,11 @@ def CreateArrayItem(Array, Width = 16): > > > > > > def CreateCFileStringValue(Value): > > > Value = [StringBlockType] + Value > > > Str = WriteLine('', CreateArrayItem(Value)) > > > > > > - return Str > > > + return "".join(Str) > > > > > > ## GetFilteredLanguage > > > # > > > # apply get best language rules to the UNI language code list # @@ > > > -438,11 +438,11 @@ def CreateCFileContent(BaseName, UniObjectClass, > > > IsCompatibleMode, UniBinBuffer, > > > # > > > # Join package data > > > # > > > AllStr = Write(AllStr, Str) > > > > > > - return AllStr > > > + return "".join(AllStr) > > > > > > ## Create end of .c file > > > # > > > # Create end of .c file > > > # > > > @@ -465,11 +465,11 @@ def CreateCFileEnd(): > > > # > > > def CreateCFile(BaseName, UniObjectClass, IsCompatibleMode, FilterInfo): > > > CFile = '' > > > CFile = WriteLine(CFile, CreateCFileContent(BaseName, > > > UniObjectClass, IsCompatibleMode, None, FilterInfo)) > > > CFile = WriteLine(CFile, CreateCFileEnd()) > > > - return CFile > > > + return "".join(CFile) > > > > > > ## GetFileList > > > # > > > # Get a list for all files > > > # > > > @@ -572,17 +572,34 @@ def GetStringFiles(UniFilList, SourceFileList, > > > IncludeList, IncludePathList, Ski > > > > > > # > > > # Write an item > > > # > > > def Write(Target, Item): > > > - return ''.join([Target, Item]) > > > + if isinstance(Target,str): > > > + Target = [Target] > > > + if not Target: > > > + Target = [] > > > + if isinstance(Item,list): > > > + Target.extend(Item) > > > + else: > > > + Target.append(Item) > > > + return Target > > > > > > # > > > # Write an item with a break line > > > # > > > def WriteLine(Target, Item): > > > - return ''.join([Target, Item, '\n']) > > > + if isinstance(Target,str): > > > + Target = [Target] > > > + if not Target: > > > + Target = [] > > > + if isinstance(Item, list): > > > + Target.extend(Item) > > > + else: > > > + Target.append(Item) > > > + Target.append('\n') > > > + return Target > > > > > > # This acts like the main() function for the script, unless it is > > > 'import'ed into another # script. > > > if __name__ == '__main__': > > > EdkLogger.info('start') > > > diff --git a/BaseTools/Source/Python/Common/Misc.py > > > b/BaseTools/Source/Python/Common/Misc.py > > > index 80236db160..8dcbe141ae 100644 > > > --- a/BaseTools/Source/Python/Common/Misc.py > > > +++ b/BaseTools/Source/Python/Common/Misc.py > > > @@ -777,21 +777,21 @@ class TemplateString(object): > > > > > > return "".join(StringList) > > > > > > ## Constructor > > > def __init__(self, Template=None): > > > - self.String = '' > > > + self.String = [] > > > self.IsBinary = False > > > self._Template = Template > > > self._TemplateSectionList = self._Parse(Template) > > > > > > ## str() operator > > > # > > > # @retval string The string replaced > > > # > > > def __str__(self): > > > - return self.String > > > + return "".join(self.String) > > > > > > ## Split the template string into fragments per the ${BEGIN} and > > > ${END} flags > > > # > > > # @retval list A list of TemplateString.Section objects > > > # > > > @@ -835,13 +835,16 @@ class TemplateString(object): > > > # @param Dictionary The placeholder dictionaries > > > # > > > def Append(self, AppendString, Dictionary=None): > > > if Dictionary: > > > SectionList = self._Parse(AppendString) > > > - self.String += "".join(S.Instantiate(Dictionary) for S in > > > SectionList) > > > + self.String.append( "".join(S.Instantiate(Dictionary) > > > + for S in SectionList)) > > > else: > > > - self.String += AppendString > > > + if isinstance(AppendString,list): > > > + self.String.extend(AppendString) > > > + else: > > > + self.String.append(AppendString) > > > > > > ## Replace the string template with dictionary of placeholders > > > # > > > # @param Dictionary The placeholder dictionaries > > > # > > > @@ -1741,27 +1744,21 @@ class PathClass(object): > > > # > > > # @retval False The two PathClass are different > > > # @retval True The two PathClass are the same > > > # > > > def __eq__(self, Other): > > > - if isinstance(Other, type(self)): > > > - return self.Path == Other.Path > > > - else: > > > - return self.Path == str(Other) > > > + return self.Path == str(Other) > > > > > > ## Override __cmp__ function > > > # > > > # Customize the comparsion operation of two PathClass > > > # > > > # @retval 0 The two PathClass are different > > > # @retval -1 The first PathClass is less than the second PathClass > > > # @retval 1 The first PathClass is Bigger than the second > > > PathClass > > > def __cmp__(self, Other): > > > - if isinstance(Other, type(self)): > > > - OtherKey = Other.Path > > > - else: > > > - OtherKey = str(Other) > > > + OtherKey = str(Other) > > > > > > SelfKey = self.Path > > > if SelfKey == OtherKey: > > > return 0 > > > elif SelfKey > OtherKey: > > > diff --git a/BaseTools/Source/Python/Workspace/InfBuildData.py > > > b/BaseTools/Source/Python/Workspace/InfBuildData.py > > > index 44d44d24eb..d615cccdf7 100644 > > > --- a/BaseTools/Source/Python/Workspace/InfBuildData.py > > > +++ b/BaseTools/Source/Python/Workspace/InfBuildData.py > > > @@ -612,11 +612,13 @@ class InfBuildData(ModuleBuildClassObject): > > > for Record in RecordList: > > > Lib = Record[0] > > > Instance = Record[1] > > > if Instance: > > > Instance = NormPath(Instance, self._Macros) > > > - RetVal[Lib] = Instance > > > + RetVal[Lib] = Instance > > > + else: > > > + RetVal[Lib] = None > > > return RetVal > > > > > > ## Retrieve library names (for Edk.x style of modules) > > > @cached_property > > > def Libraries(self): > > > diff --git a/BaseTools/Source/Python/Workspace/WorkspaceCommon.py > > > b/BaseTools/Source/Python/Workspace/WorkspaceCommon.py > > > index 8d8a3e2789..55d01fa4b2 100644 > > > --- a/BaseTools/Source/Python/Workspace/WorkspaceCommon.py > > > +++ b/BaseTools/Source/Python/Workspace/WorkspaceCommon.py > > > @@ -126,17 +126,14 @@ def GetModuleLibInstances(Module, Platform, > > > BuildDatabase, Arch, Target, Toolcha > > > while len(LibraryConsumerList) > 0: > > > M = LibraryConsumerList.pop() > > > for LibraryClassName in M.LibraryClasses: > > > if LibraryClassName not in LibraryInstance: > > > # override library instance for this module > > > - if LibraryClassName in > > > Platform.Modules[str(Module)].LibraryClasses: > > > - LibraryPath = > > > Platform.Modules[str(Module)].LibraryClasses[LibraryClassName] > > > - else: > > > - LibraryPath = > > > Platform.LibraryClasses[LibraryClassName, ModuleType] > > > - if LibraryPath is None or LibraryPath == "": > > > - LibraryPath = M.LibraryClasses[LibraryClassName] > > > - if LibraryPath is None or LibraryPath == "": > > > + LibraryPath = > > > Platform.Modules[str(Module)].LibraryClasses.get(LibraryClassName,Platform.LibraryClasses[LibraryClassName, > > > ModuleType]) > > > + if LibraryPath is None: > > > + LibraryPath = M.LibraryClasses.get(LibraryClassName) > > > + if LibraryPath is None: > > > if FileName: > > > EdkLogger.error("build", > > > RESOURCE_NOT_AVAILABLE, > > > "Instance of library class > > > [%s] is not found" % LibraryClassName, > > > File=FileName, > > > ExtraData="in [%s] > > > [%s]\n\tconsumed by module [%s]" % (str(M), Arch, str(Module))) > > > -- > > > 2.19.1.windows.1 > > > > > > _______________________________________________ > > > edk2-devel mailing list > > > edk2-devel@lists.01.org > > > https://lists.01.org/mailman/listinfo/edk2-devel > _______________________________________________ > edk2-devel mailing list > edk2-devel@lists.01.org > https://lists.01.org/mailman/listinfo/edk2-devel _______________________________________________ edk2-devel mailing list edk2-devel@lists.01.org https://lists.01.org/mailman/listinfo/edk2-devel