Re: -warmup

2023-08-30 Thread Werner LEMBERG
> Something else: I think that the 'TOTAL' line doesn't make sense > right now. Please separate this line slightly from the rest of the > table and print the *cumulated timing* (in 's', not 'µs') of all > tests, something like > > Total duration for all tests: 25.3s > > and > > Total

Re: -warmup

2023-08-29 Thread Werner LEMBERG
>> I still think that for such cases the number of iterations of the >> affected tests should be increased to get more precise values. > > the times are for single iteration. (chunk median/chunk size) Yes, but the number of iterations is the same regardless whether a test takes 10µs or 1000µs –

Re: -warmup

2023-08-29 Thread Ahmet Göksu
> I still think that for such cases the number of iterations of the affected > tests should be increased to get more precise values. the times are for single iteration. (chunk median/chunk size) > Please separate this line slightly from the rest of the table > and print the *cumulated timing* (in

Re: -warmup

2023-08-29 Thread Werner LEMBERG
> here is the results with chris’ suggestion. (thanks chris) Much better, thanks! > still a bit noise on only load and load_advances. are results > acceptable? As far as I can see, the biggest differences occur if the 'Baseline' and 'Benchmark' columns contain very small values. I still

Re: -warmup

2023-08-29 Thread Ahmet Göksu
hi, here is the results with chris’ suggestion. (thanks chris) i will check hyperfine. still a bit noise on only load and load_advances. are results acceptable? Best, Goksu goksu.in On 28 Aug 2023 21:19 +0300, Werner LEMBERG , wrote: > > code Freetype Benchmark Results Warning: Baseline and

Re: -warmup

2023-08-28 Thread Werner LEMBERG
>> Should I proceed to detect outliers? Since we do not get the same >> error rate consistently, I think we will not find the target we >> expected by outliers. > > Why do you think so? Please explain your reasoning. Just remember > that backup processes (like cleaning up the hard disk,

Re: -warmup

2023-08-21 Thread Werner LEMBERG
Ahmet, > I have edited the code aligning with the Hin-Tak’s suggestion. Here > is the two results pages, also pushed on gitlab. Thanks. It seems to me we are getting nearer. However, there are still large differences. * Chris mentioned a potential problem with `clock_gettime` in the code

Re: -warmup

2023-08-21 Thread Werner LEMBERG
>> To summarize: Benchmark comparisons only work if there is a sound >> mathematical foundation to reduce the noise. > > I am probably not qualified, but I am following the discussion for > some time. And I think there is a problem with the benchmarking > itself. If I understand correctly the

Re: -warmup

2023-08-18 Thread Ahmet Göksu
Hİ, I have edited the code aligning with the Hin-Tak’s suggestion. Here is the two results pages, also pushed on gitlab. Best, Goksu goksu.in On 18 Aug 2023 14:02 +0300, Werner LEMBERG , wrote: > > > What happens if you use, say, `-c 10', just running the > > > `Get_Char_Index` test? Are the

Re: -warmup

2023-08-18 Thread chris
On Fri, 18 Aug 2023 11:02:49 + (UTC), Werner LEMBERG wrote: > To summarize: Benchmark comparisons only work if there is a sound > mathematical foundation to reduce the noise. I am probably not qualified, but I am following the discussion for some time. And I think there is a problem with the

Re: -warmup

2023-08-18 Thread Werner LEMBERG
>> What happens if you use, say, `-c 10', just running the >> `Get_Char_Index` test? Are the percental timing differences then >> still that large? > Actually Get_Char_Index, on the three pages I have sent in the > prev. mail, is higher than 6% only 4 times out of 15 total. (which is > seem on

Re: -warmup

2023-08-18 Thread Ahmet Göksu
Hi, The approach we initially took was, in fact, based on the principle of the interquartile range (IQR) – a method that excludes outliers by determining the range between the first and third quartiles. However, I understand from your feedback that directly focusing on the median and quantiles

Re: -warmup

2023-08-18 Thread Hin-Tak Leung
On Friday, 18 August 2023 at 00:21:41 BST, Ahmet Göksu wrote: > about outliers, i splitted every tests into chuncks that is sized 100. Made > IQR calculations and calculated average time on valid chunks. you can find > the result in the attachment also pushed to gitlab. > also,

Re: -warmup

2023-08-17 Thread Ahmet Göksu
> What happens if you use, say, `-c 10', just running the > `Get_Char_Index` test? Are the percental timing differences then > still that large? Actually Get_Char_Index, on the three pages I have sent in the prev. mail, is higher than 6% only 4 times out of 15 total. (which is seem on other

Re: -warmup

2023-08-17 Thread Werner LEMBERG
> Just remember that backup processes (like cleaning up the hard disk, > running some cron jobs, etc.) can pop up anytime, thus influencing > the result. s/backup/background/

Re: -warmup

2023-08-17 Thread Werner LEMBERG
> I have added the total table that you suggested. Thanks. > I think Get_Char_Index is not the problem, the results varies all > the time. As far as I can see, there is a direct relationship between the total cumulated time of a test and the timing variation: The smaller the cumulated time,

Re: -warmup

2023-08-16 Thread Ahmet Göksu
Hi, I have added the total table that you suggested. I think Get_Char_Index is not the problem, the results varies all the time. Here are the three results that i had in the same minute (one has different flags). Should I proceed to detect outliers? Since we do not get the same error rate

Re: -warmup

2023-08-07 Thread Werner LEMBERG
>> What exactly means 'Baseline (ms)'? Is the shown number the time >>  for one loop? For all loops together? Please clarify and mention >>  this on the HTML page. > > Clarified that the times are milliseconds for the cumulative time > for all iterations. Thanks. The sentence is not easily

Re: -warmup

2023-08-07 Thread Ahmet Göksu
Hi! I changed code to warmup with number of iterations. > What exactly means 'Baseline (ms)'? Is the shown number the time >  for one loop? For all loops together? Please clarify and mention >  this on the HTML page. Clarified that the times are milliseconds for the cumulative time for all

Re: -warmup

2023-08-03 Thread Werner LEMBERG
> It is warming up as the given number of seconds with -w flag before > every benchmark test. > > There are still differences like 100%.. Also, 1 sec warmup means > (test count)*(font count) 70 secs for the results. Mhmm, I'm not sure whether a warmup *time span* makes sense. I would rather