RE: GSOC 2018 Project - A New Sorting Routine

Kefan Yang Mon, 23 Jul 2018 15:22:35 -0700

Hi Tomas!

I ran a few tests on my own Linux machine, but the problem is that my resources 
on AWS (CPU, RAM and even disk space) are very limited. I considered setting up 
a virtual machine on my own PC, but the performance there is even worse.


My original patch has two main optimizations: (1) switch to heap sort when the 
depth limit is exceeded, and (2) check whether the array is presorted only once, 
at the beginning. Now I want to test these optimizations separately. On an AWS 
EC2 instance, the regressions in the CREATE INDEX cases seem to be less 
significant if we use (1) only, but I can only test up to 100000 records and 
512MB of memory using your scripts.
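
To make the two optimizations concrete, here is a minimal sketch of an introsort on plain int arrays. It is not the code in the attached patches and not PostgreSQL's qsort: the function names, the SMALL_N cutoff, the 2*log2(n) depth limit and the Lomuto-style partition are all assumptions chosen for illustration.

```c
#include <assert.h>
#include <stddef.h>

#define SMALL_N 16              /* hypothetical cutoff for insertion sort */

static void swap_int(int *x, int *y) { int t = *x; *x = *y; *y = t; }

/* Restore the max-heap property below index root in a[0..n). */
static void sift_down(int *a, size_t root, size_t n)
{
    for (;;)
    {
        size_t child = 2 * root + 1;

        if (child >= n)
            return;
        if (child + 1 < n && a[child] < a[child + 1])
            child++;            /* pick the larger child */
        if (a[root] >= a[child])
            return;
        swap_int(&a[root], &a[child]);
        root = child;
    }
}

static void heap_sort(int *a, size_t n)
{
    if (n < 2)
        return;
    for (size_t i = n / 2; i-- > 0;)        /* build the heap */
        sift_down(a, i, n);
    for (size_t end = n - 1; end > 0; end--)
    {
        swap_int(&a[0], &a[end]);           /* move current max to the end */
        sift_down(a, 0, end);
    }
}

static void insertion_sort(int *a, size_t n)
{
    for (size_t i = 1; i < n; i++)
    {
        int key = a[i];
        size_t j = i;

        while (j > 0 && a[j - 1] > key)
        {
            a[j] = a[j - 1];
            j--;
        }
        a[j] = key;
    }
}

static int is_sorted(const int *a, size_t n)
{
    for (size_t i = 1; i < n; i++)
        if (a[i - 1] > a[i])
            return 0;
    return 1;
}

static void intro_loop(int *a, size_t n, int depth)
{
    while (n > SMALL_N)
    {
        size_t mid = n / 2, store = 0;

        if (depth-- == 0)
        {
            heap_sort(a, n);    /* (1) depth limit exceeded: use heap sort */
            return;
        }
        /* median of three: order a[0] <= a[mid] <= a[n-1] */
        if (a[mid] < a[0]) swap_int(&a[mid], &a[0]);
        if (a[n - 1] < a[0]) swap_int(&a[n - 1], &a[0]);
        if (a[n - 1] < a[mid]) swap_int(&a[n - 1], &a[mid]);
        swap_int(&a[mid], &a[n - 1]);       /* park the pivot at the end */
        for (size_t i = 0; i < n - 1; i++)
            if (a[i] < a[n - 1])
                swap_int(&a[i], &a[store++]);
        swap_int(&a[store], &a[n - 1]);     /* pivot to its final position */
        intro_loop(a, store, depth);        /* recurse on the left part */
        a += store + 1;                     /* iterate on the right part */
        n -= store + 1;
    }
    insertion_sort(a, n);
}

void intro_sort(int *a, size_t n)
{
    int depth = 0;

    assert(a != NULL || n == 0);
    if (n < 2 || is_sorted(a, n))           /* (2) presorted check, done once */
        return;
    for (size_t m = n; m > 1; m >>= 1)      /* depth limit = 2 * floor(log2 n) */
        depth += 2;
    intro_loop(a, n, depth);
}
```

In this sketch, dropping the is_sorted call from intro_sort leaves (1) on its own, and passing a very large depth to intro_loop leaves (2) on its own, which is roughly the split the two separate patches are meant to test.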

So would you mind re-running the tests using the two patches I provided in the 
attachment? That would be very helpful.

Regards,
Kefan

From: Tomas Vondra
Sent: July 18, 2018 2:26 PM
To: Kefan Yang
Cc: Andrey Borodin; Peter Geoghegan; PostgreSQL Hackers
Subject: Re: GSOC 2018 Project - A New Sorting Routine

I don't have any script for that - load the files into a spreadsheet,
create pivot tables and you're done.

regards

On 07/18/2018 11:13 PM, Kefan Yang wrote:
> Hey Tomas!
>
> I am trying to reproduce the results on my machine. Could you please
> share the script to generate .ods files?
>
> Regards,
> Kefan
>
> *From: *Tomas Vondra <mailto:[email protected]>
> *Sent: *July 18, 2018 2:05 AM
> *To: *Andrey Borodin <mailto:[email protected]>
> *Cc: *Peter Geoghegan <mailto:[email protected]>; Kefan Yang
> <mailto:[email protected]>; PostgreSQL Hackers
> <mailto:[email protected]>
> *Subject: *Re: GSOC 2018 Project - A New Sorting Routine
>
> On 07/18/2018 07:06 AM, Andrey Borodin wrote:
>> Hi, Tomas!
>>
>>> On 15 July 2018, at 1:20, Tomas Vondra <[email protected]
>>> <mailto:[email protected]>> wrote:
>>>
>>> So I doubt it's this, but I've tweaked the scripts to also set this GUC
>>> and restarted the tests on both machines. Let's see what that does.
>>
>> Do you observe any different results?
>>
>
> It did change the CREATE INDEX results, depending on the scale. The full
> data is available at [1] and [2], attached is a spreadsheet summary from
> the Xeon box.
>
> For the largest scale (1M rows) the regressions for CREATE INDEX queries
> mostly disappeared. For 10k rows it still affects CREATE INDEX with a
> text column, and the 100k case behaves just like before (so significant
> regressions for CREATE INDEX).
>
> I don't have time to investigate this further at the moment, but I'm
> still of the opinion that there's little to gain by replacing our
> current sort algorithm with this.
>
> [1] https://bitbucket.org/tvondra/sort-intro-sort-xeon/src/master/
> [2] https://bitbucket.org/tvondra/sort-intro-sort-i5/src/master/
>
> regards
>
> -- 
> Tomas Vondra                  http://www.2ndQuadrant.com
> PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

-- 
Tomas Vondra                  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

check_once.diff
Description: Binary data

use_heap.diff
Description: Binary data
