[PERFORM] Group by more efficient than distinct?

Francisco Reyes Thu, 17 Apr 2008 20:55:44 -0700

I am trying to get a distinct set of rows from 2 tables.

After looking at someone else's query I noticed they were doing a group byto obtain the unique list.

After comparing on multiple machines with several tables, it seems usinggroup by to obtain a distinct list is substantially faster than usingselect distinct.


Is there any dissadvantage of using "group by" to obtain a unique list?

On a small dataset the difference was about 20% percent.

Group by

HashAggregate (cost=369.61..381.12 rows=1151 width=8) (actualtime=76.641..85.167 rows=2890 loops=1)


Distinct

Unique (cost=1088.23..1174.53 rows=1151 width=8) (actualtime=90.516..140.123 rows=2890 loops=1)

Although I don't have the numbers here with me, a simmilar result wasobtaining against a query that would return 100,000 rows. 20% and morespeed differnce between "group by" over "select distinct".

--
Sent via pgsql-performance mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

[PERFORM] Group by more efficient than distinct?

Reply via email to