Re: [mart-dev] uniqueRowsOnly(1) vs. DISTINCT SQL

Syed Haider Fri, 13 Feb 2009 04:17:44 -0800

Hi Kolja,

The DISTINCT would not work for all the queries because there isnt juston SQL statement being executed by the library to retrieve the results.Sometimes, its spread across multiple datasets on different locationswhich may well be running on different databases too. This has to bedone in the library rather SQL level. the complexity of the merge isreally down to the spread of attribute values in the database.

However, for single table queries we can improve the speed as yousuggested. We will add this optimization in the next release.


Thanks
Syed


Kolja Henckel wrote:

Hello there!

I use the BioMart Perl API and just figured out some "problem":

When using the option
uniqueRowsOnly(1)
the API fetches all rows matching the query and afterwards deletes theones that are too much (so that only unique ones are returned or printed).
My problem is that I have about 1 mio. datarows and only 18 differentvalues in the desired Attribute.This means that the query takes about 10 minutes for the result of 18values.Is it possible (or planned, or already implemented somewhere, somehow?)to implement the uniqueRowsOnly-option using the SELECT DISTINCT optionof SQL?
In this case the query should perform within seconds...

Cheers, Kolja


PS: thanks for the great mart, anyway :)

Re: [mart-dev] uniqueRowsOnly(1) vs. DISTINCT SQL

Reply via email to