On 12/03/2010 12:40 PM, Jayadevan M wrote:
Hello,

I went this way, but for a large number of user_id's, it's quite slow:

CREATE VIEW v_views AS
    SELECT user_id, product_id, count(*) as views
    FROM viewlog
    GROUP BY user_id, product_id

SELECT
    DISTINCT user_id,
    (SELECT product_id FROM v_views inn WHERE inn.user_id = out.user_id
ORDER BY views DESC LIMIT 1) as product_id,
    (SELECT views FROM v_views inn WHERE inn.user_id = out.user_id ORDER
BY
views DESC LIMIT 1) as views
FROM
    v_views out

Does this work faster?
select x.user_id,y.product_id,x.count from
(select user_id, max(count ) as count from (select user_id,product_id,
count(*) as count from viewlog group by user_id,product_id) as x group by
user_id
) as x inner join
(select user_id,product_id, count(*) as count1 from viewlog group by
user_id,product_id ) as y
on x.user_id=y.user_id and x.count=y.count1

The issue in both approaches is that if I have two product_ids that are viewed same number of times and share the first place as most viewed products by that user, I'll get only one of them (LIMIT 1 OR MAX() can only return one row :).

I don't see how I can sort this out with elegance in SQL.

        Mario

--
Sent via pgsql-sql mailing list (pgsql-sql@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-sql

Reply via email to