Re: efficient data reduction (and deduping)

Поиск
Список
Период
Сортировка
От Alessandro Gagliardi
Тема Re: efficient data reduction (and deduping)
Дата
Msg-id CAAB3BBL_6ju5QS2qEbmRAtePN2BOi==dBig925RjXH2QyyGzwA@mail.gmail.com
обсуждение исходный текст
Ответ на Re: efficient data reduction (and deduping)  (Claudio Freire <klaussfreire@gmail.com>)
Ответы Re: efficient data reduction (and deduping)  (Claudio Freire <klaussfreire@gmail.com>)
Список pgsql-performance
Interesting solution. If I'm not mistaken, this does solve the problem of having two entries for the same user at the exact same time (which violates my pk constraint) but it does so by leaving both of them out (since there is no au1.hr_timestamp > au2.hr_timestamp in that case). Is that right?

On Thu, Mar 1, 2012 at 10:35 AM, Claudio Freire <klaussfreire@gmail.com> wrote:
Try

INSERT INTO hourly_activity
SELECT ... everything from au1 ...
FROM activity_unlogged au1
LEFT JOIN activity_unlogged au2 ON au2.user_id = au1.user_id
                                                   AND
date_trunc('hour', au2.hr_timestamp) = date_trunc('hour',
au1.hr_timestamp)
                                                   AND
au2.hr_timestamp < au1.hr_timestamp
WHERE au2.user_id is null;

В списке pgsql-performance по дате отправления:

Предыдущее
От: Alessandro Gagliardi
Дата:
Сообщение: Re: efficient data reduction (and deduping)
Следующее
От: Claudio Freire
Дата:
Сообщение: Re: efficient data reduction (and deduping)