Re: designing time dimension for star schema

Поиск

Список

Период

Сортировка

От	Merlin Moncure
Тема	Re: designing time dimension for star schema
Дата	11 февраля 2014 г. 06:00:24
Msg-id	CAHyXU0xHjCM_aX5vkSsNSB84UHTJuymxC4OoxVkMP5oBnAdssA@mail.gmail.com обсуждение исходный текст
Ответ на	designing time dimension for star schema (Mark Wong <markwkm@gmail.com>)
Список	pgsql-general

Дерево обсуждения

On Mon, Feb 10, 2014 at 8:45 AM, Mark Wong <markwkm@gmail.com> wrote:
> Hello everybody,
>
> I was wondering if anyone had any experiences they can share when
> designing the time dimension for a star schema and the like.  I'm
> curious about how well it would work to use a timestamp for the
> attribute key, as opposed to a surrogate key, and populating the time
> dimension with triggers on insert to the fact tables.  This is
> something that would have data streaming in (as oppose to bulk
> loading) and I think we want time granularity to the minute.
>
> A simplified example:
>
> -- Time dimension
> CREATE TABLE time (
>     datetime TIMESTAMP WITH TIME ZONE NOT NULL,
>     day_of_week SMALLINT NOT NULL
> );
> CREATE UNIQUE INDEX ON time (datetime);
>
> -- Fact
> CREATE TABLE fact(
>     datetime TIMESTAMP WITH TIME ZONE NOT NULL,
>     FOREIGN KEY (datetime) REFERENCES time(datetime)
> );
>
> -- Function to populate the time dimension
> CREATE OR REPLACE FUNCTION decompose_timestamp() RETURNS TRIGGER AS $$
> BEGIN
>     NEW.datetime = date_trunc('minutes', NEW.datetime);
>     INSERT INTO time (datetime, day_of_week)
>     VALUES (NEW.datetime, date_part('dow', NEW.datetime));
>     RETURN NEW;
> EXCEPTION
>     WHEN unique_violation THEN
>         -- Do nothing if the timestamp already exists in the dimension table.
>         RETURN new;
> END; $$
> LANGUAGE 'plpgsql';

Failure to inline the date/time in dimension tables a terrible
practice IMO.   You add a lookup and an expensive subtransaction to
each insert.  When querying the fact table you tack on a join for
every query that does not need to be there (for no benefit I can see).

merlin

В списке pgsql-general по дате отправления:

Предыдущее

От: Tom Lane
Дата: 11 февраля 2014 г., 05:01:21
Сообщение: Re: designing time dimension for star schema

Следующее

От: Nedal Slman
Дата: 11 февраля 2014 г., 14:46:42
Сообщение: ...

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: designing time dimension for star schema

Предыдущее

Следующее