Обсуждение: Design for dashboard query

Поиск
Список
Период
Сортировка

Design for dashboard query

От
sud
Дата:

Hello All,

Its postgres version 15.4. We are having a requirement in which aggregated information for all the users has to be displayed on the UI screen. It should show that information on the screen. So basically, it would be scanning the full table data which is billions of rows across many months and then join with other master tables and aggregate those and then display the results based on the input "user id" filter.

In such a scenario we are thinking of using a materialized view on top of the base tables which will store the base information and refresh those periodically to show the data based on the input user id. However i am seeing , postgres not supporting incremental refresh of materialized view and full refresh can take longer. So , do we have any other option available? Additionally , It should not impact or block the online users querying the same materialized view when the refresh is happening.


Re: Design for dashboard query

От
Sushrut Shivaswamy
Дата:
Have you tried creating an index on the user ID column?
Scanning the entire table to apply granular filters on a few attributes seems unnecessary.

Materialised views make sense if you want to aggregate some columns and query a subset of the data but would recommend trying indexes first.

Finally, shameless plug but consider using the pg_analytica extension that enables fast analytic queries on the tables which is ideal for analytics use cases like dashboards.
I’m the author of the extension and am looking for initial users to try it out.

Thanks,
Sushrut


On Sat, 15 Jun 2024 at 6:54 PM, sud <suds1434@gmail.com> wrote:

Hello All,

Its postgres version 15.4. We are having a requirement in which aggregated information for all the users has to be displayed on the UI screen. It should show that information on the screen. So basically, it would be scanning the full table data which is billions of rows across many months and then join with other master tables and aggregate those and then display the results based on the input "user id" filter.

In such a scenario we are thinking of using a materialized view on top of the base tables which will store the base information and refresh those periodically to show the data based on the input user id. However i am seeing , postgres not supporting incremental refresh of materialized view and full refresh can take longer. So , do we have any other option available? Additionally , It should not impact or block the online users querying the same materialized view when the refresh is happening.


Re: Design for dashboard query

От
yudhi s
Дата:

On Sat, 15 Jun 2024 at 6:54 PM, sud <suds1434@gmail.com> wrote:

Hello All,

Its postgres version 15.4. We are having a requirement in which aggregated information for all the users has to be displayed on the UI screen. It should show that information on the screen. So basically, it would be scanning the full table data which is billions of rows across many months and then join with other master tables and aggregate those and then display the results based on the input "user id" filter.

In such a scenario we are thinking of using a materialized view on top of the base tables which will store the base information and refresh those periodically to show the data based on the input user id. However i am seeing , postgres not supporting incremental refresh of materialized view and full refresh can take longer. So , do we have any other option available? Additionally , It should not impact or block the online users querying the same materialized view when the refresh is happening.



Yes incremental refresh is by default not available but you can do the refresh using "concurrently" keyword which will be online i believe and won't block your ongoing  queries if any. 

And there are some other ways like create another view(say mv_new) with exactly the same definition and refresh that with whatever interval you want and then switch the views with the original one using the rename command.


ALTER MATERIALIZED VIEW mv RENAME TO mv_old;
ALTER MATERIALIZED VIEW mv_new RENAME TO mv;
DROP MATERIALIZED VIEW mv_old;

Or

have a log table created (say transaction_log) and populate it using triggers with each delta insert/update/delete on the transaction table. And then schedule a cron job which will periodically flush the rows from the transaction_log table to the materialized view. This will achieve your incremental refresh.