Re: Index Skip Scan

От

Jesper Pedersen

Тема

Re: Index Skip Scan

Дата

13 сентября 2018 г. в 18:39:46

Msg-id

11b7c9d1-ae00-4385-97b2-fcd45f0a1a71@redhat.com

обсуждение

Ответ на

Re: Index Skip Scan (Alexander Kuzmenkov)

Список

pgsql-hackers

Дерево обсуждения

Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 18 июня 2018 г. в 18:25:39

Re: Index Skip Scan Andrew Dunstan <andrew.dunstan@2ndquadrant.com> 18 июня 2018 г. в 23:20:07

Re: Index Skip Scan Alexander Korotkov <a.korotkov@postgrespro.ru> 19 июня 2018 г. в 00:06:59

Re: Index Skip Scan Michael Paquier <michael@paquier.xyz> 19 июня 2018 г. в 04:40:02

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 19 июня 2018 г. в 13:01:24

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 19 июня 2018 г. в 20:06:08

Re: Index Skip Scan Alexander Korotkov <a.korotkov@postgrespro.ru> 18 июня 2018 г. в 20:31:48

Re: Index Skip Scan Bhushan Uparkar <bhushan.uparkar@gmail.com> 16 августа 2018 г. в 08:44:58

Re: Index Skip Scan Thomas Munro <thomas.munro@enterprisedb.com> 16 августа 2018 г. в 09:22:25

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 16 августа 2018 г. в 21:28:45

Re: Index Skip Scan Stephen Frost <sfrost@snowman.net> 16 августа 2018 г. в 21:36:02

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 16 августа 2018 г. в 21:50:34

Re: Index Skip Scan Stephen Frost <sfrost@snowman.net> 16 августа 2018 г. в 22:05:30

Re: Index Skip Scan Andres Freund <andres@anarazel.de> 17 августа 2018 г. в 00:44:19

Re: Index Skip Scan Peter Geoghegan <pg@bowt.ie> 16 августа 2018 г. в 22:48:34

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 17 августа 2018 г. в 20:15:46

Re: Index Skip Scan Thomas Munro <thomas.munro@enterprisedb.com> 17 августа 2018 г. в 02:10:56

Re: Index Skip Scan Peter Geoghegan <pg@bowt.ie> 17 августа 2018 г. в 20:52:05

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 11 сентября 2018 г. в 00:47:06

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 11 сентября 2018 г. в 16:21:57

Re: Index Skip Scan Alexander Kuzmenkov <a.kuzmenkov@postgrespro.ru> 13 сентября 2018 г. в 16:01:13

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 13 сентября 2018 г. в 18:39:46

Re: Index Skip Scan Alexander Kuzmenkov <a.kuzmenkov@postgrespro.ru> 13 сентября 2018 г. в 22:36:24

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 15 сентября 2018 г. в 22:52:39

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 27 сентября 2018 г. в 16:59:50

Re: Index Skip Scan Alexander Kuzmenkov <a.kuzmenkov@postgrespro.ru> 15 ноября 2018 г. в 11:41:41

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 16 ноября 2018 г. в 15:06:09

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 17 ноября 2018 г. в 23:27:07

Re: Index Skip Scan Alexander Kuzmenkov <a.kuzmenkov@postgrespro.ru> 21 ноября 2018 г. в 15:38:55

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 21 ноября 2018 г. в 20:56:34

Re: Index Skip Scan Peter Geoghegan <pg@bowt.ie> 4 декабря 2018 г. в 03:26:31

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 20 декабря 2018 г. в 13:46:09

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 26 января 2019 г. в 17:45:54

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 27 января 2019 г. в 17:17:46

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 30 января 2019 г. в 17:19:05

Re: Index Skip Scan Kyotaro HORIGUCHI <horiguchi.kyotaro@lab.ntt.co.jp> 31 января 2019 г. в 06:31:53

Re: Index Skip Scan James Coleman <jtc331@gmail.com> 1 февраля 2019 г. в 21:04:58

Re: Index Skip Scan Andres Freund <andres@anarazel.de> 1 февраля 2019 г. в 22:05:03

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 20 февраля 2019 г. в 16:35:08

Re: Index Skip Scan Jeff Janes <jeff.janes@gmail.com> 28 февраля 2019 г. в 21:45:15

Re: Index Skip Scan Jeff Janes <jeff.janes@gmail.com> 28 февраля 2019 г. в 22:23:15

Re: Index Skip Scan Thomas Munro <thomas.munro@gmail.com> 28 февраля 2019 г. в 23:03:06

Re: Index Skip Scan Tomas Vondra <tomas.vondra@2ndquadrant.com> 28 февраля 2019 г. в 23:10:55

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 5 марта 2019 г. в 15:05:42

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 14 марта 2019 г. в 13:32:49

Re: Index Skip Scan Kyotaro HORIGUCHI <horiguchi.kyotaro@lab.ntt.co.jp> 15 марта 2019 г. в 00:51:57

Re: Index Skip Scan Kyotaro HORIGUCHI <horiguchi.kyotaro@lab.ntt.co.jp> 15 марта 2019 г. в 03:54:52

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 16 марта 2019 г. в 16:14:20

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 19 марта 2019 г. в 13:07:32

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 28 марта 2019 г. в 10:01:24

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 11 мая 2019 г. в 16:35:41

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 29 мая 2019 г. в 15:50:51

Re: Index Skip Scan Floris Van Nee <florisvannee@Optiver.com> 1 июня 2019 г. в 04:01:38

Re: Index Skip Scan Floris Van Nee <florisvannee@Optiver.com> 1 июня 2019 г. в 04:10:23

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 3 июня 2019 г. в 18:13:56

Re: Index Skip Scan Rafia Sabih <rafia.pghackers@gmail.com> 1 июня 2019 г. в 10:03:30

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 3 июня 2019 г. в 18:16:45

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 3 июня 2019 г. в 20:31:33

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 1 июня 2019 г. в 10:28:28

Re: Index Skip Scan Floris Van Nee <florisvannee@Optiver.com> 1 июня 2019 г. в 15:33:57

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 1 июня 2019 г. в 16:57:31

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 1 февраля 2019 г. в 19:24:38

Re: Index Skip Scan James Coleman <jtc331@gmail.com> 15 ноября 2018 г. в 14:28:03

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 27 сентября 2018 г. в 23:28:34

Re: Index Skip Scan Pavel Stehule <pavel.stehule@gmail.com> 9 октября 2018 г. в 13:42:24

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 9 октября 2018 г. в 13:58:09

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 9 октября 2018 г. в 13:59:28

Re: Index Skip Scan Pavel Stehule <pavel.stehule@gmail.com> 9 октября 2018 г. в 14:13:03

Re: Index Skip Scan Pavel Stehule <pavel.stehule@gmail.com> 9 октября 2018 г. в 16:12:31

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 10 октября 2018 г. в 15:34:27

Re: Index Skip Scan Robert Haas <robertmhaas@gmail.com> 12 октября 2018 г. в 17:43:48

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 16 октября 2018 г. в 19:22:18

Re: Index Skip Scan Sergei Kornilov <sk@zsrv.org> 12 ноября 2018 г. в 12:28:58

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 12 ноября 2018 г. в 12:55:28

Re: Index Skip Scan Dmitry Dolgov <9erthalion6@gmail.com> 14 ноября 2018 г. в 16:48:47

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 16 августа 2018 г. в 21:23:10

Re: Index Skip Scan Jesper Pedersen <jesper.pedersen@redhat.com> 19 июня 2018 г. в 20:00:36

Hi Alexander.

On 9/13/18 9:01 AM, Alexander Kuzmenkov wrote:
> While testing this patch

Thanks for the review !

> I noticed that current implementation doesn't 
> perform well when we have lots of small groups of equal values. Here is 
> the execution time of index skip scan vs unique over index scan, in ms, 
> depending on the size of group. The benchmark script is attached.
> 
> group size    skip        unique
> 1             2,293.85    132.55
> 5             464.40      106.59
> 10            239.61      102.02
> 50            56.59       98.74
> 100           32.56       103.04
> 500           6.08        97.09
> 

Yes, this doesn't look good. Using your test case I'm seeing that unique 
is being chosen when the group size is below 34, and skip above. This is 
with the standard initdb configuration; did you change something else ? 
Or did you force the default plan ?

> So, the current implementation can lead to performance regression, and 
> the choice of the plan depends on the notoriously unreliable ndistinct 
> statistics. 

Yes, Peter mentioned this, which I'm still looking at.

> The regression is probably because skip scan always does 
> _bt_search to find the next unique tuple. 

Very likely.

> I think we can improve this, 
> and the skip scan can be strictly faster than index scan regardless of 
> the data. As a first approximation, imagine that we somehow skipped 
> equal tuples inside _bt_next instead of sending them to the parent 
> Unique node. This would already be marginally faster than Unique + Index 
> scan. A more practical implementation would be to remember our position 
> in tree (that is, BTStack returned by _bt_search) and use it to skip 
> pages in bulk. This looks straightforward to implement for a tree that 
> does not change, but I'm not sure how to make it work with concurrent 
> modifications. Still, this looks a worthwhile direction to me, because 
> if we have a strictly faster skip scan, we can just use it always and 
> not worry about our unreliable statistics. What do you think?
> 

This is something to look at -- maybe there is a way to use 
btpo_next/btpo_prev instead/too in order to speed things up. Atm we just 
have the scan key in BTScanOpaqueData. I'll take a look after my 
upcoming vacation; feel free to contribute those changes in the meantime 
of course.

Thanks again !

Best regards,
  Jesper

В списке pgsql-hackers по дате отправления

Предыдущее

От: Dilip Kumar

Дата: 13 сентября 2018 г. в 18:33:39

Сообщение: Re: speeding up planning with partitions

Следующее

От: Alvaro Herrera

Дата: 13 сентября 2018 г. в 19:35:04

Сообщение: Re: cache lookup failed for constraint when alter table referred bypartition table