Обсуждение: SP-GiST confusion: indexed column's type vs. index column type

Поиск

Список

Период

Сортировка

SP-GiST confusion: indexed column's type vs. index column type

От

Tom Lane

Дата:

02 апреля 2021 г., 16:37:51

While reviewing Pavel Borisov's patch to enable INCLUDE columns in
SP-GiST, I found some things that seem like pre-existing bugs.
These only accidentally fail to cause any problems in the existing
SP-GiST opclasses:

1. The attType passed to an opclass's config method is documented as

    Oid         attType;        /* Data type to be indexed */

Now, I would read that as meaning the type of the underlying heap
column; the documentation and code about when a "compress" method
is required certainly seem to think so too.  What is actually being
passed, though, is the data type of the index column, that is the
"opckeytype" of the index opclass.  We've failed to notice this because
(1) for most of the core SP-GiST opclasses, the two types are the same,
and (2) none of the core opclasses bother to examine attType anyway.

2. When performing an index-only scan on an SP-GiST index, what we
pass back as the tuple descriptor of the output tuples is just the
index relation's own tupdesc, cf spgbeginscan:

    /* Set up indexTupDesc and xs_hitupdesc in case it's an index-only scan */
    so->indexTupDesc = scan->xs_hitupdesc = RelationGetDescr(rel);

Again, what this is going to report is the opckeytype, not the
reconstructed heap column datatype.  That's just flat out wrong.
We've failed to notice because the only core opclass for which
those types are different is poly_ops, which does not support
reconstructing the polygons for index-only scan.

We need to do something about this because the INCLUDE patch needs
the relation descriptor of an SP-GiST index to match the reality
of what is stored in the leaf tuples.  Right now, as far as I can tell,
there isn't really any necessary connection between the atttype
claimed by the relation descriptor and the leaf type that's physically
stored.  They're accidentally the same in all existing opclasses,
but only accidentally.

I propose changing things so that

(A) attType really is the input (heap) data type.

(B) We enforce that leafType agrees with the opclass opckeytype,
ensuring the index tupdesc can be used for leaf tuples.

(C) The tupdesc passed back for an index-only scan reports the
input (heap) data type.

This might be too much change for the back branches.  Given the
lack of complaints to date, I think we can just fix it in HEAD.

            regards, tom lane

Re: SP-GiST confusion: indexed column's type vs. index column type

От

Peter Geoghegan

Дата:

02 апреля 2021 г., 16:46:28

On Fri, Apr 2, 2021 at 9:37 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
> I propose changing things so that
>
> (A) attType really is the input (heap) data type.
>
> (B) We enforce that leafType agrees with the opclass opckeytype,
> ensuring the index tupdesc can be used for leaf tuples.
>
> (C) The tupdesc passed back for an index-only scan reports the
> input (heap) data type.
>
> This might be too much change for the back branches.  Given the
> lack of complaints to date, I think we can just fix it in HEAD.

+1 to fixing it on HEAD only.

-- 
Peter Geoghegan

Re: SP-GiST confusion: indexed column's type vs. index column type

От

Tom Lane

Дата:

02 апреля 2021 г., 23:24:23

Peter Geoghegan <pg@bowt.ie> writes:
> On Fri, Apr 2, 2021 at 9:37 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>> I propose changing things so that
>> (A) attType really is the input (heap) data type.
>> (B) We enforce that leafType agrees with the opclass opckeytype,
>> ensuring the index tupdesc can be used for leaf tuples.
>> (C) The tupdesc passed back for an index-only scan reports the
>> input (heap) data type.
>> 
>> This might be too much change for the back branches.  Given the
>> lack of complaints to date, I think we can just fix it in HEAD.

> +1 to fixing it on HEAD only.

Here's a draft patch for that, in case anyone wants to look it
over.

The confusion went even deeper than I thought, as some of the code
mistakenly thought that reconstructed "leafValue" values were of the
leaf data type rather than the input attribute type.  (Which is not
too surprising, given that that's such a misleading name, but the
docs are clear and correct on the point.)

Also, both the code and docs thought that the "reconstructedValue"
datums that are passed down the tree during a search should be of
the leaf data type.  This seems to me to be arrant nonsense.
As an example, if you made an opclass that indexes 1-D arrays
by labeling each inner node with successive array elements,
right down to the leaf which is the last array element, it will
absolutely not work for the reconstructedValues to be of the
leaf type --- they have to be of the array type.  (As I said
in commit 1ebdec8c0, this'd be a fairly poorly-chosen opclass
design, but it seems like it ought to physically work.)

Given the amount of confusion here, I don't have a lot of confidence
that an opclass that wants to reconstruct values while having
leafType different from input type will work even with this patch.
I'm strongly tempted to make a src/test/modules module that
implements exactly the silly design given above, just so we have
some coverage for this scenario.

            regards, tom lane

diff --git a/doc/src/sgml/spgist.sgml b/doc/src/sgml/spgist.sgml
index ea88ae45e5..ba66deb003 100644
--- a/doc/src/sgml/spgist.sgml
+++ b/doc/src/sgml/spgist.sgml
@@ -330,19 +330,25 @@ typedef struct spgConfigOut
      </para>

      <para>
-      <structfield>leafType</structfield> is typically the same as
-      <structfield>attType</structfield>.  For the reasons of backward
-      compatibility, method <function>config</function> can
-      leave <structfield>leafType</structfield> uninitialized; that would
-      give the same effect as setting <structfield>leafType</structfield> equal
-      to <structfield>attType</structfield>.  When <structfield>attType</structfield>
-      and <structfield>leafType</structfield> are different, then optional
+      <structfield>leafType</structfield> must match the index storage type
+      defined by the operator class's <structfield>opckeytype</structfield>
+      catalog entry.  For reasons of backward compatibility,
+      method <function>config</function> can
+      leave <structfield>leafType</structfield> uninitialized (zero);
+      that is interpreted as meaning <structfield>opckeytype</structfield>.
+      (Recall that <structfield>opckeytype</structfield> can in turn be zero,
+      implying the storage type is the same as the operator class's input
+      type.)  In what follows, <structfield>leafType</structfield> should be
+      understood as referring to the fully resolved leaf data type.
+     </para>
+
+     <para>
+      When <structfield>attType</structfield>
+      and <structfield>leafType</structfield> are different, the optional
       method <function>compress</function> must be provided.
       Method <function>compress</function> is responsible
       for transformation of datums to be indexed from <structfield>attType</structfield>
       to <structfield>leafType</structfield>.
-      Note: both consistent functions will get <structfield>scankeys</structfield>
-      unchanged, without transformation using <function>compress</function>.
      </para>
      </listitem>
     </varlistentry>
@@ -677,8 +683,7 @@ typedef struct spgInnerConsistentOut
        <structfield>reconstructedValue</structfield> is the value reconstructed for the
        parent tuple; it is <literal>(Datum) 0</literal> at the root level or if the
        <function>inner_consistent</function> function did not provide a value at the
-       parent level. <structfield>reconstructedValue</structfield> is always of
-       <structname>spgConfigOut</structname>.<structfield>leafType</structfield> type.
+       parent level.
        <structfield>traversalValue</structfield> is a pointer to any traverse data
        passed down from the previous call of <function>inner_consistent</function>
        on the parent index tuple, or NULL at the root level.
@@ -713,9 +718,14 @@ typedef struct spgInnerConsistentOut
        necessarily so, so an array is used.)
        If value reconstruction is needed, set
        <structfield>reconstructedValues</structfield> to an array of the values
-       of <structname>spgConfigOut</structname>.<structfield>leafType</structfield> type
        reconstructed for each child node to be visited; otherwise, leave
        <structfield>reconstructedValues</structfield> as NULL.
+       The reconstructed values are assumed to be of the indexed column's
+       type, that is <structname>spgConfigIn</structname>.<structfield>attType</structfield>.
+       (However, since the core system will do nothing with them except
+       possibly copy them, it is sufficient for them to have the
+       same <literal>typlen</literal> and <literal>typbyval</literal>
+       properties as <structfield>attType</structfield>.)
        If ordered search is performed, set <structfield>distances</structfield>
        to an array of distance values according to <structfield>orderbys</structfield>
        array (nodes with lowest distances will be processed first).  Leave it
@@ -797,8 +807,7 @@ typedef struct spgLeafConsistentOut
        <structfield>reconstructedValue</structfield> is the value reconstructed for the
        parent tuple; it is <literal>(Datum) 0</literal> at the root level or if the
        <function>inner_consistent</function> function did not provide a value at the
-       parent level. <structfield>reconstructedValue</structfield> is always of
-       <structname>spgConfigOut</structname>.<structfield>leafType</structfield> type.
+       parent level.
        <structfield>traversalValue</structfield> is a pointer to any traverse data
        passed down from the previous call of <function>inner_consistent</function>
        on the parent index tuple, or NULL at the root level.
@@ -816,8 +825,8 @@ typedef struct spgLeafConsistentOut
        The function must return <literal>true</literal> if the leaf tuple matches the
        query, or <literal>false</literal> if not.  In the <literal>true</literal> case,
        if <structfield>returnData</structfield> is <literal>true</literal> then
-       <structfield>leafValue</structfield> must be set to the value of
-       <structname>spgConfigIn</structname>.<structfield>attType</structfield> type
+       <structfield>leafValue</structfield> must be set to the value (of type
+       <structname>spgConfigIn</structname>.<structfield>attType</structfield>)
        originally supplied to be indexed for this leaf tuple.  Also,
        <structfield>recheck</structfield> may be set to <literal>true</literal> if the match
        is uncertain and so the operator(s) must be re-applied to the actual
@@ -834,7 +843,7 @@ typedef struct spgLeafConsistentOut
    </variablelist>

  <para>
-  The optional user-defined method are:
+  The optional user-defined methods are:
  </para>

  <variablelist>
@@ -842,15 +851,22 @@ typedef struct spgLeafConsistentOut
      <term><function>Datum compress(Datum in)</function></term>
      <listitem>
       <para>
-       Converts the data item into a format suitable for physical storage in
-       a leaf tuple of index page.  It accepts
+       Converts a data item into a format suitable for physical storage in
+       a leaf tuple of the index.  It accepts a value of type
        <structname>spgConfigIn</structname>.<structfield>attType</structfield>
-       value and returns
-       <structname>spgConfigOut</structname>.<structfield>leafType</structfield>
-       value.  Output value should not be toasted.
+       and returns a value of type
+       <structname>spgConfigOut</structname>.<structfield>leafType</structfield>.
+       The output value should not be toasted.
+      </para>
+
+      <para>
+       Note: the <function>compress</function> method is only applied to
+       values to be stored.  The consistent methods receive query scankeys
+       unchanged, without transformation using <function>compress</function>.
       </para>
      </listitem>
     </varlistentry>
+
     <varlistentry>
      <term><function>options</function></term>
      <listitem>
diff --git a/src/backend/access/spgist/spgscan.c b/src/backend/access/spgist/spgscan.c
index 20e67c3f7d..0c30d6d0e8 100644
--- a/src/backend/access/spgist/spgscan.c
+++ b/src/backend/access/spgist/spgscan.c
@@ -81,7 +81,7 @@ pairingheap_SpGistSearchItem_cmp(const pairingheap_node *a,
 static void
 spgFreeSearchItem(SpGistScanOpaque so, SpGistSearchItem *item)
 {
-    if (!so->state.attLeafType.attbyval &&
+    if (!so->state.attType.attbyval &&
         DatumGetPointer(item->value) != NULL)
         pfree(DatumGetPointer(item->value));

@@ -296,6 +296,7 @@ spgbeginscan(Relation rel, int keysz, int orderbysz)
 {
     IndexScanDesc scan;
     SpGistScanOpaque so;
+    TupleDesc    outTupDesc;
     int            i;

     scan = RelationGetIndexScan(rel, keysz, orderbysz);
@@ -314,8 +315,21 @@ spgbeginscan(Relation rel, int keysz, int orderbysz)
                                              "SP-GiST traversal-value context",
                                              ALLOCSET_DEFAULT_SIZES);

-    /* Set up indexTupDesc and xs_hitupdesc in case it's an index-only scan */
-    so->indexTupDesc = scan->xs_hitupdesc = RelationGetDescr(rel);
+    /*
+     * Set up indexTupDesc and xs_hitupdesc in case it's an index-only scan.
+     * (It's rather annoying to do this work when it might be wasted, but for
+     * most opclasses we can re-use the index reldesc instead of making one.)
+     */
+    if (so->state.attType.type ==
+        TupleDescAttr(RelationGetDescr(rel), 0)->atttypid)
+        outTupDesc = RelationGetDescr(rel);
+    else
+    {
+        outTupDesc = CreateTemplateTupleDesc(1);
+        TupleDescInitEntry(outTupDesc, 1, NULL,
+                           so->state.attType.type, -1, 0);
+    }
+    so->indexTupDesc = scan->xs_hitupdesc = outTupDesc;

     /* Allocate various arrays needed for order-by scans */
     if (scan->numberOfOrderBys > 0)
@@ -447,9 +461,10 @@ spgNewHeapItem(SpGistScanOpaque so, int level, ItemPointer heapPtr,
     item->level = level;
     item->heapPtr = *heapPtr;
     /* copy value to queue cxt out of tmp cxt */
+    /* caution: "leafValue" is of type attType not leafType */
     item->value = isnull ? (Datum) 0 :
-        datumCopy(leafValue, so->state.attLeafType.attbyval,
-                  so->state.attLeafType.attlen);
+        datumCopy(leafValue, so->state.attType.attbyval,
+                  so->state.attType.attlen);
     item->traversalValue = NULL;
     item->isLeaf = true;
     item->recheck = recheck;
@@ -589,10 +604,11 @@ spgMakeInnerItem(SpGistScanOpaque so,
         : parentItem->level;

     /* Must copy value out of temp context */
+    /* (we assume reconstructedValues are of type attType) */
     item->value = out->reconstructedValues
         ? datumCopy(out->reconstructedValues[i],
-                    so->state.attLeafType.attbyval,
-                    so->state.attLeafType.attlen)
+                    so->state.attType.attbyval,
+                    so->state.attType.attlen)
         : (Datum) 0;

     /*
diff --git a/src/backend/access/spgist/spgutils.c b/src/backend/access/spgist/spgutils.c
index 949352ada7..084a3f78fc 100644
--- a/src/backend/access/spgist/spgutils.c
+++ b/src/backend/access/spgist/spgutils.c
@@ -23,6 +23,7 @@
 #include "access/xact.h"
 #include "catalog/pg_amop.h"
 #include "commands/vacuum.h"
+#include "nodes/nodeFuncs.h"
 #include "storage/bufmgr.h"
 #include "storage/indexfsm.h"
 #include "storage/lmgr.h"
@@ -89,6 +90,66 @@ spghandler(PG_FUNCTION_ARGS)
     PG_RETURN_POINTER(amroutine);
 }

+/*
+ * GetIndexInputType
+ *        Determine the nominal input data type for an index column
+ *
+ * We define the "nominal" input type as the associated opclass's opcintype,
+ * or if that is a polymorphic type, the base type of the heap column or
+ * expression that is the index's input.  The reason for preferring the
+ * opcintype is that non-polymorphic opclasses probably don't want to hear
+ * about binary-compatible input types.  For instance, if a text opclass
+ * is being used with a varchar heap column, we want to report "text" not
+ * "varchar".  Likewise, opclasses don't want to hear about domain types,
+ * so if we do consult the actual input type, we make sure to flatten domains.
+ *
+ * At some point maybe this should go somewhere else, but it's not clear
+ * if any other index AMs have a use for it.
+ */
+static Oid
+GetIndexInputType(Relation index, AttrNumber indexcol)
+{
+    Oid            opcintype;
+    AttrNumber    heapcol;
+    List       *indexprs;
+    ListCell   *indexpr_item;
+
+    Assert(index->rd_index != NULL);
+    Assert(indexcol > 0 && indexcol <= index->rd_index->indnkeyatts);
+    opcintype = index->rd_opcintype[indexcol - 1];
+    if (!IsPolymorphicType(opcintype))
+        return opcintype;
+    heapcol = index->rd_index->indkey.values[indexcol - 1];
+    if (heapcol != 0)            /* Simple index column? */
+        return getBaseType(get_atttype(index->rd_index->indrelid, heapcol));
+
+    /*
+     * If the index expressions are already cached, skip calling
+     * RelationGetIndexExpressions, as it will make a copy which is overkill.
+     * We're not going to modify the trees, and we're not going to do anything
+     * that would invalidate the relcache entry before we're done.
+     */
+    if (index->rd_indexprs)
+        indexprs = index->rd_indexprs;
+    else
+        indexprs = RelationGetIndexExpressions(index);
+    indexpr_item = list_head(indexprs);
+    for (int i = 1; i <= index->rd_index->indnkeyatts; i++)
+    {
+        if (index->rd_index->indkey.values[i - 1] == 0)
+        {
+            /* expression column */
+            if (indexpr_item == NULL)
+                elog(ERROR, "wrong number of index expressions");
+            if (i == indexcol)
+                return getBaseType(exprType((Node *) lfirst(indexpr_item)));
+            indexpr_item = lnext(indexprs, indexpr_item);
+        }
+    }
+    elog(ERROR, "wrong number of index expressions");
+    return InvalidOid;            /* keep compiler quiet */
+}
+
 /* Fill in a SpGistTypeDesc struct with info about the specified data type */
 static void
 fillTypeDesc(SpGistTypeDesc *desc, Oid type)
@@ -109,6 +170,7 @@ spgGetCache(Relation index)
     if (index->rd_amcache == NULL)
     {
         Oid            atttype;
+        Oid            indexatttype;
         spgConfigIn in;
         FmgrInfo   *procinfo;
         Buffer        metabuffer;
@@ -121,11 +183,11 @@ spgGetCache(Relation index)
         Assert(index->rd_att->natts == 1);

         /*
-         * Get the actual data type of the indexed column from the index
-         * tupdesc.  We pass this to the opclass config function so that
+         * Get the actual (well, nominal) data type of the column being
+         * indexed.  We pass this to the opclass config function so that
          * polymorphic opclasses are possible.
          */
-        atttype = TupleDescAttr(index->rd_att, 0)->atttypid;
+        atttype = GetIndexInputType(index, 1);

         /* Call the config function to get config info for the opclass */
         in.attType = atttype;
@@ -136,18 +198,30 @@ spgGetCache(Relation index)
                           PointerGetDatum(&in),
                           PointerGetDatum(&cache->config));

+        /*
+         * The leafType had better agree with the actual index column type,
+         * which index.c will have derived from the opclass's opcintype.
+         */
+        indexatttype = TupleDescAttr(RelationGetDescr(index), 0)->atttypid;
+        if (OidIsValid(cache->config.leafType) &&
+            cache->config.leafType != indexatttype)
+            ereport(ERROR,
+                    (errcode(ERRCODE_INVALID_PARAMETER_VALUE),
+                     errmsg("SP-GiST leaf data type %s does not match declared type %s",
+                            format_type_be(cache->config.leafType),
+                            format_type_be(indexatttype))));
+
         /* Get the information we need about each relevant datatype */
         fillTypeDesc(&cache->attType, atttype);

-        if (OidIsValid(cache->config.leafType) &&
-            cache->config.leafType != atttype)
+        if (indexatttype != atttype)
         {
             if (!OidIsValid(index_getprocid(index, 1, SPGIST_COMPRESS_PROC)))
                 ereport(ERROR,
                         (errcode(ERRCODE_INVALID_PARAMETER_VALUE),
                          errmsg("compress method must be defined when leaf type is different from input type")));

-            fillTypeDesc(&cache->attLeafType, cache->config.leafType);
+            fillTypeDesc(&cache->attLeafType, indexatttype);
         }
         else
         {
diff --git a/src/backend/access/spgist/spgvalidate.c b/src/backend/access/spgist/spgvalidate.c
index 8bc3889a4d..b2f1ffa30f 100644
--- a/src/backend/access/spgist/spgvalidate.c
+++ b/src/backend/access/spgist/spgvalidate.c
@@ -43,6 +43,7 @@ spgvalidate(Oid opclassoid)
     Form_pg_opclass classform;
     Oid            opfamilyoid;
     Oid            opcintype;
+    Oid            opckeytype;
     char       *opclassname;
     HeapTuple    familytup;
     Form_pg_opfamily familyform;
@@ -57,6 +58,7 @@ spgvalidate(Oid opclassoid)
     spgConfigOut configOut;
     Oid            configOutLefttype = InvalidOid;
     Oid            configOutRighttype = InvalidOid;
+    Oid            configOutLeafType = InvalidOid;

     /* Fetch opclass information */
     classtup = SearchSysCache1(CLAOID, ObjectIdGetDatum(opclassoid));
@@ -66,6 +68,7 @@ spgvalidate(Oid opclassoid)

     opfamilyoid = classform->opcfamily;
     opcintype = classform->opcintype;
+    opckeytype = classform->opckeytype;
     opclassname = NameStr(classform->opcname);

     /* Fetch opfamily information */
@@ -118,13 +121,20 @@ spgvalidate(Oid opclassoid)
                 configOutLefttype = procform->amproclefttype;
                 configOutRighttype = procform->amprocrighttype;

+                /* Identify actual leaf datum type */
+                if (OidIsValid(configOut.leafType))
+                    configOutLeafType = configOut.leafType;
+                else if (OidIsValid(opckeytype))
+                    configOutLeafType = opckeytype;
+                else
+                    configOutLeafType = procform->amproclefttype;
+
                 /*
                  * When leaf and attribute types are the same, compress
                  * function is not required and we set corresponding bit in
                  * functionset for later group consistency check.
                  */
-                if (!OidIsValid(configOut.leafType) ||
-                    configOut.leafType == configIn.attType)
+                if (configOutLeafType == configIn.attType)
                 {
                     foreach(lc, grouplist)
                     {
@@ -156,7 +166,7 @@ spgvalidate(Oid opclassoid)
                     ok = false;
                 else
                     ok = check_amproc_signature(procform->amproc,
-                                                configOut.leafType, true,
+                                                configOutLeafType, true,
                                                 1, 1, procform->amproclefttype);
                 break;
             case SPGIST_OPTIONS_PROC:
diff --git a/src/include/access/spgist_private.h b/src/include/access/spgist_private.h
index b6f95421d8..eda93e6d01 100644
--- a/src/include/access/spgist_private.h
+++ b/src/include/access/spgist_private.h
@@ -151,8 +151,8 @@ typedef struct SpGistState
 typedef struct SpGistSearchItem
 {
     pairingheap_node phNode;    /* pairing heap node */
-    Datum        value;            /* value reconstructed from parent or
-                                 * leafValue if heaptuple */
+    Datum        value;            /* value reconstructed from parent, or
+                                 * leafValue at leaf level */
     void       *traversalValue; /* opclass-specific traverse value */
     int            level;            /* level of items on this page */
     ItemPointerData heapPtr;    /* heap info, if heap tuple */
@@ -208,7 +208,7 @@ typedef struct SpGistScanOpaqueData

     /* These fields are only used in amgettuple scans: */
     bool        want_itup;        /* are we reconstructing tuples? */
-    TupleDesc    indexTupDesc;    /* if so, tuple descriptor for them */
+    TupleDesc    indexTupDesc;    /* if so, descriptor for reconstructed tuples */
     int            nPtrs;            /* number of TIDs found on current page */
     int            iPtr;            /* index for scanning through same */
     ItemPointerData heapPtrs[MaxIndexTuplesPerPage];    /* TIDs from cur page */

Re: SP-GiST confusion: indexed column's type vs. index column type

От

Tom Lane

Дата:

03 апреля 2021 г., 02:05:02

I wrote:
> Also, both the code and docs thought that the "reconstructedValue"
> datums that are passed down the tree during a search should be of
> the leaf data type.  This seems to me to be arrant nonsense.
> As an example, if you made an opclass that indexes 1-D arrays
> by labeling each inner node with successive array elements,
> right down to the leaf which is the last array element, it will
> absolutely not work for the reconstructedValues to be of the
> leaf type --- they have to be of the array type.  (As I said
> in commit 1ebdec8c0, this'd be a fairly poorly-chosen opclass
> design, but it seems like it ought to physically work.)

So after trying to build an opclass that does that, I have a clearer
understanding of why opclasses that'd break the existing code are
so thin on the ground.  You can't do the above, because the opclass
cannot force the AM to add inner nodes that it doesn't want to.
For example, the first few index entries will simply be dumped into
the root page as undifferentiated leaf tuples.  This means that,
if you'd like to be able to return reconstructed index entries, the
leaf data type *must* be able to hold all the data that is in an
input value.  In principle you could represent it in some other
format, but the path of least resistance is definitely to make the
leaf type the same as the input.

I still want to make an opclass in which those types are different,
if only for testing purposes, but I'm having a hard time coming up
with a plan that's not totally lame.  Best idea I can think of is
to wrap the input in a bytea, which just begs the question "why
would you do that?".  Anybody have a less lame thought?

            regards, tom lane

Re: SP-GiST confusion: indexed column's type vs. index column type

От

Tom Lane

Дата:

03 апреля 2021 г., 17:16:27

I wrote:
> I still want to make an opclass in which those types are different,
> if only for testing purposes, but I'm having a hard time coming up
> with a plan that's not totally lame.  Best idea I can think of is
> to wrap the input in a bytea, which just begs the question "why
> would you do that?".  Anybody have a less lame thought?

I thought of a plan that's at least simple to code: make an opclass
that takes "name" but does all the internal storage as "text".  Then
all the code can be stolen from spgtextproc.c with very minor changes.
I'd been too fixated on finding an example in which attType and
leafType differ as to pass-by-ref vs pass-by-value, but actually a
test case with positive typlen vs. varlena typlen will do just as well
for finding wrong-type references.

And, having coded that up, my first test result is

regression=# create extension spgist_name_ops ;
ERROR:  storage type cannot be different from data type for access method "spgist"

evidently because SPGiST doesn't set amroutine->amstorage.

That's silly on its face because we have built-in opclasses in which
those types are different, but it probably helps explain why there are
no field reports of trouble with these bugs ...

            regards, tom lane

Re: SP-GiST confusion: indexed column's type vs. index column type

От

Tom Lane

Дата:

03 апреля 2021 г., 20:06:38

Here's a patch that, in addition to what I mentioned upthread,
rescinds the limitation that user-defined SPGIST opclasses can't
set the STORAGE parameter, and cleans up some residual confusion
about whether values are of the indexed type (attType) or the
storage type (leafType).  Once I'd wrapped my head around the idea
that indeed intermediate-level "reconstructed" values ought to be
of the leafType, there were fewer bugs of that sort than I thought
yesterday ... but still a nonzero number.

I've also attached a test module that exercises reconstruction
during index-only scan with leafType being meaningfully different
from attType.  I'm not quite sure whether this is worth
committing, but I'm leaning towards doing so.

            regards, tom lane

diff --git a/doc/src/sgml/spgist.sgml b/doc/src/sgml/spgist.sgml
index ea88ae45e5..9ed1f564ee 100644
--- a/doc/src/sgml/spgist.sgml
+++ b/doc/src/sgml/spgist.sgml
@@ -205,10 +205,12 @@
  </para>

  <para>
-  Leaf tuples of an <acronym>SP-GiST</acronym> tree contain values of the
-  same data type as the indexed column.  Leaf tuples at the root level will
-  always contain the original indexed data value, but leaf tuples at lower
-  levels might contain only a compressed representation, such as a suffix.
+  Leaf tuples of an <acronym>SP-GiST</acronym> tree usually contain values
+  of the same data type as the indexed column, although it is also possible
+  for them to contain lossy representations of the indexed column.
+  Leaf tuples stored at the root level will directly represent
+  the original indexed data value, but leaf tuples at lower
+  levels might contain only a partial value, such as a suffix.
   In that case the operator class support functions must be able to
   reconstruct the original value using information accumulated from the
   inner tuples that are passed through to reach the leaf level.
@@ -330,19 +332,26 @@ typedef struct spgConfigOut
      </para>

      <para>
-      <structfield>leafType</structfield> is typically the same as
-      <structfield>attType</structfield>.  For the reasons of backward
-      compatibility, method <function>config</function> can
-      leave <structfield>leafType</structfield> uninitialized; that would
-      give the same effect as setting <structfield>leafType</structfield> equal
-      to <structfield>attType</structfield>.  When <structfield>attType</structfield>
-      and <structfield>leafType</structfield> are different, then optional
+      <structfield>leafType</structfield> must match the index storage type
+      defined by the operator class's <structfield>opckeytype</structfield>
+      catalog entry.  For reasons of backward compatibility,
+      method <function>config</function> can
+      leave <structfield>leafType</structfield> uninitialized (zero);
+      that is interpreted as meaning <structfield>opckeytype</structfield>.
+      (Recall that <structfield>opckeytype</structfield> can in turn be zero,
+      implying the storage type is the same as the operator class's input
+      type, which is the most common situation.)  In what
+      follows, <structfield>leafType</structfield> should be understood as
+      referring to the fully resolved leaf data type.
+     </para>
+
+     <para>
+      When <structfield>attType</structfield>
+      and <structfield>leafType</structfield> are different, the optional
       method <function>compress</function> must be provided.
       Method <function>compress</function> is responsible
       for transformation of datums to be indexed from <structfield>attType</structfield>
       to <structfield>leafType</structfield>.
-      Note: both consistent functions will get <structfield>scankeys</structfield>
-      unchanged, without transformation using <function>compress</function>.
      </para>
      </listitem>
     </varlistentry>
@@ -677,8 +686,7 @@ typedef struct spgInnerConsistentOut
        <structfield>reconstructedValue</structfield> is the value reconstructed for the
        parent tuple; it is <literal>(Datum) 0</literal> at the root level or if the
        <function>inner_consistent</function> function did not provide a value at the
-       parent level. <structfield>reconstructedValue</structfield> is always of
-       <structname>spgConfigOut</structname>.<structfield>leafType</structfield> type.
+       parent level.
        <structfield>traversalValue</structfield> is a pointer to any traverse data
        passed down from the previous call of <function>inner_consistent</function>
        on the parent index tuple, or NULL at the root level.
@@ -713,9 +721,14 @@ typedef struct spgInnerConsistentOut
        necessarily so, so an array is used.)
        If value reconstruction is needed, set
        <structfield>reconstructedValues</structfield> to an array of the values
-       of <structname>spgConfigOut</structname>.<structfield>leafType</structfield> type
        reconstructed for each child node to be visited; otherwise, leave
        <structfield>reconstructedValues</structfield> as NULL.
+       The reconstructed values are assumed to be of type
+       <structname>spgConfigOut</structname>.<structfield>leafType</structfield>.
+       (However, since the core system will do nothing with them except
+       possibly copy them, it is sufficient for them to have the
+       same <literal>typlen</literal> and <literal>typbyval</literal>
+       properties as <structfield>leafType</structfield>.)
        If ordered search is performed, set <structfield>distances</structfield>
        to an array of distance values according to <structfield>orderbys</structfield>
        array (nodes with lowest distances will be processed first).  Leave it
@@ -797,8 +810,7 @@ typedef struct spgLeafConsistentOut
        <structfield>reconstructedValue</structfield> is the value reconstructed for the
        parent tuple; it is <literal>(Datum) 0</literal> at the root level or if the
        <function>inner_consistent</function> function did not provide a value at the
-       parent level. <structfield>reconstructedValue</structfield> is always of
-       <structname>spgConfigOut</structname>.<structfield>leafType</structfield> type.
+       parent level.
        <structfield>traversalValue</structfield> is a pointer to any traverse data
        passed down from the previous call of <function>inner_consistent</function>
        on the parent index tuple, or NULL at the root level.
@@ -816,8 +828,8 @@ typedef struct spgLeafConsistentOut
        The function must return <literal>true</literal> if the leaf tuple matches the
        query, or <literal>false</literal> if not.  In the <literal>true</literal> case,
        if <structfield>returnData</structfield> is <literal>true</literal> then
-       <structfield>leafValue</structfield> must be set to the value of
-       <structname>spgConfigIn</structname>.<structfield>attType</structfield> type
+       <structfield>leafValue</structfield> must be set to the value (of type
+       <structname>spgConfigIn</structname>.<structfield>attType</structfield>)
        originally supplied to be indexed for this leaf tuple.  Also,
        <structfield>recheck</structfield> may be set to <literal>true</literal> if the match
        is uncertain and so the operator(s) must be re-applied to the actual
@@ -834,7 +846,7 @@ typedef struct spgLeafConsistentOut
    </variablelist>

  <para>
-  The optional user-defined method are:
+  The optional user-defined methods are:
  </para>

  <variablelist>
@@ -842,15 +854,22 @@ typedef struct spgLeafConsistentOut
      <term><function>Datum compress(Datum in)</function></term>
      <listitem>
       <para>
-       Converts the data item into a format suitable for physical storage in
-       a leaf tuple of index page.  It accepts
+       Converts a data item into a format suitable for physical storage in
+       a leaf tuple of the index.  It accepts a value of type
        <structname>spgConfigIn</structname>.<structfield>attType</structfield>
-       value and returns
-       <structname>spgConfigOut</structname>.<structfield>leafType</structfield>
-       value.  Output value should not be toasted.
+       and returns a value of type
+       <structname>spgConfigOut</structname>.<structfield>leafType</structfield>.
+       The output value must not contain an out-of-line TOAST pointer.
+      </para>
+
+      <para>
+       Note: the <function>compress</function> method is only applied to
+       values to be stored.  The consistent methods receive query scankeys
+       unchanged, without transformation using <function>compress</function>.
       </para>
      </listitem>
     </varlistentry>
+
     <varlistentry>
      <term><function>options</function></term>
      <listitem>
diff --git a/src/backend/access/spgist/spgscan.c b/src/backend/access/spgist/spgscan.c
index 20e67c3f7d..06011d2c02 100644
--- a/src/backend/access/spgist/spgscan.c
+++ b/src/backend/access/spgist/spgscan.c
@@ -81,7 +81,10 @@ pairingheap_SpGistSearchItem_cmp(const pairingheap_node *a,
 static void
 spgFreeSearchItem(SpGistScanOpaque so, SpGistSearchItem *item)
 {
-    if (!so->state.attLeafType.attbyval &&
+    /* value is of type attType if isLeaf, else of type attLeafType */
+    /* (no, that is not backwards; yes, it's confusing) */
+    if (!(item->isLeaf ? so->state.attType.attbyval :
+          so->state.attLeafType.attbyval) &&
         DatumGetPointer(item->value) != NULL)
         pfree(DatumGetPointer(item->value));

@@ -296,6 +299,7 @@ spgbeginscan(Relation rel, int keysz, int orderbysz)
 {
     IndexScanDesc scan;
     SpGistScanOpaque so;
+    TupleDesc    outTupDesc;
     int            i;

     scan = RelationGetIndexScan(rel, keysz, orderbysz);
@@ -314,8 +318,21 @@ spgbeginscan(Relation rel, int keysz, int orderbysz)
                                              "SP-GiST traversal-value context",
                                              ALLOCSET_DEFAULT_SIZES);

-    /* Set up indexTupDesc and xs_hitupdesc in case it's an index-only scan */
-    so->indexTupDesc = scan->xs_hitupdesc = RelationGetDescr(rel);
+    /*
+     * Set up indexTupDesc and xs_hitupdesc in case it's an index-only scan.
+     * (It's rather annoying to do this work when it might be wasted, but for
+     * most opclasses we can re-use the index reldesc instead of making one.)
+     */
+    if (so->state.attType.type ==
+        TupleDescAttr(RelationGetDescr(rel), 0)->atttypid)
+        outTupDesc = RelationGetDescr(rel);
+    else
+    {
+        outTupDesc = CreateTemplateTupleDesc(1);
+        TupleDescInitEntry(outTupDesc, 1, NULL,
+                           so->state.attType.type, -1, 0);
+    }
+    so->indexTupDesc = scan->xs_hitupdesc = outTupDesc;

     /* Allocate various arrays needed for order-by scans */
     if (scan->numberOfOrderBys > 0)
@@ -447,9 +464,10 @@ spgNewHeapItem(SpGistScanOpaque so, int level, ItemPointer heapPtr,
     item->level = level;
     item->heapPtr = *heapPtr;
     /* copy value to queue cxt out of tmp cxt */
+    /* caution: "leafValue" is of type attType not leafType */
     item->value = isnull ? (Datum) 0 :
-        datumCopy(leafValue, so->state.attLeafType.attbyval,
-                  so->state.attLeafType.attlen);
+        datumCopy(leafValue, so->state.attType.attbyval,
+                  so->state.attType.attlen);
     item->traversalValue = NULL;
     item->isLeaf = true;
     item->recheck = recheck;
@@ -497,6 +515,7 @@ spgLeafTest(SpGistScanOpaque so, SpGistSearchItem *item,
         in.nkeys = so->numberOfKeys;
         in.orderbys = so->orderByData;
         in.norderbys = so->numberOfNonNullOrderBys;
+        Assert(!item->isLeaf);    /* else reconstructedValue would be wrong type */
         in.reconstructedValue = item->value;
         in.traversalValue = item->traversalValue;
         in.level = item->level;
@@ -563,6 +582,7 @@ spgInitInnerConsistentIn(spgInnerConsistentIn *in,
     in->orderbys = so->orderByData;
     in->nkeys = so->numberOfKeys;
     in->norderbys = so->numberOfNonNullOrderBys;
+    Assert(!item->isLeaf);        /* else reconstructedValue would be wrong type */
     in->reconstructedValue = item->value;
     in->traversalMemoryContext = so->traversalCxt;
     in->traversalValue = item->traversalValue;
@@ -589,6 +609,7 @@ spgMakeInnerItem(SpGistScanOpaque so,
         : parentItem->level;

     /* Must copy value out of temp context */
+    /* (recall that reconstructed values are of type leafType) */
     item->value = out->reconstructedValues
         ? datumCopy(out->reconstructedValues[i],
                     so->state.attLeafType.attbyval,
diff --git a/src/backend/access/spgist/spgutils.c b/src/backend/access/spgist/spgutils.c
index 949352ada7..cc5cacf624 100644
--- a/src/backend/access/spgist/spgutils.c
+++ b/src/backend/access/spgist/spgutils.c
@@ -23,6 +23,7 @@
 #include "access/xact.h"
 #include "catalog/pg_amop.h"
 #include "commands/vacuum.h"
+#include "nodes/nodeFuncs.h"
 #include "storage/bufmgr.h"
 #include "storage/indexfsm.h"
 #include "storage/lmgr.h"
@@ -53,7 +54,7 @@ spghandler(PG_FUNCTION_ARGS)
     amroutine->amoptionalkey = true;
     amroutine->amsearcharray = false;
     amroutine->amsearchnulls = true;
-    amroutine->amstorage = false;
+    amroutine->amstorage = true;
     amroutine->amclusterable = false;
     amroutine->ampredlocks = false;
     amroutine->amcanparallel = false;
@@ -89,6 +90,66 @@ spghandler(PG_FUNCTION_ARGS)
     PG_RETURN_POINTER(amroutine);
 }

+/*
+ * GetIndexInputType
+ *        Determine the nominal input data type for an index column
+ *
+ * We define the "nominal" input type as the associated opclass's opcintype,
+ * or if that is a polymorphic type, the base type of the heap column or
+ * expression that is the index's input.  The reason for preferring the
+ * opcintype is that non-polymorphic opclasses probably don't want to hear
+ * about binary-compatible input types.  For instance, if a text opclass
+ * is being used with a varchar heap column, we want to report "text" not
+ * "varchar".  Likewise, opclasses don't want to hear about domain types,
+ * so if we do consult the actual input type, we make sure to flatten domains.
+ *
+ * At some point maybe this should go somewhere else, but it's not clear
+ * if any other index AMs have a use for it.
+ */
+static Oid
+GetIndexInputType(Relation index, AttrNumber indexcol)
+{
+    Oid            opcintype;
+    AttrNumber    heapcol;
+    List       *indexprs;
+    ListCell   *indexpr_item;
+
+    Assert(index->rd_index != NULL);
+    Assert(indexcol > 0 && indexcol <= index->rd_index->indnkeyatts);
+    opcintype = index->rd_opcintype[indexcol - 1];
+    if (!IsPolymorphicType(opcintype))
+        return opcintype;
+    heapcol = index->rd_index->indkey.values[indexcol - 1];
+    if (heapcol != 0)            /* Simple index column? */
+        return getBaseType(get_atttype(index->rd_index->indrelid, heapcol));
+
+    /*
+     * If the index expressions are already cached, skip calling
+     * RelationGetIndexExpressions, as it will make a copy which is overkill.
+     * We're not going to modify the trees, and we're not going to do anything
+     * that would invalidate the relcache entry before we're done.
+     */
+    if (index->rd_indexprs)
+        indexprs = index->rd_indexprs;
+    else
+        indexprs = RelationGetIndexExpressions(index);
+    indexpr_item = list_head(indexprs);
+    for (int i = 1; i <= index->rd_index->indnkeyatts; i++)
+    {
+        if (index->rd_index->indkey.values[i - 1] == 0)
+        {
+            /* expression column */
+            if (indexpr_item == NULL)
+                elog(ERROR, "wrong number of index expressions");
+            if (i == indexcol)
+                return getBaseType(exprType((Node *) lfirst(indexpr_item)));
+            indexpr_item = lnext(indexprs, indexpr_item);
+        }
+    }
+    elog(ERROR, "wrong number of index expressions");
+    return InvalidOid;            /* keep compiler quiet */
+}
+
 /* Fill in a SpGistTypeDesc struct with info about the specified data type */
 static void
 fillTypeDesc(SpGistTypeDesc *desc, Oid type)
@@ -109,6 +170,7 @@ spgGetCache(Relation index)
     if (index->rd_amcache == NULL)
     {
         Oid            atttype;
+        Oid            indexatttype;
         spgConfigIn in;
         FmgrInfo   *procinfo;
         Buffer        metabuffer;
@@ -121,11 +183,11 @@ spgGetCache(Relation index)
         Assert(index->rd_att->natts == 1);

         /*
-         * Get the actual data type of the indexed column from the index
-         * tupdesc.  We pass this to the opclass config function so that
+         * Get the actual (well, nominal) data type of the column being
+         * indexed.  We pass this to the opclass config function so that
          * polymorphic opclasses are possible.
          */
-        atttype = TupleDescAttr(index->rd_att, 0)->atttypid;
+        atttype = GetIndexInputType(index, 1);

         /* Call the config function to get config info for the opclass */
         in.attType = atttype;
@@ -136,18 +198,30 @@ spgGetCache(Relation index)
                           PointerGetDatum(&in),
                           PointerGetDatum(&cache->config));

+        /*
+         * The leafType had better agree with the actual index column type,
+         * which index.c will have derived from the opclass's opcintype.
+         */
+        indexatttype = TupleDescAttr(RelationGetDescr(index), 0)->atttypid;
+        if (OidIsValid(cache->config.leafType) &&
+            cache->config.leafType != indexatttype)
+            ereport(ERROR,
+                    (errcode(ERRCODE_INVALID_PARAMETER_VALUE),
+                     errmsg("SP-GiST leaf data type %s does not match declared type %s",
+                            format_type_be(cache->config.leafType),
+                            format_type_be(indexatttype))));
+
         /* Get the information we need about each relevant datatype */
         fillTypeDesc(&cache->attType, atttype);

-        if (OidIsValid(cache->config.leafType) &&
-            cache->config.leafType != atttype)
+        if (indexatttype != atttype)
         {
             if (!OidIsValid(index_getprocid(index, 1, SPGIST_COMPRESS_PROC)))
                 ereport(ERROR,
                         (errcode(ERRCODE_INVALID_PARAMETER_VALUE),
                          errmsg("compress method must be defined when leaf type is different from input type")));

-            fillTypeDesc(&cache->attLeafType, cache->config.leafType);
+            fillTypeDesc(&cache->attLeafType, indexatttype);
         }
         else
         {
diff --git a/src/backend/access/spgist/spgvalidate.c b/src/backend/access/spgist/spgvalidate.c
index 8bc3889a4d..b2f1ffa30f 100644
--- a/src/backend/access/spgist/spgvalidate.c
+++ b/src/backend/access/spgist/spgvalidate.c
@@ -43,6 +43,7 @@ spgvalidate(Oid opclassoid)
     Form_pg_opclass classform;
     Oid            opfamilyoid;
     Oid            opcintype;
+    Oid            opckeytype;
     char       *opclassname;
     HeapTuple    familytup;
     Form_pg_opfamily familyform;
@@ -57,6 +58,7 @@ spgvalidate(Oid opclassoid)
     spgConfigOut configOut;
     Oid            configOutLefttype = InvalidOid;
     Oid            configOutRighttype = InvalidOid;
+    Oid            configOutLeafType = InvalidOid;

     /* Fetch opclass information */
     classtup = SearchSysCache1(CLAOID, ObjectIdGetDatum(opclassoid));
@@ -66,6 +68,7 @@ spgvalidate(Oid opclassoid)

     opfamilyoid = classform->opcfamily;
     opcintype = classform->opcintype;
+    opckeytype = classform->opckeytype;
     opclassname = NameStr(classform->opcname);

     /* Fetch opfamily information */
@@ -118,13 +121,20 @@ spgvalidate(Oid opclassoid)
                 configOutLefttype = procform->amproclefttype;
                 configOutRighttype = procform->amprocrighttype;

+                /* Identify actual leaf datum type */
+                if (OidIsValid(configOut.leafType))
+                    configOutLeafType = configOut.leafType;
+                else if (OidIsValid(opckeytype))
+                    configOutLeafType = opckeytype;
+                else
+                    configOutLeafType = procform->amproclefttype;
+
                 /*
                  * When leaf and attribute types are the same, compress
                  * function is not required and we set corresponding bit in
                  * functionset for later group consistency check.
                  */
-                if (!OidIsValid(configOut.leafType) ||
-                    configOut.leafType == configIn.attType)
+                if (configOutLeafType == configIn.attType)
                 {
                     foreach(lc, grouplist)
                     {
@@ -156,7 +166,7 @@ spgvalidate(Oid opclassoid)
                     ok = false;
                 else
                     ok = check_amproc_signature(procform->amproc,
-                                                configOut.leafType, true,
+                                                configOutLeafType, true,
                                                 1, 1, procform->amproclefttype);
                 break;
             case SPGIST_OPTIONS_PROC:
diff --git a/src/include/access/spgist_private.h b/src/include/access/spgist_private.h
index b6f95421d8..980aa7766b 100644
--- a/src/include/access/spgist_private.h
+++ b/src/include/access/spgist_private.h
@@ -151,8 +151,8 @@ typedef struct SpGistState
 typedef struct SpGistSearchItem
 {
     pairingheap_node phNode;    /* pairing heap node */
-    Datum        value;            /* value reconstructed from parent or
-                                 * leafValue if heaptuple */
+    Datum        value;            /* value reconstructed from parent, or
+                                 * leafValue if isLeaf */
     void       *traversalValue; /* opclass-specific traverse value */
     int            level;            /* level of items on this page */
     ItemPointerData heapPtr;    /* heap info, if heap tuple */
@@ -208,7 +208,7 @@ typedef struct SpGistScanOpaqueData

     /* These fields are only used in amgettuple scans: */
     bool        want_itup;        /* are we reconstructing tuples? */
-    TupleDesc    indexTupDesc;    /* if so, tuple descriptor for them */
+    TupleDesc    indexTupDesc;    /* if so, descriptor for reconstructed tuples */
     int            nPtrs;            /* number of TIDs found on current page */
     int            iPtr;            /* index for scanning through same */
     ItemPointerData heapPtrs[MaxIndexTuplesPerPage];    /* TIDs from cur page */
diff --git a/src/test/regress/expected/rangetypes.out b/src/test/regress/expected/rangetypes.out
index 05b882fde9..e6ca99d43d 100644
--- a/src/test/regress/expected/rangetypes.out
+++ b/src/test/regress/expected/rangetypes.out
@@ -1413,12 +1413,31 @@ RESET enable_bitmapscan;
 create table test_range_elem(i int4);
 create index test_range_elem_idx on test_range_elem (i);
 insert into test_range_elem select i from generate_series(1,100) i;
+SET enable_seqscan    = f;
 select count(*) from test_range_elem where i <@ int4range(10,50);
  count
 -------
     40
 (1 row)

+-- also test spgist index on anyrange expression
+create index on test_range_elem using spgist(int4range(i,i+10));
+explain (costs off)
+select count(*) from test_range_elem where int4range(i,i+10) <@ int4range(10,30);
+                               QUERY PLAN
+-------------------------------------------------------------------------
+ Aggregate
+   ->  Index Scan using test_range_elem_int4range_idx on test_range_elem
+         Index Cond: (int4range(i, (i + 10)) <@ '[10,30)'::int4range)
+(3 rows)
+
+select count(*) from test_range_elem where int4range(i,i+10) <@ int4range(10,30);
+ count
+-------
+    11
+(1 row)
+
+RESET enable_seqscan;
 drop table test_range_elem;
 --
 -- Btree_gist is not included by default, so to test exclusion
diff --git a/src/test/regress/sql/rangetypes.sql b/src/test/regress/sql/rangetypes.sql
index e1b8917c0c..cb5353de6f 100644
--- a/src/test/regress/sql/rangetypes.sql
+++ b/src/test/regress/sql/rangetypes.sql
@@ -375,8 +375,18 @@ create table test_range_elem(i int4);
 create index test_range_elem_idx on test_range_elem (i);
 insert into test_range_elem select i from generate_series(1,100) i;

+SET enable_seqscan    = f;
+
 select count(*) from test_range_elem where i <@ int4range(10,50);

+-- also test spgist index on anyrange expression
+create index on test_range_elem using spgist(int4range(i,i+10));
+explain (costs off)
+select count(*) from test_range_elem where int4range(i,i+10) <@ int4range(10,30);
+select count(*) from test_range_elem where int4range(i,i+10) <@ int4range(10,30);
+
+RESET enable_seqscan;
+
 drop table test_range_elem;

 --
diff --git a/src/test/modules/Makefile b/src/test/modules/Makefile
index 93e7829c67..dffc79b2d9 100644
--- a/src/test/modules/Makefile
+++ b/src/test/modules/Makefile
@@ -13,6 +13,7 @@ SUBDIRS = \
           libpq_pipeline \
           plsample \
           snapshot_too_old \
+          spgist_name_ops \
           test_bloomfilter \
           test_ddl_deparse \
           test_extensions \
diff --git a/src/test/modules/spgist_name_ops/.gitignore b/src/test/modules/spgist_name_ops/.gitignore
new file mode 100644
index 0000000000..5dcb3ff972
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/.gitignore
@@ -0,0 +1,4 @@
+# Generated subdirectories
+/log/
+/results/
+/tmp_check/
diff --git a/src/test/modules/spgist_name_ops/Makefile b/src/test/modules/spgist_name_ops/Makefile
new file mode 100644
index 0000000000..05b2464b27
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/Makefile
@@ -0,0 +1,23 @@
+# src/test/modules/spgist_name_ops/Makefile
+
+MODULE_big = spgist_name_ops
+OBJS = \
+    $(WIN32RES) \
+    spgist_name_ops.o
+PGFILEDESC = "spgist_name_ops - test opclass for SP-GiST"
+
+EXTENSION = spgist_name_ops
+DATA = spgist_name_ops--1.0.sql
+
+REGRESS = spgist_name_ops
+
+ifdef USE_PGXS
+PG_CONFIG = pg_config
+PGXS := $(shell $(PG_CONFIG) --pgxs)
+include $(PGXS)
+else
+subdir = src/test/modules/spgist_name_ops
+top_builddir = ../../../..
+include $(top_builddir)/src/Makefile.global
+include $(top_srcdir)/contrib/contrib-global.mk
+endif
diff --git a/src/test/modules/spgist_name_ops/README b/src/test/modules/spgist_name_ops/README
new file mode 100644
index 0000000000..119208b568
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/README
@@ -0,0 +1,8 @@
+spgist_name_ops implements an SP-GiST operator class that indexes
+columns of type "name", but with storage identical to that used
+by SP-GiST text_ops.
+
+This is not terribly useful in itself, perhaps, but it allows
+testing cases where the indexed data type is different from the leaf
+data type and yet we can reconstruct the original indexed value.
+That situation is not tested by any built-in SP-GiST opclass.
diff --git a/src/test/modules/spgist_name_ops/expected/spgist_name_ops.out
b/src/test/modules/spgist_name_ops/expected/spgist_name_ops.out
new file mode 100644
index 0000000000..b58900c506
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/expected/spgist_name_ops.out
@@ -0,0 +1,41 @@
+create extension spgist_name_ops;
+select opcname, amvalidate(opc.oid)
+from pg_opclass opc join pg_am am on am.oid = opcmethod
+where amname = 'spgist' and opcname = 'name_ops';
+ opcname  | amvalidate
+----------+------------
+ name_ops | t
+(1 row)
+
+create table t(f1 name);
+create index on t using spgist(f1);
+insert into t select proname from pg_proc;
+vacuum analyze t;
+explain (costs off)
+select * from t
+  where f1 > 'binary_upgrade_set_n' and f1 < 'binary_upgrade_set_p'
+  order by 1;
+                                            QUERY PLAN
+---------------------------------------------------------------------------------------------------
+ Sort
+   Sort Key: f1
+   ->  Index Only Scan using t_f1_idx on t
+         Index Cond: ((f1 > 'binary_upgrade_set_n'::name) AND (f1 < 'binary_upgrade_set_p'::name))
+(4 rows)
+
+select * from t
+  where f1 > 'binary_upgrade_set_n' and f1 < 'binary_upgrade_set_p'
+  order by 1;
+                          f1
+------------------------------------------------------
+ binary_upgrade_set_next_array_pg_type_oid
+ binary_upgrade_set_next_heap_pg_class_oid
+ binary_upgrade_set_next_index_pg_class_oid
+ binary_upgrade_set_next_multirange_array_pg_type_oid
+ binary_upgrade_set_next_multirange_pg_type_oid
+ binary_upgrade_set_next_pg_authid_oid
+ binary_upgrade_set_next_pg_enum_oid
+ binary_upgrade_set_next_pg_type_oid
+ binary_upgrade_set_next_toast_pg_class_oid
+(9 rows)
+
diff --git a/src/test/modules/spgist_name_ops/spgist_name_ops--1.0.sql
b/src/test/modules/spgist_name_ops/spgist_name_ops--1.0.sql
new file mode 100644
index 0000000000..dfb779578a
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/spgist_name_ops--1.0.sql
@@ -0,0 +1,39 @@
+/* src/test/modules/spgist_name_ops/spgist_name_ops--1.0.sql */
+
+-- complain if script is sourced in psql, rather than via CREATE EXTENSION
+\echo Use "CREATE EXTENSION spgist_name_ops" to load this file. \quit
+
+CREATE FUNCTION spgist_name_config(internal, internal)
+RETURNS void IMMUTABLE PARALLEL SAFE STRICT
+AS 'MODULE_PATHNAME' LANGUAGE C;
+
+CREATE FUNCTION spgist_name_choose(internal, internal)
+RETURNS void IMMUTABLE PARALLEL SAFE STRICT
+AS 'MODULE_PATHNAME' LANGUAGE C;
+
+CREATE FUNCTION spgist_name_inner_consistent(internal, internal)
+RETURNS void IMMUTABLE PARALLEL SAFE STRICT
+AS 'MODULE_PATHNAME' LANGUAGE C;
+
+CREATE FUNCTION spgist_name_leaf_consistent(internal, internal)
+RETURNS boolean IMMUTABLE PARALLEL SAFE STRICT
+AS 'MODULE_PATHNAME' LANGUAGE C;
+
+CREATE FUNCTION spgist_name_compress(name)
+RETURNS text IMMUTABLE PARALLEL SAFE STRICT
+AS 'MODULE_PATHNAME' LANGUAGE C;
+
+CREATE OPERATOR CLASS name_ops DEFAULT FOR TYPE name
+USING spgist AS
+    OPERATOR    1    < ,
+    OPERATOR    2    <= ,
+    OPERATOR    3    = ,
+    OPERATOR    4    >= ,
+    OPERATOR    5    > ,
+    FUNCTION    1    spgist_name_config(internal, internal),
+    FUNCTION    2    spgist_name_choose(internal, internal),
+    FUNCTION    3    spg_text_picksplit(internal, internal),
+    FUNCTION    4    spgist_name_inner_consistent(internal, internal),
+    FUNCTION    5    spgist_name_leaf_consistent(internal, internal),
+    FUNCTION    6    spgist_name_compress(name),
+    STORAGE text;
diff --git a/src/test/modules/spgist_name_ops/spgist_name_ops.c b/src/test/modules/spgist_name_ops/spgist_name_ops.c
new file mode 100644
index 0000000000..63d3faaef4
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/spgist_name_ops.c
@@ -0,0 +1,501 @@
+/*--------------------------------------------------------------------------
+ *
+ * spgist_name_ops.c
+ *        Test opclass for SP-GiST
+ *
+ * This indexes input values of type "name", but the index storage is "text",
+ * with the same choices as made in the core SP-GiST text_ops opclass.
+ * Much of the code is identical to src/backend/access/spgist/spgtextproc.c,
+ * which see for a more detailed header comment.
+ *
+ * Unlike spgtextproc.c, we don't bother with collation-aware logic.
+ *
+ *
+ * Portions Copyright (c) 1996-2021, PostgreSQL Global Development Group
+ * Portions Copyright (c) 1994, Regents of the University of California
+ *
+ * IDENTIFICATION
+ *        src/test/modules/spgist_name_ops/spgist_name_ops.c
+ *
+ * -------------------------------------------------------------------------
+ */
+#include "postgres.h"
+
+#include "access/spgist.h"
+#include "catalog/pg_type.h"
+#include "utils/datum.h"
+
+PG_MODULE_MAGIC;
+
+
+PG_FUNCTION_INFO_V1(spgist_name_config);
+Datum
+spgist_name_config(PG_FUNCTION_ARGS)
+{
+    /* spgConfigIn *cfgin = (spgConfigIn *) PG_GETARG_POINTER(0); */
+    spgConfigOut *cfg = (spgConfigOut *) PG_GETARG_POINTER(1);
+
+    cfg->prefixType = TEXTOID;
+    cfg->labelType = INT2OID;
+    cfg->leafType = TEXTOID;
+    cfg->canReturnData = true;
+    cfg->longValuesOK = true;    /* suffixing will shorten long values */
+    PG_RETURN_VOID();
+}
+
+/*
+ * Form a text datum from the given not-necessarily-null-terminated string,
+ * using short varlena header format if possible
+ */
+static Datum
+formTextDatum(const char *data, int datalen)
+{
+    char       *p;
+
+    p = (char *) palloc(datalen + VARHDRSZ);
+
+    if (datalen + VARHDRSZ_SHORT <= VARATT_SHORT_MAX)
+    {
+        SET_VARSIZE_SHORT(p, datalen + VARHDRSZ_SHORT);
+        if (datalen)
+            memcpy(p + VARHDRSZ_SHORT, data, datalen);
+    }
+    else
+    {
+        SET_VARSIZE(p, datalen + VARHDRSZ);
+        memcpy(p + VARHDRSZ, data, datalen);
+    }
+
+    return PointerGetDatum(p);
+}
+
+/*
+ * Find the length of the common prefix of a and b
+ */
+static int
+commonPrefix(const char *a, const char *b, int lena, int lenb)
+{
+    int            i = 0;
+
+    while (i < lena && i < lenb && *a == *b)
+    {
+        a++;
+        b++;
+        i++;
+    }
+
+    return i;
+}
+
+/*
+ * Binary search an array of int16 datums for a match to c
+ *
+ * On success, *i gets the match location; on failure, it gets where to insert
+ */
+static bool
+searchChar(Datum *nodeLabels, int nNodes, int16 c, int *i)
+{
+    int            StopLow = 0,
+                StopHigh = nNodes;
+
+    while (StopLow < StopHigh)
+    {
+        int            StopMiddle = (StopLow + StopHigh) >> 1;
+        int16        middle = DatumGetInt16(nodeLabels[StopMiddle]);
+
+        if (c < middle)
+            StopHigh = StopMiddle;
+        else if (c > middle)
+            StopLow = StopMiddle + 1;
+        else
+        {
+            *i = StopMiddle;
+            return true;
+        }
+    }
+
+    *i = StopHigh;
+    return false;
+}
+
+PG_FUNCTION_INFO_V1(spgist_name_choose);
+Datum
+spgist_name_choose(PG_FUNCTION_ARGS)
+{
+    spgChooseIn *in = (spgChooseIn *) PG_GETARG_POINTER(0);
+    spgChooseOut *out = (spgChooseOut *) PG_GETARG_POINTER(1);
+    Name        inName = DatumGetName(in->datum);
+    char       *inStr = NameStr(*inName);
+    int            inSize = strlen(inStr);
+    char       *prefixStr = NULL;
+    int            prefixSize = 0;
+    int            commonLen = 0;
+    int16        nodeChar = 0;
+    int            i = 0;
+
+    /* Check for prefix match, set nodeChar to first byte after prefix */
+    if (in->hasPrefix)
+    {
+        text       *prefixText = DatumGetTextPP(in->prefixDatum);
+
+        prefixStr = VARDATA_ANY(prefixText);
+        prefixSize = VARSIZE_ANY_EXHDR(prefixText);
+
+        commonLen = commonPrefix(inStr + in->level,
+                                 prefixStr,
+                                 inSize - in->level,
+                                 prefixSize);
+
+        if (commonLen == prefixSize)
+        {
+            if (inSize - in->level > commonLen)
+                nodeChar = *(unsigned char *) (inStr + in->level + commonLen);
+            else
+                nodeChar = -1;
+        }
+        else
+        {
+            /* Must split tuple because incoming value doesn't match prefix */
+            out->resultType = spgSplitTuple;
+
+            if (commonLen == 0)
+            {
+                out->result.splitTuple.prefixHasPrefix = false;
+            }
+            else
+            {
+                out->result.splitTuple.prefixHasPrefix = true;
+                out->result.splitTuple.prefixPrefixDatum =
+                    formTextDatum(prefixStr, commonLen);
+            }
+            out->result.splitTuple.prefixNNodes = 1;
+            out->result.splitTuple.prefixNodeLabels =
+                (Datum *) palloc(sizeof(Datum));
+            out->result.splitTuple.prefixNodeLabels[0] =
+                Int16GetDatum(*(unsigned char *) (prefixStr + commonLen));
+
+            out->result.splitTuple.childNodeN = 0;
+
+            if (prefixSize - commonLen == 1)
+            {
+                out->result.splitTuple.postfixHasPrefix = false;
+            }
+            else
+            {
+                out->result.splitTuple.postfixHasPrefix = true;
+                out->result.splitTuple.postfixPrefixDatum =
+                    formTextDatum(prefixStr + commonLen + 1,
+                                  prefixSize - commonLen - 1);
+            }
+
+            PG_RETURN_VOID();
+        }
+    }
+    else if (inSize > in->level)
+    {
+        nodeChar = *(unsigned char *) (inStr + in->level);
+    }
+    else
+    {
+        nodeChar = -1;
+    }
+
+    /* Look up nodeChar in the node label array */
+    if (searchChar(in->nodeLabels, in->nNodes, nodeChar, &i))
+    {
+        /*
+         * Descend to existing node.  (If in->allTheSame, the core code will
+         * ignore our nodeN specification here, but that's OK.  We still have
+         * to provide the correct levelAdd and restDatum values, and those are
+         * the same regardless of which node gets chosen by core.)
+         */
+        int            levelAdd;
+
+        out->resultType = spgMatchNode;
+        out->result.matchNode.nodeN = i;
+        levelAdd = commonLen;
+        if (nodeChar >= 0)
+            levelAdd++;
+        out->result.matchNode.levelAdd = levelAdd;
+        if (inSize - in->level - levelAdd > 0)
+            out->result.matchNode.restDatum =
+                formTextDatum(inStr + in->level + levelAdd,
+                              inSize - in->level - levelAdd);
+        else
+            out->result.matchNode.restDatum =
+                formTextDatum(NULL, 0);
+    }
+    else if (in->allTheSame)
+    {
+        /*
+         * Can't use AddNode action, so split the tuple.  The upper tuple has
+         * the same prefix as before and uses a dummy node label -2 for the
+         * lower tuple.  The lower tuple has no prefix and the same node
+         * labels as the original tuple.
+         *
+         * Note: it might seem tempting to shorten the upper tuple's prefix,
+         * if it has one, then use its last byte as label for the lower tuple.
+         * But that doesn't win since we know the incoming value matches the
+         * whole prefix: we'd just end up splitting the lower tuple again.
+         */
+        out->resultType = spgSplitTuple;
+        out->result.splitTuple.prefixHasPrefix = in->hasPrefix;
+        out->result.splitTuple.prefixPrefixDatum = in->prefixDatum;
+        out->result.splitTuple.prefixNNodes = 1;
+        out->result.splitTuple.prefixNodeLabels = (Datum *) palloc(sizeof(Datum));
+        out->result.splitTuple.prefixNodeLabels[0] = Int16GetDatum(-2);
+        out->result.splitTuple.childNodeN = 0;
+        out->result.splitTuple.postfixHasPrefix = false;
+    }
+    else
+    {
+        /* Add a node for the not-previously-seen nodeChar value */
+        out->resultType = spgAddNode;
+        out->result.addNode.nodeLabel = Int16GetDatum(nodeChar);
+        out->result.addNode.nodeN = i;
+    }
+
+    PG_RETURN_VOID();
+}
+
+/* The picksplit function is identical to the core opclass, so just use that */
+
+PG_FUNCTION_INFO_V1(spgist_name_inner_consistent);
+Datum
+spgist_name_inner_consistent(PG_FUNCTION_ARGS)
+{
+    spgInnerConsistentIn *in = (spgInnerConsistentIn *) PG_GETARG_POINTER(0);
+    spgInnerConsistentOut *out = (spgInnerConsistentOut *) PG_GETARG_POINTER(1);
+    text       *reconstructedValue;
+    text       *reconstrText;
+    int            maxReconstrLen;
+    text       *prefixText = NULL;
+    int            prefixSize = 0;
+    int            i;
+
+    /*
+     * Reconstruct values represented at this tuple, including parent data,
+     * prefix of this tuple if any, and the node label if it's non-dummy.
+     * in->level should be the length of the previously reconstructed value,
+     * and the number of bytes added here is prefixSize or prefixSize + 1.
+     *
+     * Recall that reconstructedValues are assumed to be the same type as leaf
+     * datums, so we must use "text" not "name" for them.
+     *
+     * Note: we assume that in->reconstructedValue isn't toasted and doesn't
+     * have a short varlena header.  This is okay because it must have been
+     * created by a previous invocation of this routine, and we always emit
+     * long-format reconstructed values.
+     */
+    reconstructedValue = (text *) DatumGetPointer(in->reconstructedValue);
+    Assert(reconstructedValue == NULL ? in->level == 0 :
+           VARSIZE_ANY_EXHDR(reconstructedValue) == in->level);
+
+    maxReconstrLen = in->level + 1;
+    if (in->hasPrefix)
+    {
+        prefixText = DatumGetTextPP(in->prefixDatum);
+        prefixSize = VARSIZE_ANY_EXHDR(prefixText);
+        maxReconstrLen += prefixSize;
+    }
+
+    reconstrText = palloc(VARHDRSZ + maxReconstrLen);
+    SET_VARSIZE(reconstrText, VARHDRSZ + maxReconstrLen);
+
+    if (in->level)
+        memcpy(VARDATA(reconstrText),
+               VARDATA(reconstructedValue),
+               in->level);
+    if (prefixSize)
+        memcpy(((char *) VARDATA(reconstrText)) + in->level,
+               VARDATA_ANY(prefixText),
+               prefixSize);
+    /* last byte of reconstrText will be filled in below */
+
+    /*
+     * Scan the child nodes.  For each one, complete the reconstructed value
+     * and see if it's consistent with the query.  If so, emit an entry into
+     * the output arrays.
+     */
+    out->nodeNumbers = (int *) palloc(sizeof(int) * in->nNodes);
+    out->levelAdds = (int *) palloc(sizeof(int) * in->nNodes);
+    out->reconstructedValues = (Datum *) palloc(sizeof(Datum) * in->nNodes);
+    out->nNodes = 0;
+
+    for (i = 0; i < in->nNodes; i++)
+    {
+        int16        nodeChar = DatumGetInt16(in->nodeLabels[i]);
+        int            thisLen;
+        bool        res = true;
+        int            j;
+
+        /* If nodeChar is a dummy value, don't include it in data */
+        if (nodeChar <= 0)
+            thisLen = maxReconstrLen - 1;
+        else
+        {
+            ((unsigned char *) VARDATA(reconstrText))[maxReconstrLen - 1] = nodeChar;
+            thisLen = maxReconstrLen;
+        }
+
+        for (j = 0; j < in->nkeys; j++)
+        {
+            StrategyNumber strategy = in->scankeys[j].sk_strategy;
+            Name        inName;
+            char       *inStr;
+            int            inSize;
+            int            r;
+
+            inName = DatumGetName(in->scankeys[j].sk_argument);
+            inStr = NameStr(*inName);
+            inSize = strlen(inStr);
+
+            r = memcmp(VARDATA(reconstrText), inStr,
+                       Min(inSize, thisLen));
+
+            switch (strategy)
+            {
+                case BTLessStrategyNumber:
+                case BTLessEqualStrategyNumber:
+                    if (r > 0)
+                        res = false;
+                    break;
+                case BTEqualStrategyNumber:
+                    if (r != 0 || inSize < thisLen)
+                        res = false;
+                    break;
+                case BTGreaterEqualStrategyNumber:
+                case BTGreaterStrategyNumber:
+                    if (r < 0)
+                        res = false;
+                    break;
+                default:
+                    elog(ERROR, "unrecognized strategy number: %d",
+                         in->scankeys[j].sk_strategy);
+                    break;
+            }
+
+            if (!res)
+                break;            /* no need to consider remaining conditions */
+        }
+
+        if (res)
+        {
+            out->nodeNumbers[out->nNodes] = i;
+            out->levelAdds[out->nNodes] = thisLen - in->level;
+            SET_VARSIZE(reconstrText, VARHDRSZ + thisLen);
+            out->reconstructedValues[out->nNodes] =
+                datumCopy(PointerGetDatum(reconstrText), false, -1);
+            out->nNodes++;
+        }
+    }
+
+    PG_RETURN_VOID();
+}
+
+PG_FUNCTION_INFO_V1(spgist_name_leaf_consistent);
+Datum
+spgist_name_leaf_consistent(PG_FUNCTION_ARGS)
+{
+    spgLeafConsistentIn *in = (spgLeafConsistentIn *) PG_GETARG_POINTER(0);
+    spgLeafConsistentOut *out = (spgLeafConsistentOut *) PG_GETARG_POINTER(1);
+    int            level = in->level;
+    text       *leafValue,
+               *reconstrValue = NULL;
+    char       *fullValue;
+    int            fullLen;
+    bool        res;
+    int            j;
+
+    /* all tests are exact */
+    out->recheck = false;
+
+    leafValue = DatumGetTextPP(in->leafDatum);
+
+    /* As above, in->reconstructedValue isn't toasted or short. */
+    if (DatumGetPointer(in->reconstructedValue))
+        reconstrValue = (text *) DatumGetPointer(in->reconstructedValue);
+
+    Assert(reconstrValue == NULL ? level == 0 :
+           VARSIZE_ANY_EXHDR(reconstrValue) == level);
+
+    /* Reconstruct the Name represented by this leaf tuple */
+    fullValue = palloc0(NAMEDATALEN);
+    fullLen = level + VARSIZE_ANY_EXHDR(leafValue);
+    Assert(fullLen < NAMEDATALEN);
+    if (VARSIZE_ANY_EXHDR(leafValue) == 0 && level > 0)
+    {
+        memcpy(fullValue, VARDATA(reconstrValue),
+               VARSIZE_ANY_EXHDR(reconstrValue));
+    }
+    else
+    {
+        if (level)
+            memcpy(fullValue, VARDATA(reconstrValue), level);
+        if (VARSIZE_ANY_EXHDR(leafValue) > 0)
+            memcpy(fullValue + level, VARDATA_ANY(leafValue),
+                   VARSIZE_ANY_EXHDR(leafValue));
+    }
+    out->leafValue = PointerGetDatum(fullValue);
+
+    /* Perform the required comparison(s) */
+    res = true;
+    for (j = 0; j < in->nkeys; j++)
+    {
+        StrategyNumber strategy = in->scankeys[j].sk_strategy;
+        Name        queryName = DatumGetName(in->scankeys[j].sk_argument);
+        char       *queryStr = NameStr(*queryName);
+        int            queryLen = strlen(queryStr);
+        int            r;
+
+        /* Non-collation-aware comparison */
+        r = memcmp(fullValue, queryStr, Min(queryLen, fullLen));
+
+        if (r == 0)
+        {
+            if (queryLen > fullLen)
+                r = -1;
+            else if (queryLen < fullLen)
+                r = 1;
+        }
+
+        switch (strategy)
+        {
+            case BTLessStrategyNumber:
+                res = (r < 0);
+                break;
+            case BTLessEqualStrategyNumber:
+                res = (r <= 0);
+                break;
+            case BTEqualStrategyNumber:
+                res = (r == 0);
+                break;
+            case BTGreaterEqualStrategyNumber:
+                res = (r >= 0);
+                break;
+            case BTGreaterStrategyNumber:
+                res = (r > 0);
+                break;
+            default:
+                elog(ERROR, "unrecognized strategy number: %d",
+                     in->scankeys[j].sk_strategy);
+                res = false;
+                break;
+        }
+
+        if (!res)
+            break;                /* no need to consider remaining conditions */
+    }
+
+    PG_RETURN_BOOL(res);
+}
+
+PG_FUNCTION_INFO_V1(spgist_name_compress);
+Datum
+spgist_name_compress(PG_FUNCTION_ARGS)
+{
+    Name        inName = PG_GETARG_NAME(0);
+    char       *inStr = NameStr(*inName);
+
+    PG_RETURN_DATUM(formTextDatum(inStr, strlen(inStr)));
+}
diff --git a/src/test/modules/spgist_name_ops/spgist_name_ops.control
b/src/test/modules/spgist_name_ops/spgist_name_ops.control
new file mode 100644
index 0000000000..f02df7ef85
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/spgist_name_ops.control
@@ -0,0 +1,4 @@
+comment = 'Test opclass for SP-GiST'
+default_version = '1.0'
+module_pathname = '$libdir/spgist_name_ops'
+relocatable = true
diff --git a/src/test/modules/spgist_name_ops/sql/spgist_name_ops.sql
b/src/test/modules/spgist_name_ops/sql/spgist_name_ops.sql
new file mode 100644
index 0000000000..0493ac3553
--- /dev/null
+++ b/src/test/modules/spgist_name_ops/sql/spgist_name_ops.sql
@@ -0,0 +1,18 @@
+create extension spgist_name_ops;
+
+select opcname, amvalidate(opc.oid)
+from pg_opclass opc join pg_am am on am.oid = opcmethod
+where amname = 'spgist' and opcname = 'name_ops';
+
+create table t(f1 name);
+create index on t using spgist(f1);
+insert into t select proname from pg_proc;
+vacuum analyze t;
+
+explain (costs off)
+select * from t
+  where f1 > 'binary_upgrade_set_n' and f1 < 'binary_upgrade_set_p'
+  order by 1;
+select * from t
+  where f1 > 'binary_upgrade_set_n' and f1 < 'binary_upgrade_set_p'
+  order by 1;

Re: SP-GiST confusion: indexed column's type vs. index column type

От

Tom Lane

Дата:

04 апреля 2021 г., 18:40:01

I wrote:
> I propose changing things so that
> (B) We enforce that leafType agrees with the opclass opckeytype,
> ensuring the index tupdesc can be used for leaf tuples.

After looking at PostGIS I realized that a hard restriction of this
sort won't fly, because it'd make upgrades impossible for them.
They have some lossy SPGiST opclasses, in which leafType is returned
as different from the original input datatype.  Since up to now
we've disallowed the STORAGE clause for user-defined SPGiST
opclasses, they are unable to declare these opclasses honestly in
existing releases, but it didn't matter.  If we enforce that STORAGE
matches leafType then their existing opclass definitions won't work
in v14, but they can't fix them before upgrading either.

So I backed off the complaint about that to be just an amvalidate
warning, and pushed it.

This means the INCLUDE patch will still have to account for the
possibility that the index tupdesc is an inaccurate representation
of the actual leaf tuple contents, but we can treat that case less
efficiently without feeling bad about it.  So we should be able to
do something similar for the leaf tupdesc as for the index-only-scan
output tupdesc, that is use the relcache's tupdesc if it's got the
right first column type, otherwise copy-and-modify that tupdesc.

            regards, tom lane

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Обсуждение: SP-GiST confusion: indexed column's type vs. index column type

SP-GiST confusion: indexed column's type vs. index column type

Re: SP-GiST confusion: indexed column's type vs. index column type

Re: SP-GiST confusion: indexed column's type vs. index column type

Re: SP-GiST confusion: indexed column's type vs. index column type

Re: SP-GiST confusion: indexed column's type vs. index column type

Re: SP-GiST confusion: indexed column's type vs. index column type

Re: SP-GiST confusion: indexed column's type vs. index column type