Re: Designing an extension for feature-space similarity search

Поиск

Список

Период

Сортировка

От	Jay Levitt
Тема	Re: Designing an extension for feature-space similarity search
Дата	17 февраля 2012 г. 18:00:46
Msg-id	4F3EA3CD.1070103@gmail.com обсуждение исходный текст
Ответ на	Re: Designing an extension for feature-space similarity search (Tom Lane <tgl@sss.pgh.pa.us>)
Ответы	Re: Designing an extension for feature-space similarity search (Alexander Korotkov <aekorotkov@gmail.com>)
Список	pgsql-hackers

Дерево обсуждения

Tom Lane wrote:
> Jay Levitt<jay.levitt@gmail.com>  writes:
>> - Does KNN-GiST run into problems when<->  returns values that don't "make
>> sense" in the physical world?
>
> If the indexed entities are records, it would be
> entirely your own business how you handled individual fields being NULL.

This turns out to be a bit challenging. Let's say I'm building a 
nullable_point type that allows the Y axis to be NULL (or any sentinel value 
for "missing data"), where the semantics are "NULL is infinitely far from 
the query".   I'll need my GiST functions to return useful results with NULL 
- not just correct results, but results that help partition the tree nicely.

At first I thought this posed a challenge for union; if I have these points:

(1,2)
(2,1)
(1,NULL)

what's the union? I think the answer is to treat NULL box coordinates like 
LL = -infinity, UR = infinity, or (equivalently, I think) to store a 
saw_nulls bit in addition to LL and UR.

The real challenge is probably in picksplit and penalty - where in the tree 
should I stick (1,NULL)? - at which point you say "Yes, algorithms for 
efficient indexes are hard work and computer-science-y" and point me at 
surrogate splitters.

Just thinking out loud, I guess; if other GiST types have addressed this 
problem, I'd love to hear about it.

Jay

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Jeff MacDonald
Дата: 17 февраля 2012 г., 17:32:12
Сообщение: Re: MySQL search query is not executing in Postgres DB

Следующее

От: Robert Haas
Дата: 17 февраля 2012 г., 18:10:13
Сообщение: Re: Simulating Clog Contention

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Designing an extension for feature-space similarity search

Предыдущее

Следующее