Re: scoring differences between bitmasks

Поиск

Список

Период

Сортировка

От	Todd A. Cook
Тема	Re: scoring differences between bitmasks
Дата	2 октября 2005 г. 21:02:46
Msg-id	43402A95.5090701@blackducksoftware.com обсуждение исходный текст
Ответ на	scoring differences between bitmasks (Ben <bench@silentmedia.com>)
Список	pgsql-general

Дерево обсуждения

Hi,

It may be that I don't understand your problem. :)

Are you searching the table for the closest vector?  If so, is
"closeness" defined only as the number of bits that are different?
Or, do you need to know which bits as well?

-- todd


Ben wrote:
> Hrm, I don't understand. Can you give me an example with some
> reasonably sized vectors?
>
> On Oct 2, 2005, at 10:59 AM, Todd A. Cook wrote:
>
>> Hi,
>>
>> Try breaking the vector into 4 bigint columns and building a multi-
>> column
>> index, with index columns going from the most evenly distributed to  the
>> least.  Depending on the distribution of your data, you may only  need 2
>> or 3 columns in the index.  If you can cluster the table in that  order,
>> it should be really fast.  (This structure is a tabular form of a  linked
>> trie.)
>>
>> -- todd
>>
>>
>> Ben wrote:
>>
>>> Yes, that's the straightforward way to do it. But given that my
>>> vectors are 256 bits in length, and that I'm going to eventually
>>> have  about 4 million of them to search through, I was hoping
>>> greater minds  than mine had figured out how to do it faster, or  how
>>> compute some  kind of indexing....... somehow.

В списке pgsql-general по дате отправления:

Предыдущее

От: Ben
Дата: 02 октября 2005 г., 20:57:08
Сообщение: Re: scoring differences between bitmasks

Следующее

От: Martijn van Oosterhout
Дата: 02 октября 2005 г., 21:14:51
Сообщение: Re: 8.1 'make check' fails

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: scoring differences between bitmasks

Предыдущее

Следующее