Re: speed up verifying UTF-8

Поиск
Список
Период
Сортировка
От Heikki Linnakangas
Тема Re: speed up verifying UTF-8
Дата
Msg-id 8f9ceae6-9d16-bb66-9292-a82cc8f150e4@iki.fi
обсуждение исходный текст
Ответ на Re: speed up verifying UTF-8  (Heikki Linnakangas <hlinnaka@iki.fi>)
Ответы Re: speed up verifying UTF-8  (John Naylor <john.naylor@enterprisedb.com>)
Список pgsql-hackers
On 03/06/2021 22:16, Heikki Linnakangas wrote:
> On 03/06/2021 22:10, John Naylor wrote:
>> On Thu, Jun 3, 2021 at 3:08 PM Heikki Linnakangas <hlinnaka@iki.fi
>> <mailto:hlinnaka@iki.fi>> wrote:
>>   >                 x1 = half1 + UINT64CONST(0x7f7f7f7f7f7f7f7f);
>>   >                 x2 = half2 + UINT64CONST(0x7f7f7f7f7f7f7f7f);
>>   >
>>   >                 /* then check that the high bit is set in each byte. */
>>   >                 x = (x1 | x2);
>>   >                 x &= UINT64CONST(0x8080808080808080);
>>   >                 if (x != UINT64CONST(0x8080808080808080))
>>   >                         return 0;
>>
>> That seems right, I'll try that and update the patch. (Forgot to attach
>> earlier anyway)
> 
> Ugh, actually that has the same issue as before. If one of the bytes is
> in one half is zero, but not in the other half, this fail to detect it.
> Sorry for the noise..

If you replace (x1 | x2) with (x1 & x2) above, I think it's correct.

- Heikki



В списке pgsql-hackers по дате отправления:

Предыдущее
От: David Christensen
Дата:
Сообщение: Re: [PATCH] expand the units that pg_size_pretty supports on output
Следующее
От: Pavel Stehule
Дата:
Сообщение: Re: security_definer_search_path GUC