Re: DatabaseMetaData.getExtraNameCharacters

Поиск

Список

Период

Сортировка

От	Kris Jurka
Тема	Re: DatabaseMetaData.getExtraNameCharacters
Дата	25 мая 2005 г. 17:00:08
Msg-id	Pine.BSO.4.56.0505251452050.30233@leary.csoft.net обсуждение исходный текст
Ответ на	Re: DatabaseMetaData.getExtraNameCharacters (Giuseppe Sacco <giuseppe@eppesuigoccas.homedns.org>)
Ответы	Re: DatabaseMetaData.getExtraNameCharacters
Список	pgsql-jdbc

Дерево обсуждения

On Wed, 25 May 2005, Giuseppe Sacco wrote:

> Il giorno mer, 25-05-2005 alle 13:25 -0500, Kris Jurka ha scritto:
> [...]
> > ident_start             [A-Za-z\200-\377_]
> > ident_cont              [A-Za-z\200-\377_0-9\$]
> > identifier              {ident_start}{ident_cont}*
> >
> > So \200-\377 is octal for any character with the high bit set.  The list
> > of what characters this could map to numbers in the tens of thousands for
> > unicode, so it's not really feasibly to return in this method.
>
> if I understand correclty, the valid charset is the one computed by the
> attached class. It seems to me that this is 191 characters long.
> Could you please let me know where I am wrong?

You are assuming that each character is only one byte.  The backend lexing
rules are a byte by byte operation, but the JDBC side is returning a
String of characters.  Consider the character "Latin Small Letter s with
Acute" (\u015B) gets encoded in UTF-8 as C5 9B or \305\233 in octal.  This
is one character in the result of getExtraNameCharacters.

Kris Jurka

В списке pgsql-jdbc по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: DatabaseMetaData.getExtraNameCharacters