encoding question

Поиск
Список
Период
Сортировка
От Ben K.
Тема encoding question
Дата
Msg-id Pine.GSO.4.64.0603201643250.21796@coe.tamu.edu
обсуждение исходный текст
Ответы Re: encoding question  (Ivo Rossacher <rossacher@bluewin.ch>)
Re: encoding question  (Tom Lane <tgl@sss.pgh.pa.us>)
Список pgsql-admin
Hi,

My main server is on Solaris (LANG=C), postgresql 8.1.0, with encoding
SQL_ASCII.

I'm trying to set up a backup server on linux (LANG=en_US.UTF-8), with
encoding UTF8.

I thought UTF8 was safe if all I have is ascii, since there was a relevant
discussion in this list a short while ago, but I'm getting this error, for
a few records (out of tens of thousands in that database).

===============
ERROR:  invalid UTF-8 byte sequence detected near byte 0x85
CONTEXT:  COPY tblcoursesinfo, line 30, column title: "Seminar in<85>
..."
===============

The dump was created by 8.1.0's pg_dump with no options.

That line in the dump has the following contents.

Text:

n<85>       Knowledg

Binary:

02e2100: 6e85 2020 2020 2020 2020 2020 2020 2020  n.
02e2110: 2020 2020 2020 2020 2020 2020 2020 2020
02e2120: 2020 2020 2020 2020 2020 2020 2020 2020
02e2130: 2020 2020 2020 2020 2020 2020 2020 2020
02e2140: 2020 2020 2020 2020 2020 2020 2020 2020
02e2150: 2020 2020 2020 2020 2020 2020 2020 2020
02e2160: 2020 2020 2020 2020 2020 2020 2020 2020
02e2170: 2020 2020 2020 2009 4b6e 6f77 6c65 6467         .Knowledg

Where could this be coming from? Would appreciate help.


Regards,

Ben K.
Developer
http://benix.tamu.edu

В списке pgsql-admin по дате отправления:

Предыдущее
От: Ivo Rossacher
Дата:
Сообщение: Re: Character encoding problems and dump import
Следующее
От: Ivo Rossacher
Дата:
Сообщение: Re: encoding question