Re: how to ignore invalid byte sequence for encoding without using sql_ascii?

Поиск
Список
Период
Сортировка
От Martijn van Oosterhout
Тема Re: how to ignore invalid byte sequence for encoding without using sql_ascii?
Дата
Msg-id 20071002073025.GA12469@svana.org
обсуждение исходный текст
Ответ на how to ignore invalid byte sequence for encoding without using sql_ascii?  ("detrox@gmail.com" <detrox@gmail.com>)
Ответы Re: how to ignore invalid byte sequence for encoding without using sql_ascii?  ("detrox yang" <detrox@gmail.com>)
Список pgsql-general
On Thu, Sep 27, 2007 at 02:28:27AM -0700, detrox@gmail.com wrote:
> I am now importing the dump file of wikipedia into my postgresql using
> maintains/importDump.php. It fails on 'ERROR: invalid byte sequence
> for encoding UTF-8'. Is there any way to let pgsql just ignore the
> invalid characters ( i mean that drop the invalid ones ), that the
> script will keep going without die on this error.

No, postgres does not destroy data. It you want bits of your data
removed you need to write your own tool to do it.

That said, are you sure that the data you're importing is UTF-8?

Have a nice day,
--
Martijn van Oosterhout   <kleptog@svana.org>   http://svana.org/kleptog/
> From each according to his ability. To each according to his ability to litigate.

Вложения

В списке pgsql-general по дате отправления:

Предыдущее
От: "Albe Laurenz"
Дата:
Сообщение: Re: Find out encoding of data
Следующее
От: Martijn van Oosterhout
Дата:
Сообщение: Re: more problems with count(*) on large table