Thread: Text search, ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a


Text search, ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a

From
James Dooley
Date:
I downloaded the hunspell en_GB from

http://wiki.services.openoffice.org/wiki/Dictionaries#English_.28AU.2CCA.2CGB.2CNZ.2CUS.2CZA.29

and when building the Ispell dictionary I got the following error

ERROR:  invalid byte sequence for encoding "UTF8": 0xe9640a
HINT:  This error can also happen if the byte sequence does not match the encoding expected by the server, which is controlled by "client_encoding".
CONTEXT:  line 220 of configuration file "C:/Program Files/PostgreSQL/8.3/share/tsearch_data/en_gb.dict"

CREATE TEXT SEARCH DICTIONARY english_ispell (
    TEMPLATE = ispell,
    DictFile = en_GB,
    AffFile = en_GB,
    StopWords = english
);
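
For reference, the bytes reported in the error can be checked outside PostgreSQL. The sketch below (Python, not part of the original message) assumes the reported value 0xe9640a is the three-byte sequence E9 64 0A. It shows that this sequence is not valid UTF-8 but decodes cleanly as ISO-8859-1, which hints that the dictionary file is stored in a single-byte Latin encoding.

```python
# The three bytes from the error message, assumed to be E9 64 0A.
raw = bytes([0xE9, 0x64, 0x0A])

# In UTF-8, 0xE9 opens a three-byte sequence and must be followed by
# two continuation bytes in the range 0x80-0xBF; 0x64 ('d') is not one.
try:
    raw.decode("utf-8")
    valid_utf8 = True
except UnicodeDecodeError:
    valid_utf8 = False

# In ISO-8859-1 every byte maps to one character, so this always succeeds.
as_latin1 = raw.decode("iso-8859-1")

print(valid_utf8)
print(repr(as_latin1))
```

Read as ISO-8859-1, the bytes are an "é", a "d", and a line feed, that is, the end of a dictionary line.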

Re: Text search, ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a

From
Tom Lane
Date:
James Dooley <jamdooley@gmail.com> writes:
> and when building the Ispell dictionary I got the following error

> ERROR:  invalid byte sequence for encoding "UTF8": 0xe9640a

What PG version?  8.3.x before 8.3.4 had some problems in this area.

            regards, tom lane

Re: Text search, ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a

From
James Dooley
Date:
It's postgresql-8.3.5-2 (Windows)



On Tue, Feb 3, 2009 at 4:37 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> James Dooley <jamdooley@gmail.com> writes:
> > and when building the Ispell dictionary I got the following error
> >
> > ERROR:  invalid byte sequence for encoding "UTF8": 0xe9640a
>
> What PG version?  8.3.x before 8.3.4 had some problems in this area.
>
>                        regards, tom lane

Re: Text search, ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a

From
Oleg Bartunov
Date:
James,

you forgot to convert the files to UTF-8:
iconv -f ISO8859-1 -t utf8 en_GB.dic > en_gb.dict
iconv -f ISO8859-1 -t utf8 en_GB.aff > en_gb.affix
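
Where iconv is not available (for example on a stock Windows install), the same conversion can be sketched in Python. This is only a rough equivalent of the commands above: the helper name `latin1_to_utf8` and the sample dictionary entry are made up for illustration, and it assumes the downloaded files really are ISO-8859-1.

```python
import tempfile
from pathlib import Path


def latin1_to_utf8(src, dst):
    """Re-encode an ISO-8859-1 file as UTF-8 (hypothetical helper)."""
    # Work on bytes so line endings pass through unchanged.
    text = Path(src).read_bytes().decode("iso-8859-1")
    Path(dst).write_bytes(text.encode("utf-8"))


# Small demonstration on a throwaway file; the sample entry is invented.
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "en_GB.dic"
    dst = Path(tmp) / "en_gb.dict"
    src.write_bytes(b"1\ncaf\xe9\n")  # 0xE9 is "é" in ISO-8859-1
    latin1_to_utf8(src, dst)
    converted = dst.read_bytes()

print(converted)
```

With the real files the calls would be `latin1_to_utf8("en_GB.dic", "en_gb.dict")` and `latin1_to_utf8("en_GB.aff", "en_gb.affix")`, matching the output names used in the iconv commands.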

Oleg
On Tue, 3 Feb 2009, James Dooley wrote:

> I downloaded the hunspell en_GB from
>
> http://wiki.services.openoffice.org/wiki/Dictionaries#English_.28AU.2CCA.2CGB.2CNZ.2CUS.2CZA.29
>
> and when building the Ispell dictionary I got the following error
>
> ERROR:  invalid byte sequence for encoding "UTF8": 0xe9640a
> HINT:  This error can also happen if the byte sequence does not match the
> encoding expected by the server, which is controlled by "client_encoding".
> CONTEXT:  line 220 of configuration file "C:/Program
> Files/PostgreSQL/8.3/share/tsearch_data/en_gb.dict"
>
> CREATE TEXT SEARCH DICTIONARY english_ispell (
>    TEMPLATE = ispell,
>    DictFile = en_GB,
>    AffFile = en_GB,
>    StopWords = english
> );
>

     Regards,
         Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83