Discussion: Text search, ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a
I downloaded the hunspell en_GB from
http://wiki.services.openoffice.org/wiki/Dictionaries#English_.28AU.2CCA.2CGB.2CNZ.2CUS.2CZA.29
and when building the Ispell dictionary I got the following error
ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a
HINT: This error can also happen if the byte sequence does not match the encoding expected by the server, which is controlled by "client_encoding".
CONTEXT: line 220 of configuration file "C:/Program Files/PostgreSQL/8.3/share/tsearch_data/en_gb.dict"
CREATE TEXT SEARCH DICTIONARY english_ispell (
TEMPLATE = ispell,
DictFile = en_GB,
AffFile = en_GB,
StopWords = english
);
James Dooley <jamdooley@gmail.com> writes:
> and when building the Ispell dictionary I got the following error
> ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a

What PG version? 8.3.x before 8.3.4 had some problems in this area.

regards, tom lane
It's postgresql-8.3.5-2 (windows)
On Tue, Feb 3, 2009 at 4:37 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
> James Dooley <jamdooley@gmail.com> writes:
> > and when building the Ispell dictionary I got the following error
> > ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a
>
> What PG version? 8.3.x before 8.3.4 had some problems in this area.
>
> regards, tom lane
James,

you forgot to convert files to UTF8.

iconv -f ISO8859-1 -t utf8 en_GB.dic > en_gb.dict
iconv -f ISO8859-1 -t utf8 en_GB.aff > en_gb.affix

Oleg

On Tue, 3 Feb 2009, James Dooley wrote:
> I downloaded the hunspell en_GB from
>
> http://wiki.services.openoffice.org/wiki/Dictionaries#English_.28AU.2CCA.2CGB.2CNZ.2CUS.2CZA.29
>
> and when building the Ispell dictionary I got the following error
>
> ERROR: invalid byte sequence for encoding "UTF8": 0xe9640a
> HINT: This error can also happen if the byte sequence does not match the
> encoding expected by the server, which is controlled by "client_encoding".
> CONTEXT: line 220 of configuration file "C:/Program
> Files/PostgreSQL/8.3/share/tsearch_data/en_gb.dict"
>
> CREATE TEXT SEARCH DICTIONARY english_ispell (
> TEMPLATE = ispell,
> DictFile = en_GB,
> AffFile = en_GB,
> StopWords = english
> );
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
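
For readers without iconv (for instance on Windows), the same conversion can be sketched in Python. This is a minimal illustration, not part of the original thread: the helper names `convert_latin1_to_utf8` and `is_valid_utf8` are hypothetical, and it assumes, as the iconv commands above do, that the downloaded files are ISO-8859-1. It also shows why the reported bytes fail: 0xe9 is "é" in ISO-8859-1, but in UTF-8 it starts a three-byte sequence, and the bytes after it here (0x64, 0x0a) are not continuation bytes.

```python
import os
import tempfile


def is_valid_utf8(data):
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def convert_latin1_to_utf8(src_path, dst_path):
    """Re-encode a file from ISO-8859-1 to UTF-8.

    Files are opened in binary mode so line endings pass through unchanged.
    """
    with open(src_path, "rb") as src:
        raw = src.read()
    # Every byte value is a valid ISO-8859-1 character, so this cannot fail;
    # it will silently produce wrong text if the source encoding is different.
    text = raw.decode("iso-8859-1")
    with open(dst_path, "wb") as dst:
        dst.write(text.encode("utf-8"))


if __name__ == "__main__":
    # The three bytes from the error message: 0xe9 0x64 0x0a.
    reported = bytes([0xE9, 0x64, 0x0A])
    print("valid UTF-8?", is_valid_utf8(reported))
    print("as ISO-8859-1:", repr(reported.decode("iso-8859-1")))

    # Round trip on a throwaway file (sample content, not the real dictionary).
    workdir = tempfile.mkdtemp()
    src = os.path.join(workdir, "sample.dic")
    dst = os.path.join(workdir, "sample.dict")
    with open(src, "wb") as f:
        f.write(reported)
    convert_latin1_to_utf8(src, dst)
    with open(dst, "rb") as f:
        print("converted file valid UTF-8?", is_valid_utf8(f.read()))
```

To mirror the commands above, one would call `convert_latin1_to_utf8("en_GB.dic", "en_gb.dict")` and `convert_latin1_to_utf8("en_GB.aff", "en_gb.affix")`, place the results in the `tsearch_data` directory, and re-run the CREATE TEXT SEARCH DICTIONARY statement.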