The following bug has been logged on the website:
Bug reference: 10589
Logged by: Sörös Zoltán
Email address: zsoros@gmail.com
PostgreSQL version: 9.3.4
Operating system: Linux
Description:
Hi!
The 'hungarian.stop' file (for tsearch, located in
src/backend/snowball/stopwords in the source tarball) contains the õ
('otilde' in HTML) character instead of the correct 'Å' character. (There
are 7 occuerences in this file.)
Our database uses latin2 encoding, where we use the correct 'Å' characters.
Here's an excerpt from today's log:
< 2014-06-10 08:49:24.416 CEST >ERROR: character with byte sequence 0xc3
0xb5 in encoding "UTF8" has no equivalent in encoding "LATIN2"
< 2014-06-10 08:49:24.416 CEST >CONTEXT: line 58 of configuration file
"/usr/pgsql-9.3/share/tsearch_data/hungarian.stop"
After I replaced the tilde-capped letters in hungarian.stop file, the
problem vanished, and tsearch works fine.
I'm sorry, I can't give you the utf8 byte sequence for 'Å', but I can send
the corrected hungarian.stop file if needed.
Please fix this file in the next release.
Thanks in advance,
Zoltán Sörös