Re: BUG #15548: Unaccent does not remove combining diacritical characters

Поиск
Список
Период
Сортировка
От raam narayana
Тема Re: BUG #15548: Unaccent does not remove combining diacritical characters
Дата
Msg-id 154982918542.11785.1374991294537224097.pgcf@coridan.postgresql.org
обсуждение исходный текст
Ответы Re: BUG #15548: Unaccent does not remove combining diacritical characters  (Thomas Munro <thomas.munro@enterprisedb.com>)
Re: BUG #15548: Unaccent does not remove combining diacritical characters  (Hugh Ranalli <hugh@whtc.ca>)
Список pgsql-hackers
Hi,

After the latest commit in master branch, I was trying to test the python script. Ironically I still see that the
outputfrom the script is completely different from the unaccent.rules file content. Am I missing anything.My testing
includesthe following
 

Downloaded the following files

http://unicode.org/Public/8.0.0/ucd/UnicodeData.txt
 
http://unicode.org/cldr/trac/export/14746/tags/release-34/common/transforms/Latin-ASCII.xml

Executed the below python script

python generate_unaccent_rules.py --unicode-data-file UnicodeData.txt --latin-ascii-file  Latin-ASCII.xml >
unaccent.rules
 

I am using python 3.7.1 and running on Windows 10 Platform

The new status of this patch is: Needs review

В списке pgsql-hackers по дате отправления:

Предыдущее
От: Peter Geoghegan
Дата:
Сообщение: Re: Fixing findDependentObjects()'s dependency on scan order(regressions in DROP diagnostic messages)
Следующее
От: Thomas Munro
Дата:
Сообщение: Re: BUG #15548: Unaccent does not remove combining diacritical characters