Hungarian Károli Bible - double acute accented vowels - encoding issue
The Hungarian Károli Bible has four incorrectly encoded characters for what should be the double acute accented vowels. These are the "ő" and "ű" characters, and capitals "Ő" and "Ű". They require correcting by the following replacements:
õ -> ő
û -> ű
Õ -> Ő
Û -> Ű
Using TextPipe, I have already done the necessary search and replace operation in my own downloaded copy of hu.xml
The character counts were (respectively):
32206
3750
507
7
The same electronic text was used as found in the Unbound Bible project of Biola University. The ThML file at CCEL was derived from this source. It is most likely that the vast majority of electronic copies of the Károli Bible on the worldwide web have the same problem.
The incorrect encodings probably arose from the use of a legacy font, in which the double acute accented vowels were represented by ANSII codes that are usually reserved for Latin characters not required by Hungarian. This must all date back to before the file conversion to Unicode.
For further background reading, please see
http://en.wikipedia.org/wiki/Hungarian_alphabet
http://en.wikipedia.org/wiki/Double_acute_accent
David Haslam
