Posted on: 2009/4/24 14:28
Re: utf-8 and portuguese
Encoding is a big problem for legacy sites in European languages (e.g. French, German, ...), which use a lot of accented characters (byte values 128 - 255, outside the 7-bit ASCII range). For the time being, users with such sites are advised to leave their site and database in the original encoding. Conversion is not straightforward because of the many ISO variants in use. On top of that, the database tables are often cluttered with characters from Windows character sets.
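To illustrate why the conversion cannot be done blindly, here is a small Python sketch (not part of any XOOPS tooling; the function name is made up for this post) that decodes the same byte under three common legacy encodings. The choice of iso-8859-1, iso-8859-15 and cp1252 is my assumption about what a typical Western European site might contain.

```python
# Sketch: the same byte means different things in different legacy encodings.
# The codec list is an assumption; adjust it to the encodings your site used.
LEGACY_CODECS = ("iso-8859-1", "iso-8859-15", "cp1252")


def decode_variants(raw, codecs=LEGACY_CODECS):
    """Return a dict mapping each codec name to the text it yields for raw."""
    # errors="replace" keeps the sketch from raising on bytes a codec leaves undefined
    return {codec: raw.decode(codec, errors="replace") for codec in codecs}


if __name__ == "__main__":
    for sample in (b"\xa4", b"\x80"):
        for codec, text in decode_variants(sample).items():
            print(sample, codec, repr(text))
```

Byte 0xA4 is the euro sign in iso-8859-15 but the generic currency sign in iso-8859-1 and cp1252, while 0x80 is the euro sign only in cp1252. So you really have to know (or guess) the source encoding per table or file before converting.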
For the language files, there is a need to provide two encodings for the same language. The conversion (from ANSI to UTF-8 without BOM) can easily be done with e.g. Notepad++. This is needed for all language definitions and templates.
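For those who prefer a script over doing it by hand in Notepad++, the same conversion could be sketched in Python roughly like this. It assumes the source files are in Windows-1252 (which is what Notepad++ usually means by "ANSI" on a Western European system); the function name and the default encoding are my own choices, not something from XOOPS.

```python
from pathlib import Path


def convert_to_utf8(src, dst, source_encoding="cp1252"):
    """Re-encode one file from a legacy encoding to UTF-8 without BOM."""
    src, dst = Path(src), Path(dst)
    # Work on raw bytes so line endings are left exactly as they are.
    # A strict decode raises UnicodeDecodeError if the guess about the
    # source encoding is wrong for some byte, which is a useful warning.
    text = src.read_bytes().decode(source_encoding)
    dst.parent.mkdir(parents=True, exist_ok=True)
    # Python's "utf-8" codec never writes a BOM ("utf-8-sig" would).
    dst.write_bytes(text.encode("utf-8"))
```

Writing to a separate destination keeps the original file untouched, so both encodings stay available side by side.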
There is no need to rename the language directories themselves as Trabis suggests, but the archives should contain two extra directories in the root: iso and utf, each of which will then contain the normal directory tree with all the language files.
A partial example for the mrbs module and French could be:
By preference, the module itself contains the UTF-8 language files by default. When the other encoding is needed, a simple copy or upload is enough to overwrite all relevant files.
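The copy step could be sketched as follows, assuming the archive has been unpacked and has an iso (or utf) directory in its root that mirrors the normal directory tree. The function and parameter names are hypothetical, and a plain FTP upload or file manager copy does the same job.

```python
import shutil
from pathlib import Path


def overlay_encoding(archive_root, site_root, encoding_dir="iso"):
    """Copy every file under archive_root/encoding_dir over the same
    relative path in site_root, overwriting existing files."""
    source = Path(archive_root) / encoding_dir
    copied = []
    for path in sorted(source.rglob("*")):
        if path.is_file():
            target = Path(site_root) / path.relative_to(source)
            target.parent.mkdir(parents=True, exist_ok=True)
            shutil.copy2(path, target)  # replaces the default-encoding file
            copied.append(target)
    return copied
```

Only files present in the chosen encoding directory are touched, so the rest of the installation is left alone.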