UTF-8 in Midgard
The default character encoding in Midgard is [UTF-8][1], handled by the "utf-8" parser, which makes it possible to use non-Western characters.
To use UTF-8, set the MidgardParser directive in your midgard-data.conf:
MidgardParser utf-8
To migrate an old, latin-1 encoded Midgard database to UTF-8, do the following:
- Dump database to repligard format
- Change database encoding setting in repligard config to UTF-8
- Import database, forcing override of all data
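The steps above leave the conversion itself to repligard. As a rough illustration of what such a migration means at the byte level, here is a minimal Python sketch. It is not part of Midgard or repligard; the function names `latin1_to_utf8` and `is_valid_utf8` are hypothetical helpers introduced only for this example.

```python
def latin1_to_utf8(raw: bytes) -> bytes:
    """Re-encode bytes stored as latin-1 (ISO-8859-1) into UTF-8."""
    # Every byte value is a valid latin-1 character, so decoding never fails.
    return raw.decode("latin-1").encode("utf-8")


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# Hypothetical sample value as an old latin-1 database might have stored it.
legacy = "K\u00e4se".encode("latin-1")   # a-umlaut is the single byte 0xE4
migrated = latin1_to_utf8(legacy)        # a-umlaut becomes the bytes 0xC3 0xA4

print(is_valid_utf8(legacy), is_valid_utf8(migrated))
```

A check like `is_valid_utf8` can be handy after the import for spotting values that were not converted, since latin-1 bytes above 0x7F are usually not valid UTF-8 on their own.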
Make sure your Apache configuration file doesn't send iso-8859-1 as the default charset.
A default Apache configuration may set this with the AddDefaultCharset directive.
To set UTF-8 as the default charset for your VirtualHosts, use the directive:
AddDefaultCharset utf-8
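To confirm which charset the server actually announces, the Content-Type response header can be inspected. The following is a minimal sketch using only the Python standard library; `charset_of` and `served_charset` are hypothetical helper names, and the URL passed to `served_charset` is whatever address your own Midgard site answers on.

```python
from email.message import Message
from typing import Optional
from urllib.request import urlopen


def charset_of(content_type: str) -> Optional[str]:
    """Extract the lowercased charset parameter from a Content-Type value."""
    msg = Message()
    msg["Content-Type"] = content_type
    return msg.get_content_charset()


def served_charset(url: str) -> Optional[str]:
    """Fetch a URL and report the charset named in its Content-Type header."""
    with urlopen(url) as response:
        return response.headers.get_content_charset()


# Header values as Apache might send them before and after the change.
print(charset_of("text/html; charset=ISO-8859-1"))
print(charset_of("text/html; charset=utf-8"))
```

If `served_charset` still reports iso-8859-1 after the directive has been changed, another AddDefaultCharset line elsewhere in the Apache configuration may be taking precedence.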
In old Midgard versions, the UTF-8 parser was known as the Russian parser because it was initially developed in 1999 for Russian language support.
