UTF-8 in Midgard

The default character encoding in Midgard is [UTF-8][1], which enables the use of non-Western characters through the "utf-8" parser.

To use UTF-8, set the MidgardParser directive in your midgard-data.conf:

MidgardParser utf-8

To migrate an old, latin-1 encoded Midgard database to UTF-8, do the following:

  1. Dump database to repligard format
  2. Change database encoding setting in repligard config to UTF-8
  3. Import database, forcing override of all data
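What the migration does to each stored string can be illustrated with a minimal Python sketch. This is not how repligard is implemented; the function names and the sample value are hypothetical and only show the byte-level effect of re-encoding latin-1 text as UTF-8.

```python
def latin1_to_utf8(raw: bytes) -> bytes:
    """Re-encode bytes stored as ISO-8859-1 (latin-1) into UTF-8."""
    # Every byte value maps to a latin-1 character, so decoding never fails.
    text = raw.decode("latin-1")
    return text.encode("utf-8")


def looks_like_utf8(raw: bytes) -> bool:
    """Return True if the bytes are already valid UTF-8."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Hypothetical record: "café" as stored in a latin-1 database.
old_value = b"caf\xe9"
new_value = latin1_to_utf8(old_value)
print(looks_like_utf8(old_value), looks_like_utf8(new_value))
```

A check like looks_like_utf8 matters because converting data that is already UTF-8 a second time would corrupt every non-ASCII character, so it is worth confirming the old encoding before importing.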

Make sure your Apache configuration does not send iso-8859-1 as the default charset.
This is a common default in Apache configurations, set with the AddDefaultCharset directive.
To set utf-8 as the default charset for your VirtualHosts, use the directive:

AddDefaultCharset utf-8
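To confirm which charset the server actually announces, one option is to inspect the Content-Type response header. The sketch below is a hedged example using only the Python standard library; the helper names and the header strings are illustrative assumptions, and the real header value would come from your own server's response.

```python
from email.message import Message
from typing import Optional


def charset_of(content_type: str) -> Optional[str]:
    """Extract the charset parameter from a Content-Type header value."""
    msg = Message()
    msg["Content-Type"] = content_type
    # Returns the charset in lower case, or None if no charset is present.
    return msg.get_content_charset()


def sends_utf8(content_type: str) -> bool:
    """Return True if the header announces UTF-8."""
    return charset_of(content_type) == "utf-8"


# Illustrative header values, not captured from a real server.
print(sends_utf8("text/html; charset=UTF-8"))
print(sends_utf8("text/html; charset=ISO-8859-1"))
```

If the header still reports iso-8859-1 after the directive is set, another AddDefaultCharset elsewhere in the configuration may be taking precedence.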

In old Midgard versions, the UTF-8 parser was known as the Russian parser because it was initially developed in 1999 for Russian language support.
