Instructions for adding a new Unicode collation
By Alexander Barkov and Peter Gulutzan Copyright (c) 2007 by MySQL AB All rights reserved
[ Previously published as a note on a bug report.]
MySQL offers 42 built-in collations (sets of rules for comparison and sorting) for the Unicode character sets utf8 and ucs2. But you can add your own. We will describe here how to add Vietnamese Unicode collations. This is intended as an example, so that you can try out all the steps and then modify based on your own rules for some other language. We chose Vietnamse because many people asked for it via our "Feature Request" option in bugs.mysql.com, http://bugs.mysql.com/bug.php?id=4745. There are two ways to add a new collation: * Change the source code. This is the trickier way, but it's the only way for pre-August-2007 versions. * Change the LDML "markup" file. This hasn't been tested much, but it's simpler, it requires no recompilation. (LDML stands for …[Read more]