Subject Re: [Firebird-Architect] UTF-8 and Compression
Author Dimitry Sibiryakov
On 1 Mar 2005 at 8:22, Pavel Cisar wrote:

>Pardon my ignorance, but doesn't the decoupling of the character set
>from collation cause problems? I mean that collation would handle only
>a subset of UTF-8 for particular charset.

If we have a universal character set we can also have a universal
collation and special collations for some languages. I mean a base
collation class that handle everything (including Czech and Chinese)
and some derived classes for languages with special characters
ordering/processing.

>would break that, wouldn't be possible to store for example Czech and
>Chinese characters in field and then ask for Czech collation which
>wouldn't handle Chinese with odd results? Well, Czech&Chinese example
>is a little bit stretched, but you see the point.

Because Czech and Chinese characters don't cross they can be
handled independently. Do you see a problem if Czech and Chinese
strings are sorted properly but all Chinese is placed after all
Czech?
This case is not different from current situation when russian (and
I quess Czech) characters are sorted after all latin.
--
SY, Dimitry Sibiryakov.