Subject | Re: [Firebird-Architect] Re: UTF-8 vs UTF-16 |
---|---|
Author | Dimitry Sibiryakov |
Post date | 2003-08-27T05:43:53Z |
On 26 Aug 2003 at 12:44, mailmur wrote:
character (accented or not - doesn't matter) with the same binary
representation in UNICODE (can't say better, sorry) depend on
language where it is used?
For example: nobody cares if "A with two dots" (0xXXXX) comes after
"Z" or "A" but if it comes after "Z" in Finland and after "A" in
French - that's a problem.
If it is true (David said - yes and I believe him), the concept of
collation as an attribute of column must be saved even in UNICODE-
only engine. This adds extra complexity and as a result - slowness.
SY, Dimitry Sibiryakov.
>In Finland: "A" and "O" is sorted as usual. But then same letters withActually my question was a bit different: does sorting order of a
>two dots on top, "Д" and "Ц", is sorted to the end. (a with ring is a
>swedish-O) ....,X,Y,Z,Е,Д,Ц
>
>So, probably even some accent chars are not equal to regular
>counterparts in some language.
character (accented or not - doesn't matter) with the same binary
representation in UNICODE (can't say better, sorry) depend on
language where it is used?
For example: nobody cares if "A with two dots" (0xXXXX) comes after
"Z" or "A" but if it comes after "Z" in Finland and after "A" in
French - that's a problem.
If it is true (David said - yes and I believe him), the concept of
collation as an attribute of column must be saved even in UNICODE-
only engine. This adds extra complexity and as a result - slowness.
SY, Dimitry Sibiryakov.