Subject | Unicode_FSS internals (was Re: Unique Index with Unicode_FSS) |
---|---|
Author | peter_jacobi.rm |
Post date | 2003-07-08T21:03Z |
Hi Helen, All,
--- In firebird-support@yahoogroups.com, Helen Borrie wrote:
> 3 bytes per char
This reminds me to ask about the dirty details
of UNICODE_FSS.
Am I right in assuming, that this is UTF-8?
But, for rechnical reasons, Firebird will always
allocate everything (indices, wire data, storage, ...)
as if each character alwyas takes three bytes, in contrast
to UTF-8 as designed, which uses a variable number of
bytes.
So all advantages of UTF-8 are gone?
What about a character set UTF-16BE instead?
As I understand there are internal reasons for not
supporting 'wide character' character representations -
but you can define UTF-16BE as a rather strange MB character
set, which happens to use always 2 bytes per character.
Regards,
Peter Jacobi