Subject Re: [firebird-support] Why does 'é' (e with cute accent Unicode 0xE9
Author Adriano dos Santos Fernandes
Olivier Mascia escreveu:
> Hello,
>
> This code point '=E9' (unicode E9) uses 2 octets in UTF-8.
> Is it different with Firebird? (2.x with x >=3D1)
> Would it mean than Firebird UTF8 actually means 3-bytes per char=20=20
> UNICODE_FSS with ony the logical length taken into account?
>
>
> Using a WIN1252 connection to a DEFAULT CHARACTER SET UTF8 database:
> INSERT INTO MYTABLE(NAME) VALUES('=E9=E9=E9=E9=E9'); // 5 times letter =
> =E9
>
> Using a UTF8 connection to that database:
> SELECT * FROM MYTABLE;
> gets me 15 bytes.
> Actually 5 times the following three bytes D4C7DC in hex.
>
>
> In my book I would have get 5 times C3A9 in hex which is *the* UTF8=20=20
> representation of E9.
This is what I get in V2.1.0 and HEAD.

> So what is this UTF8 thing in Firebird parlance?
>
I think by some reason you're not sending the character encoded in WIN1252.


Adriano