Subject | Re: Firebird and unicode |
---|---|
Author | hay77772000 |
Post date | 2003-11-20T21:37:06Z |
Hi Peter,
I have finally narrowed down what our immediate requirements - to
support simplified Chinese, and Korean.
I've been doing a lot of digging, but getting pretty confused! Can
Firebird handle this? Is there any problem interacting with
Firebird through Java (doesn't Java support UCS-2?)?
Many thanks for any light you can shed!!
David
--- In firebird-support@yahoogroups.com, "Peter Jacobi"
<peter_jacobi@g...> wrote:
> our UTF-8 is strictly speaking CESUErrr...sorry to be dumb, but what is CESU?
I have finally narrowed down what our immediate requirements - to
support simplified Chinese, and Korean.
I've been doing a lot of digging, but getting pretty confused! Can
Firebird handle this? Is there any problem interacting with
Firebird through Java (doesn't Java support UCS-2?)?
Many thanks for any light you can shed!!
David
--- In firebird-support@yahoogroups.com, "Peter Jacobi"
<peter_jacobi@g...> wrote:
> Josef Gschwendtner wrote:problematic" mean?
> > What does the statement "use of non BMP characters is
> > In the Unicode-glossary I read "BMP-character: A Unicode encodedcharacter
> > having a BMP code point."The first
> >
> > What kind of characters are that?
> > What do I have think of as a developer?
>
> Unicode has now settled to include about 17 * 65536 characters.
> 65536 characters are the BMP, and were considered the ones,practically
> thinking people have to care about, with the other spacesavailable for
> Klingon, Hieroglyphs and Linear B.Chinese
>
> In the meantime I've learned, that vast amounts of new ideographic
> characters, allocated outside the BMP, are in fact used by newer
> Chinese stanards (like GB18030 if I got the number right) and by
> Government Order new software not supporting these characters mustnot
> be sold in China. This forced Microsoft to include support for themstill
> in XP.
>
> So, you see, if you are not targetting the Chinese market, you may
> ignore them.imply
>
> Firebird sort-of-support for them is by surrogate pairs, which
> that the character counts start to get wrong once you use them andthat
> our UTF-8 is strictly speaking CESU.
>
> Regards,
> Peter Jacobi
>
> --
> NEU FÜR ALLE - GMX MediaCenter - für Fotos, Musik, Dateien...
> Fotoalbum, File Sharing, MMS, Multimedia-Gruß, GMX FotoService
>
> Jetzt kostenlos anmelden unter http://www.gmx.net
>
> +++ GMX - die erste Adresse für Mail, Message, More! +++