[ksh93-integration-discuss] [i18n-discuss] ksh93 "printf" builtin vs. CR #6558816("printfvariants behaving incorrectly for multibyte decimal point") ...
Roland Mainz
roland.mainz at nrubsig.org
Wed Jan 16 19:29:22 PST 2008
Ienup Sung wrote:
> Roland Mainz wrote at 01/16/08 17:09:
> > Who is the locale owner ?
>
> That person left. Now the locales have been inherited by
> European Globalization Center.
There is no specific engineer assigned to it, right ?
> >> experiencing problems mentioned in the noted
> >> CR 6558816 and other bugs
> >
> > What are the CR #-numbers of the other bugs ?
>
> If you go to http://bugs.opensolaris.org/ , type in 6538608, and hit search,
> there will be three related bugs showing up. The Related Bugs field of
> 6558816 also has two of them. I was talking about them.
Ok... thanks! :-)
> > Slightly offtopic: It seems that CDE's dtterm default font in the
> > en_US.UTF-8 locale doesn't have a glyph available for U+066C ... ;-(
>
> Please file a bug. Either we might not have bought the glyph from Monotype
> or that might have been missed or incorrectly mapped in the ttmap file for
> the Monotype fonts and in that case, the bug filing might be the start of
> fixing that problem. If we didn't buy the glyph, then, Sun would have to
> buy the glyph from Monotype to have the support of that particular glyph.
Done, see CR #6651490 ('dtterm/X11 can't display U+066C ("ARABIC
THOUSANDS SEPARATOR")').
> >> To me, probably the best compromise might have been having ASCII period for
> >> decimal_point and an empty string for thousands_sep.
> >
> > What about extending this compromise (refining my proposal from
> > http://mail.opensolaris.org/pipermail/ksh93-integration-discuss/2008-January/005846.html
> > a bit) a bit (AFAIK the idea of multibyte charcters for { decimal_point,
> > thousands_se, etc } sounds intesting but I can also see the issues that
> > non-multibyte aware applications won't like this)):
> > 1) Create "ar_SA.UTF-8 at ascii_numeric" (uses ASCII characters for
> > |decimal_point| and |thousands_sep|)
> > 2) Create "ar_SA.UTF-8 at arabic_numeric" (uses multibyte characters (to
> > represent these characters as arabic (multibyte) characters) for
> > |decimal_point| and |thousands_sep|)
> > 3) Make "ar_SA.UTF-8" an alias to "ar_SA.UTF-8 at ascii_numeric" for now
>
> IMO, it might be just too much to have variants; I think that sticking with
> a single byte definition for those locale data might be just good enough
> considering the current status of the standard and implementation.
Erm... technically it would be only one variant (=
"ar_SA.UTF-8 at arabic_numeric"), the other stuff (=
"ar_SA.UTF-8 at ascii_numeric" and "ar_SA.UTF-8") is to define a way to
explicily specify the behaviour ("ar_SA.UTF-8 at ascii_numeric") and an
alias which defines the default ("ar_SA.UTF-8", which either maps to
"ar_SA.UTF-8 at ascii_numeric" or "ar_SA.UTF-8 at arabic_numeric").
The issue is that without something like "ar_SA.UTF-8 at arabic_numeric" we
can't really start making any fixes/adjustments... that's why I've
proposed the "ar_SA.UTF-8 at arabic_numeric"-idea above (otherwise we have
a chicken&&egg-like problem).
> (There are quite well known areas of improvement in the POSIX standard in
> terms of I18N support by I18N experts if you compare the support to other
> competing platforms and software components and let me just put it that way
> for now.)
Do you have a list somewhere ?
> If you're willing to contribute the locales as you proposed, I think that
> g11n-ar-discuss and i18n-discuss can be used for further discussion on
> the contribution.
Erm... AFAIK the locale data aren't opensource, right (the contribution
itself should be easy, AFAIK only some extra entries in the build
system+alias list are neccesary since the remaining parts of the "new"
locale are based on the current "ar_SA.UTF-8") ?
----
Bye,
Roland
--
__ . . __
(o.\ \/ /.o) roland.mainz at nrubsig.org
\__\/\/__/ MPEG specialist, C&&JAVA&&Sun&&Unix programmer
/O /==\ O\ TEL +49 641 7950090
(;O/ \/ \O;)
More information about the ksh93-integration-discuss
mailing list