[ksh93-integration-discuss] Re: Some comments/questions about ksh93_20060831...

Glenn Fowler gsf at research.att.com
Fri Sep 8 20:27:57 PDT 2006


On Fri, 08 Sep 2006 22:50:38 +0200 Roland Mainz wrote:
> Glenn Fowler wrote:
> [CC:'ing Werner Fink (to take a look at the GNU/Linux issue here) and
> Ienup Sung (because it's i18n-related)]
> > On Sun, 3 Sep 2006 01:34:30 +0200 Roland Mainz wrote:
> > > I have a few questions/comments about ksh93_2006083:
> > > 1. On SuSE Linux x86 all multibyte characters, even german "öäü" are
> > > broken in "gmacs" and "emacs" editing ... ;-((
> > > (the code has been compiled with % (LANG=C LC_ALL=C CC="gcc
> > > -I${PWD}/arch/linux.i386/src/lib/libast" CCFLAGS="-fPIC
> > > -fno-strict-aliasing -g -pipe -Wall -Wno-missing-braces
> > > -Wno-unknown-pragmas -Wno-parentheses -Wno-uninitialized
> > > -D_map_libc=1" ./bin/package make) 2>&1 | tee -a buildlog.log # ... I
> > > hope the "-I${PWD}/arch/linux.i386/src/lib/libast"-hack isn't causing
> > > this problem).

> Ok... thanks! :-)
> ... but note that I hit the problem with LANG=en_US.UTF-8 and
> LC_ALL=en_US.UTF-8 - that's a multibyte locale while "de" is usually a
> link to "de_DE.ISO8859-1" or "de_DE.ISO8859-15" (which are both
> single-byte locales in both Solaris and Linux) ... ;-(

this is where I have a disconnect with utf8 vs 8bit vs 7bit locales
you set LC_ALL to utf8 for a ksh
you mouse snarf the 8bit german chars above and paste into ksh edit mode
how does the edit sw know that the snarfed text is 3 8bit chars
and not some utf8 encoding?

-- Glenn Fowler -- AT&T Research, Florham Park NJ --




More information about the ksh93-integration-discuss mailing list