Magnus Hagander wrote:
> > Here's what I see on our FreeBSD server:
> >
> > $ iconv -l|grep -i 2312
> > CHINESE GB_2312-80 ISO-IR-58 CSISO58GB231280
> > CN-GB EUC-CN EUCCN GB2312 CSGB2312
> > HZ HZ-GB-2312
> >
> > I see "GB2312". What does the "-80" mean in "gb_2312-80"?
> > Can we just remove the "-80"?
>
> Hmm. Weird. Mine also shows gb_2312-80 :-) I guess I jumped the gun a
> bit.
>
> Ok. Checking further, it seems the file contains characters that are not
> gb_2312-80. Try a simple:
> iconv -f gb_2312-80 -t utf-8 FAQ_chinese.html > /dev/null
>
> and it'll show you an error.
>
>
> Whereas:
> iconv -f euc-jp -t utf-8 FAQ_japanese.html > /dev/null
>
> works just fine.

OK, here's where we get stuck. I CC'ed the two original Chinese posters,
so let's hope they can address it.
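
In case it helps whoever picks this up: the iconv test quoted above only
says that the conversion fails, not where. Below is a minimal sketch, not a
definitive tool, that reports the byte offset of the first sequence that
does not decode. The function names and the script name are hypothetical.
It also assumes Python's "gb2312" codec is an acceptable stand-in; that
codec implements the 8-bit EUC-CN form, which the iconv listing above puts
on the "EUC-CN ... GB2312" line, separate from "GB_2312-80", so the two
tools may not agree on every file.

```python
import sys


def first_decode_error(data, encoding):
    """Return (offset, offending_bytes) for the first undecodable
    sequence in data, or None if everything decodes cleanly."""
    try:
        data.decode(encoding)
    except UnicodeDecodeError as exc:
        # exc.start and exc.end delimit the bytes the codec rejected.
        return exc.start, data[exc.start:exc.end]
    return None


def check_file(path, encoding):
    """Read a file as raw bytes and run the same check on its contents."""
    with open(path, "rb") as handle:
        return first_decode_error(handle.read(), encoding)


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical invocation:
    #   python check_encoding.py FAQ_chinese.html gb2312
    problem = check_file(sys.argv[1], sys.argv[2])
    if problem is None:
        print("decodes cleanly as", sys.argv[2])
    else:
        offset, raw = problem
        print("first bad sequence at byte", offset, repr(raw))
```

Running it once per candidate encoding name on the same file should show
whether one of them gets through the whole file, and the offset gives a
place to look in the file if none does.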
--
Bruce Momjian | http://candle.pha.pa.us
pgman@candle.pha.pa.us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073