在MS-Windows平台下使用UTF-8编码
Windows平台下UTF-8编码是「二等公民」——唯一对UTF-8的支持可能是代码页65001 还会导致命令行工具出现问题
但这不能阻止UTF-8成为国际标准的编码 更不能阻止我们在Windows平台上使用这一编码
对于Emacs 在配置文件里面加入下面的语句可以让文件编码默认是UTF-8 而且结尾是UNIX风格的
1 | (prefer-coding-system 'utf-8-unix) |
但这会带来问题 因为这会使Emacs运行其他程序的命令行也参数也变成UTF-8编码(比如用M-&
运行的程序) 下面的设置能纠正这一问题
1 | (set-default 'process-coding-system-alist |
Update:
关于Windows下使用UTF-8 Emacs的开发者Eli Zaretskii最近的一个值得一读
This works on the Windows XP shell (cmd.exe):
C:> chcp 65001
C:> grep λ *.txt
λfoo(grep.exe is from MSYS2 and the .txt file encoding is utf-8).
IOW, this grep.exe is a Cygwin program (MSYS2 is a fork of Cygwin),
and thus supports UTF-8 locales, like Cygwin does.However, M-x grep from Emacs displays no results (“Grep finished with no
matches found…”).Yes, because the native Windows build of Emacs always encodes the
command-line arguments of programs it invokes using the current system
ANSI codepage. And the system codepage cannot be set to 65001,
because support for that codepage in Windows is half-hearted: for
example, it cannot be used as the codeset in the arguments to
’setlocale’ functions. Which means any native Windows program will be
unable to correctly handle UTF-8 encoded characters, since the data
used by C runtime functions that compare, collate, and otherwise
process characters will not behave as you expect.Is there a method for executing shell commands encoded in utf-8 from
Emacs on Windows?Sadly, no. Not until Windows improves their support for the UTF-8
codepage, such that it becomes a first-class citizen (and then we will
have a not-so-easy job of adapting to that in Emacs). Until then, if
you want UTF-8 communications with subprocesses, use the Cygwin build
of Emacs.
这段回复有以下几个要点:
-
Cygwin程序可以在命令行使用UTF-8
-
原生的Windows程序总会把命令行参数用当前代码页编码 而系统的代码页不能设置成65001也就是不能用UTF-8编码 所以原生的Windows程序没法在命令行使用UTF-8
-
除非微软添加对UTF-8的支持 改变UTF-8是「二等公民」的现状 这种限制会一直存在
或者您可以把评论发在别处,添加指向本页的连接,然后把网址告诉我:
本文标题:在MS-Windows平台下使用UTF-8编码
文章作者:Chris
发布时间:2015-09-24
最后更新:2022-03-23
原始链接:https://chriszheng.science/2015/09/24/Prefer-UTF-8-in-MS-Windows/
版权声明:本博客所有文章除特别声明外,均采用 CC BY 4.0 许可协议。转载请注明出处!
分享
加载更多评论