Iva*_*tov 12 php whitespace space utf-8
在PHP中,获取utf8编码的所有Unicode 空白字符的完整列表(字符串数组)的最优雅方法是什么?
我需要它来生成测试数据.
dev*_*vio 12
此电子邮件包含以UTF-8,UTF-16和HTML编码的所有Unicode空白字符的列表.
编辑
最初回答于2010年2月9日(!).真的,如果信息已经过时,你可以添加自己的答案,而不是抱怨.只需谷歌搜索我的答案中提到的网址,并获得一些代表:
邮件已在这里存档(花了我几秒钟),在介绍中甚至提到了空白表
static $whitespace = array(
    "SPACE" => "\x20",
    "NO-BREAK SPACE" => "\xc2\xa0",
    "OGHAM SPACE MARK" => "\xe1\x9a\x80",
    "EN QUAD" => "\xe2\x80\x80",
    "EM QUAD" => "\xe2\x80\x81",
    "EN SPACE" => "\xe2\x80\x82",
    "EM SPACE" => "\xe2\x80\x83",
    "THREE-PER-EM SPACE" => "\xe2\x80\x84",
    "FOUR-PER-EM SPACE" => "\xe2\x80\x85",
    "SIX-PER-EM SPACE" => "\xe2\x80\x86",
    "FIGURE SPACE" => "\xe2\x80\x87",
    "PUNCTUATION SPACE" => "\xe2\x80\x88",
    "THIN SPACE" => "\xe2\x80\x89",
    "HAIR SPACE" => "\xe2\x80\x8a",
    "ZERO WIDTH SPACE" => "\xe2\x80\x8b",
    "NARROW NO-BREAK SPACE" => "\xe2\x80\xaf",
    "MEDIUM MATHEMATICAL SPACE" => "\xe2\x81\x9f",
    "IDEOGRAPHIC SPACE" => "\xe3\x80\x80",
);
多年以后,在查找unicode空白字符时,这个问题在Google上仍有最佳结果.devio的答案很棒,但不完整.在撰写本文时(2017年10月),维基百科有一个空格字符列表:https://en.wikipedia.org/wiki/Whitespace_character
此列表指定了25个代码点,而当前接受的答案列表为18.包括其他七个代码点,列表为:
U+0009  character tabulation
U+000A  line feed
U+000B  line tabulation
U+000C  form feed
U+000D  carriage return
U+0020  space
U+0085  next line
U+00A0  no-break space
U+1680  ogham space mark
U+180E  mongolian vowel separator
U+2000  en quad
U+2001  em quad
U+2002  en space
U+2003  em space
U+2004  three-per-em space
U+2005  four-per-em space
U+2006  six-per-em space
U+2007  figure space
U+2008  punctuation space
U+2009  thin space
U+200A  hair space
U+200B  zero width space
U+200C  zero width non-joiner
U+200D  zero width joiner
U+2028  line separator
U+2029  paragraph separator
U+202F  narrow no-break space
U+205F  medium mathematical space
U+2060  word joiner
U+3000  ideographic space
U+FEFF  zero width non-breaking space
| 归档时间: | 
 | 
| 查看次数: | 21566 次 | 
| 最近记录: |