Using str_word_count with UTF-8 text

I have this text:

$text  = "Başka, küskün otomobil kaçtı buraya küskün otomobil neden kaçtı
          kaçtı buraya, oraya KISMEN @here #there J.J.Johanson hep.
          Danny:Where is mom? I don't know! Café est weiß for 2 €uros.
          My 2nd nickname is mike18.";

Lately I have been using this:

$a1 = array_count_values(str_word_count($text, 1, 'ÇçÖöŞşİIıĞğÜü@#é߀1234567890'));
arsort($a1);

You can check this fiddle: http://ideone.com/oVUGYa

But this solution does not solve every UTF-8 problem: I cannot pass the entire UTF-8 character set to str_word_count as the charlist argument.

So I wrote this:

$wordsArray = explode(" ",$text);
foreach ($wordsArray as $k => $w) {
    $wordsArray[$k] = str_replace(array(",","."),"",$w);
}
$wordsArray2 = array_count_values($wordsArray);
arsort($wordsArray2);

The output should look like this:

Array (
 [kaçtı] => 3
 [küskün] => 2
 [buraya] …
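One possible direction, sketched here rather than taken from the original post: instead of listing characters for str_word_count, match words with PCRE Unicode properties, so that any letter (`\p{L}`) or digit (`\p{N}`) counts as part of a word. The function name `countWordsUtf8` and its `$extraChars` parameter are hypothetical. The sketch assumes the source file and the input are valid UTF-8, and that `@`, `#`, `€` and the apostrophe should stay attached to words as in the sample text.

```php
<?php
// Hypothetical helper (not from the original post): counts words in UTF-8 text.
function countWordsUtf8(string $text, string $extraChars = "@#€'"): array
{
    // \p{L} matches any Unicode letter, \p{N} any Unicode number character.
    // The /u modifier makes PCRE treat pattern and subject as UTF-8.
    $pattern = '/[\p{L}\p{N}' . preg_quote($extraChars, '/') . ']+/u';

    // Collect every run of word characters; $matches[0] holds the full matches.
    preg_match_all($pattern, $text, $matches);

    // Count occurrences (case-sensitive) and sort by count, highest first.
    $counts = array_count_values($matches[0]);
    arsort($counts);

    return $counts;
}

$text = "Başka, küskün otomobil kaçtı buraya küskün otomobil neden kaçtı
         kaçtı buraya, oraya KISMEN @here #there J.J.Johanson hep.
         Danny:Where is mom? I don't know! Café est weiß for 2 €uros.
         My 2nd nickname is mike18.";

print_r(countWordsUtf8($text));
```

With this pattern, commas, periods, colons and line breaks all act as separators, so "J.J.Johanson" is split into "J", "J" and "Johanson". Adding "." to `$extraChars` would keep it together, but would also glue sentence-ending periods onto words, so that choice depends on the data.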

php utf-8 text-manipulation

12 votes, 2 answers, 3572 views