我正在尝试从csv文件中读取数据,并将字符编码指定为UTF-8。通过阅读?read.csv()指令,似乎将fileEncoding设置为UTF-8即可完成此操作,但是,在检查时我看不到这一点。导入数据时,是否有更好的方法将字符串编码指定为UTF-8?
样本数据:
fruit<- read.csv("fruit.csv", header = TRUE, fileEncoding = "UTF-8")
fruit[] <- lapply(fruit, as.character)
Encoding(fruit$Fruit)
Run Code Online (Sandbox Code Playgroud)
输出是“未知”,但我希望它是“ UTF-8”。确保所有导入的字符均为UTF-8的最佳方法是什么?谢谢。
fruit <- read.csv("fruit.csv", header = TRUE)
fruit[] <- lapply(fruit, as.character)
fruit$Fruit <- paste0(fruit$Fruit, "\xfcmlaut") # Get non-ASCII char and jam it in!
Encoding(fruit$Fruit)
Run Code Online (Sandbox Code Playgroud)
[1]“ latin1”“ latin1”“ latin1”
fruit$Fruit <- enc2utf8(fruit$Fruit)
Encoding(fruit$Fruit)
Run Code Online (Sandbox Code Playgroud)
[1]“ UTF-8”“ UTF-8”“ UTF-8”