我使用下面的代码将我的MySQL数据导出到.CSV文件中.一切正常,但是当我尝试导出这些字母?, š, ?, ?, ž, ý, á, í, é(捷克语字母)时,这些字母将?, ?, ?被导出为?.其他字母输出正常.
你能帮我解决这个问题吗?
<?php
/*******EDIT LINES 3-8*******/
$DB_Server = "xxx"; //MySQL Server
$DB_Username = "xxx"; //MySQL Username
$DB_Password = "xxx"; //MySQL Password
$DB_DBName = "xxx"; //MySQL Database Name
$DB_TBLName = "wp_comments"; //MySQL Table Name
$DB_Query = "comment_author, comment_content"; //MySQL Query (what to select from db, you can use * for all)
$filename = "excelfilename"; //File Name
$filename_columns = array("Autor", "Content"); //File Name of columns
/*******YOU DO NOT NEED TO EDIT ANYTHING BELOW THIS LINE*******/
//headers
header('Pragma: public');
header('Expires: 0');
header('Cache-Control: must-revalidate, post-check=0, pre-check=0');
header('Content-Description: File Transfer');
header('Content-Encoding: UTF-8');
header('Content-Type: text/csv; charset=UTF-8');
header('Content-Disposition: attachment; filename='.$filename.'.csv;');
header('Content-Transfer-Encoding: binary');
//create MySQL connection
mysql_connect($DB_Server,$DB_Username,$DB_Password);
mysql_select_db($DB_DBName);
$sql = "SELECT $DB_Query FROM $DB_TBLName";
$result = mysql_query($sql);
$fh = fopen('php://output', 'w');
$fp = fwrite($fh, $bom =( chr(0xEF) . chr(0xBB) . chr(0xBF) )); // Write UTF-8 BOM
if($fp)
{
fwrite($fh, "sep=\t" . PHP_EOL); // Hint for MS Excel
while($row = mysql_fetch_row($result)) {
fputcsv($fh, $row, "\t");
}
}
fclose($fh);
Run Code Online (Sandbox Code Playgroud)
由于您没有明确设置数据库连接的编码,因此libmysql将使用编译的默认编码(通常为latin1).在将结果集转码为该字符集时,MySQL会替换它无法表示的任何字符?.
为了避免这种情况,您应该mysql_set_charset('utf8')在打开数据库连接后进行调用- 请参阅UTF-8.
也就是说,你真的不应该使用ext/mysql:它现在已被弃用,并且手册已经包含警告,反对在新代码中使用它近三年.请考虑使用MySQLi或PDO.
最后,如果MySQL服务器与PHP在同一台机器上并且您有FILE权限,那么为什么不避免将数据完全交给PHP并简单地使用MySQL的SELECT ... INTO OUTFILE命令来生成输出文件?
//create MySQL connection
$DB_DSN = "mysql:host=$DB_Server;dbname=$DB_DBName;charset=utf8";
new PDO($DB_DSN, $DB_Username, $DB_Password)->exec("
SELECT $DB_Query
INTO OUTFILE '/tmp/$filename.tsv'
CHARACTER SET utf8
FROM $DB_TBLName
");
echo "\xef\xbb\xbf" // Write UTF-8 BOM
, "sep=\t", PHP_EOL; // Hint for MS Excel
readfile("/tmp/$filename.tsv");
Run Code Online (Sandbox Code Playgroud)
请注意,您可能需要确保并发进程未使用临时文件.
PS:当字段分隔符是逗号字符时,格式仅称为CSV("逗号分隔值"); 当使用制表符作为字段分隔符时,格式更正确地称为TSV("制表符分隔值"),并且应具有.tsv或.tab扩展名.