我有一个HTML文件,我使用curl下载并插入到字符串中.HTML文件有很多内容,但我希望解析文档的某个部分并将此部分插入到数组中.这个棘手的部分是,我试图解析部分没有HTML,它是在JavaScript代码块:
<!-- script block -->
<script type="text/javascript" src="//external.site.com/76b07.js"></script>
<script>....code.....
"235533":{"itemId":"235533","type":"0","image":{"url":"thispic.jpg"}:"summary":"This Item"},
"235534":{"itemId":"235534","type":"1","image":{"url":"thisotherpic.jpg"}:"summary":"This Other Item"},
</script>
Run Code Online (Sandbox Code Playgroud)
如何将项目信息作为数组导入?:
$array = array( "itemId" => "235533", "type" => "0", "image" => "thispic.jpg", "summary" =>"This Item" );
Run Code Online (Sandbox Code Playgroud)
您可以使用RegExp匹配标记"....":{....}
之间的位置<script>
.您感兴趣的字符串是JSON变量.
一旦你在字符串中有每个json变量,你可以试试json_decode()
$json_string = '"235533":{"itemId":"235533","type":"0","image":{"url":"thispic.jpg"}:"summary":"This Item"}';
$json = json_decode($json_string);
$myArray = (array)$json;
Run Code Online (Sandbox Code Playgroud)