干草,我似乎无法在网上找到任何正则表达式删除
<h1></h1>
Run Code Online (Sandbox Code Playgroud)
标签(及其内容).
任何人都可以帮助你.
不要使用正则表达式,使用像PHP Simple HTML DOM这样的工具.
// Construct dom from string
$dom = str_get_html($html);
// ...or construct dom from file/url
$dom = file_get_html($path);
// strip h1 tags (and their content)
foreach ($dom->find('h1') as $node) {
$node->outertext = '';
}
Run Code Online (Sandbox Code Playgroud)
preg_replace('@<h1[^>]*?>.*?<\/h1>@si', '', $htmlsource);
Run Code Online (Sandbox Code Playgroud)
您还可以使用PHP的DOM扩展模块:
$domDocument = new DOMDocument;
$domDocument->loadHTMLFile('http://example.com');
$domNodeList = $domDocument->getElementsByTagname('h1');
$domElemsToRemove = array();
foreach ($domNodeList as $domElement) {
$domElemsToRemove[] = $domElement;
}
foreach($domElemsToRemove as $domElement) {
$domElement->parentNode->removeChild($domElement);
}
var_dump($domDocument->saveHTML());
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
6910 次 |
| 最近记录: |