Posts by Sam*_*mbo

Screen scraping

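Hi, I'm trying to implement a screen-scraping scheme on my website and have set up the following so far. What I ultimately want to do is replace every link in the $results variable that contains "ResultsDetails.aspx?" with "results-scrape-details/" and then output the result. Can anyone point me in the right direction?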

<?php 
$url = "http://mysite:90/Testing/label/stuff/ResultsIndex.aspx";
$raw = file_get_contents($url);
$newlines = array("\t","\n","\r","\x20\x20","\0","\x0B");
$content = str_replace($newlines, "", html_entity_decode($raw));
$start = strpos($content,"<div id='pageBack'");
$end = strpos($content,'</body>',$start) + 6;
$results = substr($content,$start,$end-$start);
$pattern = 'ResultsDetails.aspx?';
$replacement = 'results-scrape-details/';
preg_replace($pattern, $replacement, $results);
echo $results;
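A minimal sketch of one way the replacement could be made to work, assuming the goal is a plain literal substitution. In the code above, preg_replace is given a pattern without regex delimiters and its return value is discarded, so $results is echoed unchanged. The helper names rewrite_detail_links and rewrite_detail_links_regex and the sample HTML are hypothetical and not part of the original post.

```php
<?php
// Hypothetical helper (not from the original post): swaps the literal text
// "ResultsDetails.aspx?" for "results-scrape-details/" in an HTML fragment.
function rewrite_detail_links(string $html): string
{
    // str_replace does a plain substring replacement and returns a new string.
    // It does not modify its argument, so the caller must capture the result.
    return str_replace('ResultsDetails.aspx?', 'results-scrape-details/', $html);
}

// Regex variant, only needed if the match later has to become a real pattern.
// preg_replace requires delimiters, and "." and "?" must be escaped to be literal.
function rewrite_detail_links_regex(string $html): string
{
    $pattern = '/' . preg_quote('ResultsDetails.aspx?', '/') . '/';
    // preg_replace returns null on error; fall back to the untouched input.
    return preg_replace($pattern, 'results-scrape-details/', $html) ?? $html;
}

// Sample input invented for illustration only.
$results = '<a href="ResultsDetails.aspx?id=7">Details</a>';
$results = rewrite_detail_links($results); // assign the returned string
echo $results, "\n";
```

In the original script this would amount to replacing the preg_replace line with `$results = str_replace($pattern, $replacement, $results);`. Separately, `'</body>'` is 7 characters long, so adding 6 to its position likely cuts off the final `>` of the extracted fragment.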

php screen-scraping screen

2
Upvotes
1
Answers
581
Views
