use*_*240 3 perl multithreading
请帮我让我的脚本多线程.我已经阅读了该threads::shared模块的文档,但它没有帮助理解如何做到这一点
use threads;
use threads::shared;
use LWP::UserAgent;
use HTTP::Cookies;
my $NUM_WORKERS = 2;
sub worker {
my ($i) = @_;
my ($web, $ck) = browser();
($username, $password) = split ':', $acc;
my $url = 'http://www.site.ru/?tkn'. int(rand(10000));
my $response = $web->post($url, Content =>
[//////]);
while(1)
{
my $url = 'http://www.site.ru/dk?st.page='.$i.'&st.name=%D0%B0';
my $response = $web->get($url);
@list = ($response->content =~ /card_wrp"><div class="photoWrapper"><div><a href="\/(.*?)\?/g);
@popl = ($response->content =~/<\/div><div class="info">(.*?)<\/div>/g);
for ($j = 0; $j <= scalar @list - 1; $j++)
{
$popl[$j] =~ s/ //g;
open F, ">>gr.txt";
print F $list[$j].':'.$popl[$j]. "\n";
close F;
}
print "[+] Page $i \n";
}
}
my $i :shared = 1;
my $last = 79265;
my @workers;
for (1..$NUM_WORKERS) {
push @workers, async {
while (1) {
my $i;
{
lock $I;
return if $i == $last;
$i = ++$I;
}
worker($i);
}
};
}
$_->join() for @workers;
sub browser
{
my $web = new LWP::UserAgent;
my $ck = new HTTP::Cookies;
$web->cookie_jar($ck);
$web->agent('Opera/9.80 (Windows 7; U; en) Presto/2.9.168 Version/11.50');
$web->requests_redirectable(0);
$web->timeout(5);
return $web, $ck;
}
sub loadf {
open (F, "<".$_[0]) or erroropen($_[0]);
chomp(my @data = <F>);
close F;
return @data;
}
Run Code Online (Sandbox Code Playgroud)
我无法理解我需要分享的变量.非常感谢所有能帮助我的会员
如果你没有线程,工作循环看起来就像
for my $i (1..79265) {
worker($i);
}
Run Code Online (Sandbox Code Playgroud)
问题是该变量不可共享并for维持一个无法共享 的内部状态,因此我们需要将其重写为没有这些问题的内容.
选项1:
my @a = 1..79265;
while (@a) {
worker(shift(@a));
}
Run Code Online (Sandbox Code Playgroud)
选项2:
my $i = 0;
while (++$i <= 79265) {
worker($i);
}
Run Code Online (Sandbox Code Playgroud)
并行化任一版本所需的只是确保@a/ $i不会在您检查它和使用它之间发生变化.这是通过添加锁来完成的.
选项1:
my @a :shared = 1..79265;
while (1) {
my $i;
{ lock @a; return if !@a; $i = shift(@a); }
worker($i);
}
Run Code Online (Sandbox Code Playgroud)
选项2:
my $I :shared = 1;
while (1) {
my $i;
{ lock $I; $i = $I; return if ++$I > 79265; }
worker($i);
}
Run Code Online (Sandbox Code Playgroud)
选项1是后面的Thread :: Queue解决方案的基础(尽管它可以按原样使用).选项2在下面用作原样.
我通常使用Thread :: Queue或Thread :: Queue :: Any:
use threads;
use Thread::Queue qw( );
my $NUM_WORKERS = 5;
sub worker {
my ($i) = @_;
... put your download code here ...
}
my $q = Thread::Queue->new();
my @workers;
for (1..$NUM_WORKERS) {
push @workers, async {
while (defined(my $i = $q->dequeue())) {
worker($i);
}
};
}
$q->enqueue($_) for 1..79265;
$q->enqueue(undef) for @workers;
$_->join() for @workers;
Run Code Online (Sandbox Code Playgroud)
但是我们可以轻而易举地做到:
use threads;
use threads::shared;
my $NUM_WORKERS = 5;
sub worker {
my ($i) = @_;
... put your download code here ...
}
my $I :shared = 1;
my $last = 79265;
my @workers;
for (1..$NUM_WORKERS) {
push @workers, async {
while (1) {
my $i;
{
lock $I;
$i = $I;
return if ++$I > $last;
}
worker($i);
}
};
}
$_->join() for @workers;
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
689 次 |
| 最近记录: |