Perl script to extract data from an HTML table

-6 html perl html-table html-parsing

I need a Perl script to parse a simple HTML table with 2 columns.

The table contains the user name in column 1 and the time he/she has been online in column 2.

I need the data in the form: Name Time_online

I have tried HTML::TreeBuilder and HTML::Parser, but without results.

I would like to fetch and manipulate this data somehow. Thanks!

<h1>Users currently online</h1>
<table class="list-table" cellpadding="3" cellspacing="1" summary="Data for online_users">
<thead>
<tr class="list-header">
<th class="list-table" id="online_users_name">
User name
</th>
<th class="list-table" align="right" id="online_users_online_time_pretty">
Online time        
</th>
    </tr>
    </thead>  

    <tbody>
              <tr class="odd"> 
              <td class="list-table" headers="online_users_name">
                <a href="Link">Jack Boauldwin</a>
              </td>

              <td class="list-table" align="right" headers="online_users_online_time_pretty">
                11 minutes
              </td>
            </tr>
        <tr class="even">  

              <td class="list-table" headers="online_users_name">
                <a href="Link">Sarah James</a>
              </td>

              <td class="list-table" align="right" headers="online_users_online_time_pretty">
                1 minutes
              </td>            
          </tr>

              <tr class="odd">


              <td class="list-table" headers="online_users_name">
                <a href="Link">Greg Brown</a>
              </td>

              <td class="list-table" align="right" headers="online_users_online_time_pretty">
                0 minutes
              </td>

          </tr>    
</tbody>
</table>

cho*_*oba 5

Use HTML::TableExtract.
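A minimal sketch of how that could look for the table in the question, assuming HTML::TableExtract is installed from CPAN. The table is located by its header text ("User name", "Online time"). The helper name `extract_users` and the shortened inline copy of the HTML are only for illustration and are not part of the original post.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::TableExtract;

# Shortened copy of the table from the question (assumption: the real
# page has the same header text and two-column layout).
my $html = <<'HTML';
<h1>Users currently online</h1>
<table class="list-table" summary="Data for online_users">
<thead>
<tr class="list-header">
<th class="list-table" id="online_users_name">
User name
</th>
<th class="list-table" align="right" id="online_users_online_time_pretty">
Online time
</th>
</tr>
</thead>
<tbody>
<tr class="odd">
<td class="list-table"><a href="Link">Jack Boauldwin</a></td>
<td class="list-table" align="right"> 11 minutes </td>
</tr>
<tr class="even">
<td class="list-table"><a href="Link">Sarah James</a></td>
<td class="list-table" align="right"> 1 minutes </td>
</tr>
<tr class="odd">
<td class="list-table"><a href="Link">Greg Brown</a></td>
<td class="list-table" align="right"> 0 minutes </td>
</tr>
</tbody>
</table>
HTML

# Hypothetical helper: returns a list of [name, time_online] pairs.
sub extract_users {
    my ($content) = @_;

    # Select the table whose header row contains both of these strings.
    my $te = HTML::TableExtract->new(
        headers => [ 'User name', 'Online time' ],
    );
    $te->parse($content);

    my @users;
    for my $table ( $te->tables ) {
        for my $row ( $table->rows ) {
            # Cells may be undef if a row is short; treat those as empty.
            my ( $name, $time ) = map { defined $_ ? $_ : '' } @$row;

            # Cell text keeps the surrounding whitespace of the markup.
            s/^\s+|\s+$//g for $name, $time;

            push @users, [ $name, $time ];
        }
    }
    return @users;
}

# Print one "Name<TAB>Time_online" line per user.
print join( "\t", @$_ ), "\n" for extract_users($html);
```

Because the rows come back as plain array references, the data can be sorted, filtered, or stored in a hash keyed by user name instead of being printed.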

  • Indeed. Here is [my reply (on my blog) to his previous question](http://blog.nu42.com/2012/04/htmltableextract-is-beautiful.html). (2 upvotes)