Art*_*uro 10 html facebook meta-tags scraper web-scraping
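I am using the function below (GWT) to create HTML meta tags dynamically. They take about 1 second to appear in the DOM. This works fine everywhere except on Facebook. When I share a link from my site, the scraper reports the meta tags found in the HTML as: none. How can I fix this?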
/**
 * Include the HTML attributes: title, description and keywords (meta tags)
 */
private void createHTMLheader(MyClass thing) {
    String title = thing.getTitle();
    String description = thing.getDescription();

    Document.get().setTitle(title);

    MetaElement metaDesc = Document.get().createMetaElement();
    metaDesc.setName("description");
    metaDesc.setContent(description);

    NodeList<Element> nodes = Document.get().getElementsByTagName("head");
    nodes.getItem(0).appendChild(metaDesc);
}
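This is the resulting HEAD in the DOM. The title aaaa and the meta description have been loaded dynamically. (Thanks to CBroe for the hint.) The browser's "View Source" feature does not show these dynamic tags; they are only visible in the developer tools (DOM view).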
<head>
<title>aaaa</title>
<meta content="text/html; charset=utf-8" http-equiv="content-type">
<meta name="description" content="My description">
<script language="javascript" type="text/javascript" src="dialective/dialective.nocache.js"></script><script defer="defer">dialective.onInjectionDone('dialective')</script>
</head>
The original HTML contains no TITLE or META description tag.
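
To illustrate the gap between "View Source" and the live DOM, here is a minimal sketch in plain Java (not GWT). It assumes, as the symptoms suggest, that the scraper reads only the HTML the server sends and does not run scripts. `RawMetaScan` and `extractMeta` are hypothetical names made up for this example; they are not part of GWT or of any Facebook tool, and the regex parsing is a simplification that only handles double-quoted attributes.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: collects <meta name|property="..." content="..."> pairs
// from an HTML string without executing any JavaScript.
class RawMetaScan {
    // Matches a whole <meta ...> tag and captures its attribute list.
    private static final Pattern META =
            Pattern.compile("<meta\\b([^>]*)>", Pattern.CASE_INSENSITIVE);
    // Matches one double-quoted attribute, e.g. name="description".
    private static final Pattern ATTR =
            Pattern.compile("([\\w:-]+)\\s*=\\s*\"([^\"]*)\"");

    static Map<String, String> extractMeta(String html) {
        Map<String, String> result = new LinkedHashMap<>();
        Matcher tag = META.matcher(html);
        while (tag.find()) {
            String name = null;
            String content = null;
            Matcher attr = ATTR.matcher(tag.group(1));
            while (attr.find()) {
                String key = attr.group(1).toLowerCase();
                if (key.equals("name") || key.equals("property")) {
                    name = attr.group(2);
                } else if (key.equals("content")) {
                    content = attr.group(2);
                }
            }
            // Tags such as <meta http-equiv=...> have no name and are skipped.
            if (name != null && content != null) {
                result.put(name, content);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Served HTML as described in the question: no title, no description.
        String served = "<head>"
                + "<meta content=\"text/html; charset=utf-8\" http-equiv=\"content-type\">"
                + "<script src=\"dialective/dialective.nocache.js\"></script>"
                + "</head>";
        // DOM after createHTMLheader has run in a browser.
        String liveDom = "<head><title>aaaa</title>"
                + "<meta content=\"text/html; charset=utf-8\" http-equiv=\"content-type\">"
                + "<meta name=\"description\" content=\"My description\">"
                + "</head>";

        System.out.println("served HTML: " + extractMeta(served));
        System.out.println("live DOM:    " + extractMeta(liveDom));
    }
}
```

Under that assumption, the description only shows up in the second input, which matches what the scraper reports: the tag has to be present in the markup the server returns, not added afterwards on the client.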