XSLT文件中的XPath语句会将下面的HTML转换为下面的XML吗?

fan*_*ang -2 java xml xslt xpath

我想要下面的表数据:

<html>
<table border="1">
<tr>
<td rowspan="2">2015</td>
<td>First Event of 2015</td>
</tr>
<tr><td>Second Event of 2015</td></tr>
<tr>
<td rowspan="2">2014</td>
<td>First Event of 2014</td>
</tr>
<tr><td>Second Event of 2014</td></tr>
</table>
</html>
Run Code Online (Sandbox Code Playgroud)

使用XPath转换为以下XML:

<events>
<event year="2015" name="First Event of 2015">
<event year="2015" name="Second Event of 2015">
<event year="2014" name="First Event of 2014">
<event year="2014" name="Second Event of 2014">
</events>
Run Code Online (Sandbox Code Playgroud)

如何处理xpath中的rowspans以获得此输出?

为了记录,我使用以下Java代码来执行XSLT转换:

String xsltCode = ... // the xslt Im asking for....
File xmlInput = ... // the file with the html code above
File xmlOutput = new File("output.xml");
Transformer transformer = TransformerFactory.newInstance().newTransformer(new StreamSource(new StringReader(xsltCode)));
transformer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
transformer.setOutputProperty(OutputKeys.INDENT, "yes");
Source xmlSource = new StreamSource(xmlInput);
Result resultOutput = new StreamResult(xmlOutput);
transformer.transform(xmlSource,resultOutput);
Run Code Online (Sandbox Code Playgroud)

Mat*_*ler 6

我很高兴我们终于找到了你需要的东西.请尽量从一开始就明确您的未来问题 - 这将为您节省时间和投票.

编写匹配/并输出输出最外层元素的第一个模板events.然后,编写第二个模板,该模板匹配所有td没有@rowspan属性的元素.在一年中的信息必须从第一在前被选择td该元素确实有一个@rowspan属性.

XSLT样式表

<?xml version="1.0" encoding="UTF-8" ?>
<xsl:transform xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="1.0">
    <xsl:output method="xml" encoding="UTF-8" indent="yes" />

    <xsl:strip-space elements="*"/>

    <xsl:template match="/">
      <events>
          <xsl:apply-templates/>
      </events>
    </xsl:template>

    <xsl:template match="td[not(@rowspan)]">
        <event year="{preceding::td[@rowspan][1]}">
            <xsl:value-of select="."/>
        </event>
    </xsl:template>

    <xsl:template match="text()"/>
</xsl:transform>
Run Code Online (Sandbox Code Playgroud)

XML输出

<?xml version="1.0" encoding="UTF-8"?>
<events>
   <event year="2015">First Event of 2015</event>
   <event year="2015">Second Event of 2015</event>
   <event year="2014">First Event of 2014</event>
   <event year="2014">Second Event of 2014</event>
</events>
Run Code Online (Sandbox Code Playgroud)

在线尝试这种解决方案在这里.