如何检测和删除空XML标签?

vik*_*ata 0 xml tags powershell

我有一堆XML文件,我想检测并删除里面的空标签.喜欢:

<My></My>
<Your/>

<sometags>
    <his>
    </his>
    <hasContent>sdfaf</hasContent>
</sometags>
Run Code Online (Sandbox Code Playgroud)

他们是各种空标签(My,Your,his)我想删除.PowerShell是否支持这种空标记检测,无论它们嵌入其他标记的深度有多深?

Dav*_*ant 6

function Format-XML
{ 
    param (
        [parameter(Mandatory = $true)][xml] $xml, 
        [parameter(Mandatory = $false)][int] $indent = 4
    ) 

    try
    {
        $Error.Clear()

        $StringWriter = New-Object System.IO.StringWriter 
        $XmlWriter = New-Object System.XMl.XmlTextWriter $StringWriter 
        $xmlWriter.Formatting = "indented" 
        $xmlWriter.Indentation = $indent 
        $xml.WriteContentTo($XmlWriter) 
        $XmlWriter.Flush() 
        $StringWriter.Flush() 

        return $StringWriter.ToString() 
    }

    catch
    {
        Write-Host "$($MyInvocation.InvocationName): $_"; return $null
    }
}


$xml = [xml] @"
<document>
    <My></My>
    <Your/>
    <sometags>
        <his>
        </his>
        <hasContent>sdfaf</hasContent>
    </sometags>
</document>
"@

# The "magic" part is in this XPath expression

$nodes = $xml.SelectNodes("//*[count(@*) = 0 and count(child::*) = 0 and not(string-length(text())) > 0]")

$nodes | %{
    $_.ParentNode.RemoveChild($_)
}

Format-Xml $xml
Run Code Online (Sandbox Code Playgroud)

  • 谢谢,但是你能帮忙解释一下魔法XPath字符串的3个子部分吗? (2认同)