Merge a line with the next line if its last character is not a semicolon, using a batch file

Jun*_*aid 4 batch-file

I have a file containing the following 4 lines.

A;1;abc;<xml/>;
;2;def;<xml
>hello world</xml>;
;3;ghi;<xml/>;

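Using a batch file, I need to merge lines: if a line does not end with a semicolon (;), the next line should be appended to it.

So the desired output is: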

A;1;abc;<xml/>;
;2;def;<xml>hello world</xml>;
;3;ghi;<xml/>;

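I am not very familiar with batch scripting. I tried using for /F, but have had no luck so far.

As far as I can tell, the logic should be to check the last character of each line and, if it is not a semicolon, append the next line to the current one.

Beyond that, I managed to get the last character of the line, but my script only reads a line if it does not start with ;. Any ideas?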

@echo off
for /f "tokens=*" %%i in (myfile.txt) do (
  set var=%%i
  echo %%i
  if "%var:~-1%"==";" (
    echo test
  )
)

Note: the script above only reads lines 1 and 3.

dbe*_*ham 6

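Your code has a number of problems :)

1) As you noted, your code ignores lines that begin with ;. That is caused by the default FOR /F EOL option. But because of "TOKENS=*", your code also strips leading spaces from every line. You need to set both EOL and DELIMS to nothing. The syntax is odd, but it works: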

for /f delims^=^ eol^= %%i ...

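2) You attempt to set and expand var within a parenthesized code block. That cannot work because expansion occurs when a line is parsed, and the entire code block is parsed at once. So the value of %var% is the value that existed before the loop executes. Certainly not what you want. The solution is to use delayed expansion. Type SET /? at the command prompt for more information about delayed expansion (roughly halfway through the help listing).

3) The content of a FOR variable such as %%i is corrupted if it contains ! and is expanded while delayed expansion is enabled. The solution is to toggle delayed expansion on and off within the loop as needed. But that causes a complication, because the value of the growing line must be preserved across the ENDLOCAL barrier. I use FOR /F to carry the value across the barrier.

Here is a complete batch script that should do the job. Its limitation is that it cannot handle lines longer than the maximum length of ~8191 bytes.

This code has been rewritten to fix a major bug.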

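@echo off
setlocal disableDelayedExpansion
rem ln accumulates the partial line built so far
set "ln="
set "print=0"
for /f delims^=^ eol^= %%i in (myfile.txt) do (
  rem Capture the line while delayed expansion is off so exclamation marks survive
  set "var=%%i"
  setlocal enableDelayedExpansion
  rem Carry the joined text across ENDLOCAL in the FOR variable A
  for /f delims^=^ eol^= %%A in ("!ln!!var!") do (
    if "!var:~-1!"==";" (
      endlocal
      rem Line is complete: print it and reset the accumulator
      echo %%A
      set "ln="
    ) else (
      endlocal
      rem Line continues: keep the joined text for the next iteration
      set "ln=%%A"
    )
  )
)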
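For readers less used to batch syntax, the accumulate-and-flush logic of the script above can be sketched in plain JavaScript (written in an ES3 style, so it should also run under JScript). The function name mergeLines and the array-based input are assumptions made for illustration; they are not part of the original script. Like the batch version, the sketch drops a trailing fragment that never reaches a semicolon.

```javascript
// Hypothetical helper, not part of the original answer:
// joins physical lines into logical records that end with ";".
function mergeLines(lines) {
  var result = [];
  var pending = "";                 // plays the role of the batch variable ln
  for (var i = 0; i < lines.length; i++) {
    var line = lines[i];
    pending += line;                // same as "!ln!!var!" in the batch script
    if (line.charAt(line.length - 1) === ";") {
      result.push(pending);         // record complete: emit it
      pending = "";                 // and reset the accumulator
    }
  }
  return result;                    // an unfinished trailing fragment is discarded
}

// Sample data taken from the question.
var sample = [
  "A;1;abc;<xml/>;",
  ";2;def;<xml",
  ">hello world</xml>;",
  ";3;ghi;<xml/>;"
];
var merged = mergeLines(sample);
```

With the sample data, merged holds three records, the second being the two middle lines joined together.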

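SET /P solution

There is a simpler solution that prints each line as soon as it is read, so you don't have to worry about carrying a variable across ENDLOCAL. Lines that do not end with ; are printed with SET /P, which does not append a newline.

This solution has the following limitations:

1) Leading whitespace is stripped from lines printed via SET /P. This limitation only applies to Vista and newer versions of Windows. It is not a problem on XP.

2) Thanks to David Ruhmann, I now know that SET /P fails if the line begins with =. Very unfortunate :(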

@echo off
setlocal disableDelayedExpansion
set "ln="
for /f delims^=^ eol^= %%i in (myfile.txt) do (
  set "var=%%i"
  setlocal enableDelayedExpansion
  if "!var:~-1!"==";" (echo !var!) else (<nul set /p ="!var!")
  endlocal
)

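Hybrid batch/JScript regex solution (bulletproof?)

I have written a hybrid batch/JScript utility, REPL.BAT, that makes it easy to run a regular expression search and replace on file content. It makes this job very simple.

The following command should work with any input, without limitations. It has been updated to support both Windows and Unix style line endings. It is much faster than the pure batch solutions.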

findstr "^." myfile.txt|repl "([^;\r])\r?\n" "$1" m >"outFile.txt"
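To see what the regular expression in that command does, here is a small sketch in plain JavaScript. The pattern and replacement strings are copied from the command; the function name joinContinuedLines and the sample string are illustrative assumptions. REPL.BAT always applies the g flag and the M option adds m, so the sketch uses both.

```javascript
// Pattern and replacement as passed to REPL.BAT in the command above.
// "g" is always set by REPL.BAT; "m" comes from the M option.
var pattern = new RegExp("([^;\\r])\\r?\\n", "gm");

// Hypothetical wrapper used only for this illustration.
function joinContinuedLines(text) {
  // $1 puts back the character that preceded the removed line break.
  return text.replace(pattern, "$1");
}

// Sample data from the question, with Windows line endings.
var input =
  "A;1;abc;<xml/>;\r\n" +
  ";2;def;<xml\r\n" +
  ">hello world</xml>;\r\n" +
  ";3;ghi;<xml/>;\r\n";
var output = joinContinuedLines(input);
```

The capture group matches any character other than ; or a carriage return, so a line break is removed only when the character before it is not a semicolon. The optional \r lets the same pattern handle both Windows and Unix line endings.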

Here is the REPL.BAT utility. Full documentation is embedded within the script.

@if (@X)==(@Y) @end /* Harmless hybrid line that begins a JScript comment

::************ Documentation ***********
:::
:::REPL  Search  Replace  [Options  [SourceVar]]
:::REPL  /?
:::
:::  Performs a global search and replace operation on each line of input from
:::  stdin and prints the result to stdout.
:::
:::  Each parameter may be optionally enclosed by double quotes. The double
:::  quotes are not considered part of the argument. The quotes are required
:::  if the parameter contains a batch token delimiter like space, tab, comma,
:::  semicolon. The quotes should also be used if the argument contains a
:::  batch special character like &, |, etc. so that the special character
:::  does not need to be escaped with ^.
:::
:::  If called with a single argument of /? then prints help documentation
:::  to stdout.
:::
:::  Search  - By default this is a case sensitive JScript (ECMA) regular
:::            expression expressed as a string.
:::
:::            JScript syntax documentation is available at
:::            http://msdn.microsoft.com/en-us/library/ae5bf541(v=vs.80).aspx
:::
:::  Replace - By default this is the string to be used as a replacement for
:::            each found search expression. Full support is provided for
:::            substitution patterns available to the JScript replace method.
:::            A $ literal can be escaped as $$. An empty replacement string
:::            must be represented as "".
:::
:::            Replace substitution pattern syntax is documented at
:::            http://msdn.microsoft.com/en-US/library/efy6s3e6(v=vs.80).aspx
:::
:::  Options - An optional string of characters used to alter the behavior
:::            of REPL. The option characters are case insensitive, and may
:::            appear in any order.
:::
:::            I - Makes the search case-insensitive.
:::
:::            L - The Search is treated as a string literal instead of a
:::                regular expression. Also, all $ found in Replace are
:::                treated as $ literals.
:::
:::            E - Search and Replace represent the name of environment
:::                variables that contain the respective values. An undefined
:::                variable is treated as an empty string.
:::
:::            M - Multi-line mode. The entire contents of stdin is read and
:::                processed in one pass instead of line by line. ^ anchors
:::                the beginning of a line and $ anchors the end of a line.
:::
:::            X - Enables extended substitution pattern syntax with support
:::                for the following escape sequences:
:::
:::                \\     -  Backslash
:::                \b     -  Backspace
:::                \f     -  Formfeed
:::                \n     -  Newline
:::                \r     -  Carriage Return
:::                \t     -  Horizontal Tab
:::                \v     -  Vertical Tab
:::                \xnn   -  Ascii (Latin 1) character expressed as 2 hex digits
:::                \unnnn -  Unicode character expressed as 4 hex digits
:::
:::                Escape sequences are supported even when the L option is used.
:::
:::            S - The source is read from an environment variable instead of
:::                from stdin. The name of the source environment variable is
:::                specified in the next argument after the option string.
:::

::************ Batch portion ***********
@echo off
if .%2 equ . (
  if "%~1" equ "/?" (
    findstr "^:::" "%~f0" | cscript //E:JScript //nologo "%~f0" "^:::" ""
    exit /b 0
  ) else (
    call :err "Insufficient arguments"
    exit /b 1
  )
)
echo(%~3|findstr /i "[^SMILEX]" >nul && (
  call :err "Invalid option(s)"
  exit /b 1
)
cscript //E:JScript //nologo "%~f0" %*
exit /b 0

:err
>&2 echo ERROR: %~1. Use REPL /? to get help.
exit /b

************* JScript portion **********/
var env=WScript.CreateObject("WScript.Shell").Environment("Process");
var args=WScript.Arguments;
var search=args.Item(0);
var replace=args.Item(1);
var options="g";
if (args.length>2) {
  options+=args.Item(2).toLowerCase();
}
var multi=(options.indexOf("m")>=0);
var srcVar=(options.indexOf("s")>=0);
if (srcVar) {
  options=options.replace(/s/g,"");
}
if (options.indexOf("e")>=0) {
  options=options.replace(/e/g,"");
  search=env(search);
  replace=env(replace);
}
if (options.indexOf("l")>=0) {
  options=options.replace(/l/g,"");
  search=search.replace(/([.^$*+?()[{\\|])/g,"\\$1");
  replace=replace.replace(/\$/g,"$$$$");
}
if (options.indexOf("x")>=0) {
  options=options.replace(/x/g,"");
  replace=replace.replace(/\\\\/g,"\\B");
  replace=replace.replace(/\\b/g,"\b");
  replace=replace.replace(/\\f/g,"\f");
  replace=replace.replace(/\\n/g,"\n");
  replace=replace.replace(/\\r/g,"\r");
  replace=replace.replace(/\\t/g,"\t");
  replace=replace.replace(/\\v/g,"\v");
  replace=replace.replace(/\\x[0-9a-fA-F]{2}|\\u[0-9a-fA-F]{4}/g,
    function($0,$1,$2){
      return String.fromCharCode(parseInt("0x"+$0.substring(2)));
    }
  );
  replace=replace.replace(/\\B/g,"\\");
}
var search=new RegExp(search,options);

if (srcVar) {
  WScript.Stdout.Write(env(args.Item(3)).replace(search,replace));
} else {
  while (!WScript.StdIn.AtEndOfStream) {
    if (multi) {
      WScript.Stdout.Write(WScript.StdIn.ReadAll().replace(search,replace));
    } else {
      WScript.Stdout.WriteLine(WScript.StdIn.ReadLine().replace(search,replace));
    }
  }
}