标签: invisible-xml

使用 Invisible XML 从文本中提取记录

我有一份包含结构化条目的期刊参考书目的 OCR 文本。我想使用不可见的 XML标准来提取和解析条目。

\n

输入示例:

\n
\n1  2  Hype.  1990?- 1993.  Frequency:  Bimonthly.  River  Edge, \n\nNJ.  Published  by  Word  Up!  Video,  Inc.  Last  issue  66  pages. \nHeight  28  cm.  Line  drawings;  Photographs  (some  in  color); \nCommercial  advertising;  Table  of  contents.  Previous  editor(s): \nMarica  A.  Cole.  ISSN  1056-4632.  LC  card  no.  sn91-1965. \nOCLC  no.  23715422.  Subject  focus  and/or  Features:  Hip  hop \nculture,  Music,  Rap  music. \n\nWHi  v.l,  n.6;  v.2,  n.5  Pam  01-5450  Aug,  1992;  Aug,  1993 …
Run Code Online (Sandbox Code Playgroud)

xml grammar text-parsing invisible-xml

2
推荐指数
1
解决办法
105
查看次数

标签 统计

grammar ×1

invisible-xml ×1

text-parsing ×1

xml ×1