Generate a table of contents from Markdown in PHP

giò*_*giò 11 php markdown parsing

I want to create a table of contents from Markdown.
For example, in stackedit.io (https://stackedit.io/editor#table-of-contents), when you insert:

[TOC]

Is there a way to generate this from Markdown?

For example, if you have:

## header 1
## header 2

The ToC should be:

<ol>
   <li><a href="#header1">Header 1</a></li>
   <li><a href="#header2">Header 2</a></li>
</ol>

Do I have to write my own Markdown parser to get the ToC?

cFr*_*eed 6

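Here is a function that does the basic job: it returns a JSON list of the headings it finds, each with its level and text.
This JSON can then be used to generate the desired HTML structure, or anything else.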

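Schematically, it works like this:

  1. Get the Markdown file as a string and normalize its line breaks to \n only (this matters for step 3 below)
  2. Apply a simple regex, /^(?:=|-|#).*$/m, with PREG_OFFSET_CAPTURE, to match every line that is either:
    • the "underline" of an <h1> (when "=") or <h2> (when "-") heading
    • a heading itself (starting with "#")
  3. Iterate over the matched lines:
    • for an "underline", look at the previous line of the source, i.e. the string between the preceding line break and the current line's offset; then take the level from the underline type and the text from that previous line
    • otherwise, take both the level and the text from the current line itself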
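To make steps 1 and 2 concrete, here is a small sketch of what the match array looks like when offsets are captured. The sample string is made up for illustration; only `str_replace()` and `preg_match_all()` with `PREG_OFFSET_CAPTURE` are taken from the answer.

```php
<?php
// Hypothetical sample input, mixing "\r\n" line breaks and both heading styles.
$markdown = "Title\r\n=====\r\n\r\n## Section\r\ntext\r\n";

// Step 1: normalize line breaks so that offsets and "\n" lookups agree.
$source = str_replace(["\r\n", "\r"], "\n", $markdown);

// Step 2: match every line starting with "=", "-" or "#".
// With PREG_OFFSET_CAPTURE each match is a pair [matched line, byte offset].
preg_match_all('/^(?:=|-|#).*$/m', $source, $matches, PREG_OFFSET_CAPTURE);

foreach ($matches[0] as [$line, $offset]) {
    printf("offset %2d: %s\n", $offset, $line);
}
// prints:
// offset  6: =====
// offset 13: ## Section
```

The offset of the `=====` line is what lets step 3 walk back to the line just before it ("Title") without splitting the whole file into lines.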

Here is the function:

function markdown_toc($file_path) {
  $file = file_get_contents($file_path);
  if ($file === false) {
    return json_encode([]); // unreadable file: empty TOC
  }

  // ensure using only "\n" as line-break
  $source = str_replace(["\r\n", "\r"], "\n", $file);

  // look for markdown TOC items: underlines ("=" or "-") and "#" headings
  preg_match_all(
    '/^(?:=|-|#).*$/m',
    $source,
    $matches,
    PREG_PATTERN_ORDER | PREG_OFFSET_CAPTURE
  );

  // preprocess: iterate matched lines to create an array of items
  // where each item is an array(level, text)
  $raw_toc = [];
  foreach ($matches[0] as $item) {
    $found_mark = substr($item[0], 0, 1);
    if ($found_mark === '#') {
      // level is the number of leading "#", text is the rest of the line
      $item_level = strspn($item[0], '#');
      $item_text = substr($item[0], $item_level);
    } else {
      // text is the previous line (empty if <hr> or if there is no previous line)
      $before = substr($source, 0, max(0, $item[1] - 1));
      $prev_line_offset = strrpos($before, "\n");
      $item_text = $prev_line_offset === false
        ? $before
        : substr($before, $prev_line_offset + 1);
      $item_level = $found_mark === '=' ? 1 : 2;
    }
    $item_text = trim($item_text);
    if ($item_text === '' || strpos($item_text, '|') !== false) {
      // item is a horizontal separator or a table header: skip it
      continue;
    }
    $raw_toc[] = ['level' => $item_level, 'text' => $item_text];
  }

  // create a JSON list (the easiest way to generate HTML structure is using JS)
  return json_encode($raw_toc);
}
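If you would rather stay in PHP than hand the JSON to JavaScript, the list can be turned into the `<ol>` markup from the question with a small stack-based loop. This is only a sketch: `toc_to_html()` and `toc_anchor()` are hypothetical helpers, not part of the function above, and the anchor scheme (lowercase, ASCII letters and digits only) is an assumption chosen to reproduce `#header1` from the question. Your Markdown renderer may generate different ids, and headings without ASCII characters would get an empty anchor.

```php
<?php
// Hypothetical helper: build an anchor such as "header1" from "Header 1".
// Assumption: the rendered headings carry ids built the same way.
function toc_anchor(string $text): string {
    return preg_replace('/[^a-z0-9]+/', '', strtolower($text));
}

// Hypothetical helper: render a JSON list of {level, text} items
// as nested <ol> lists.
function toc_to_html(string $json): string {
    $items = json_decode($json, true) ?: [];
    $html  = '';
    $stack = []; // heading levels of the <ol> elements currently open

    foreach ($items as $item) {
        $level = (int) $item['level'];

        // Close every list that is deeper than the current heading.
        while ($stack && end($stack) > $level) {
            $html .= '</li></ol>';
            array_pop($stack);
        }

        if ($stack && end($stack) === $level) {
            $html .= '</li>';   // sibling of the previous item
        } else {
            $html .= '<ol>';    // first item, or deeper than anything open
            $stack[] = $level;
        }

        $text  = strip_tags($item['text']); // drop inline HTML such as icons
        $html .= '<li><a href="#' . toc_anchor($text) . '">'
               . htmlspecialchars($text) . '</a>';
    }

    // Close whatever is still open.
    while ($stack) {
        $html .= '</li></ol>';
        array_pop($stack);
    }
    return $html;
}

echo toc_to_html('[{"level":2,"text":"header 1"},{"level":2,"text":"header 2"}]');
```

For the two headings in the question this prints `<ol><li><a href="#header1">header 1</a></li><li><a href="#header2">header 2</a></li></ol>`. Levels that skip (for example 2 followed by 4) simply open one nested list rather than two.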

Here is what markdown_toc() returns for the home page of the link you provided:

[
  {"level":1,"text":"Welcome to StackEdit!"},
  {"level":2,"text":"Documents"},
  {"level":4,"text":"<\/i> Create a document"},
  {"level":4,"text":"<\/i> Switch to another document"},
  {"level":4,"text":"<\/i> Rename a document"},
  {"level":4,"text":"<\/i> Delete a document"},
  {"level":4,"text":"<\/i> Export a document"},
  {"level":2,"text":"Synchronization"},
  {"level":4,"text":"<\/i> Open a document"},
  {"level":4,"text":"<\/i> Save a document"},
  {"level":4,"text":"<\/i> Synchronize a document"},
  {"level":4,"text":"<\/i> Manage document synchronization"},
  {"level":2,"text":"Publication"},
  {"level":4,"text":"<\/i> Publish a document"},
  {"level":2,"text":"- Markdown, to publish the Markdown text on a website that can interpret it (**GitHub** for instance),"},
  {"level":2,"text":"- HTML, to publish the document converted into HTML (on a blog for example),"},
  {"level":4,"text":"<\/i> Update a publication"},
  {"level":4,"text":"<\/i> Manage document publication"},
  {"level":2,"text":"Markdown Extra"},
  {"level":3,"text":"Tables"},
  {"level":3,"text":"Definition Lists"},
  {"level":3,"text":"Fenced code blocks"},
  {"level":3,"text":"Footnotes"},
  {"level":3,"text":"SmartyPants"},
  {"level":3,"text":"Table of contents"},
  {"level":3,"text":"MathJax"},
  {"level":3,"text":"UML diagrams"},
  {"level":3,"text":"Support StackEdit"}
]
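Note the two level-2 entries whose text starts with "- ": they are list items, not headings. The pattern accepts any line that begins with "-", so a list item that follows another non-empty line is treated like an `<h2>` underline. One possible tightening, sketched here with a hypothetical helper `is_setext_underline()` that is not part of the function above, is to accept a line as an underline only when it consists entirely of "=" or entirely of "-":

```php
<?php
// Hypothetical helper: true only for lines made up solely of "=" or solely
// of "-" characters (trailing spaces or tabs allowed).
function is_setext_underline(string $line): bool {
    return preg_match('/^(?:=+|-+)[ \t]*$/', $line) === 1;
}

var_dump(is_setext_underline('====='));                  // bool(true)
var_dump(is_setext_underline('---'));                    // bool(true)
var_dump(is_setext_underline('- HTML, to publish ...')); // bool(false)
```

Inside the `else` branch of `markdown_toc()`, a `continue` whenever `is_setext_underline($item[0])` returns false would drop those entries while leaving real underlines and horizontal rules handled as before.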