Isolating the video ID from a URL link

tec*_*t82 0 php regex

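I was browsing Stack Overflow and found a great piece of regex code here. There may be other ways to isolate a YouTube video ID, but I chose regex because I want to learn it. With input 1 (shown below), the regex ignores everything after the & character. That wipes out the video ID, so I get a wrong or empty ID. Why does the regex drop everything after the &?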

Wrong:

Input 1: http://www.youtube.com/watch?feature&v=317a815FLWQ

Result 1: http://www.youtube.com/watch?feature

Works:

Input 2: http://www.youtube.com/watch?v=spDj54kf-vY&feature=g-vrec

Result 2: http://www.youtube.com/watch?v=spDj54kf-vY

Regex code (with the original comments):

$text = preg_replace('~
        # Match non-linked youtube URL in the wild. (Rev:20111012)
        https?://         # Required scheme. Either http or https.
        (?:[0-9A-Z-]+\.)? # Optional subdomain.
        (?:               # Group host alternatives.
          youtu\.be/      # Either youtu.be,
        | youtube\.com    # or youtube.com followed by
          \S*             # Allow anything up to VIDEO_ID,
          [^\w\-\s]       # but char before ID is non-ID char.
        )                 # End host alternatives.
        ([\w\-]{11})      # $1: VIDEO_ID is exactly 11 chars.
        (?=[^\w\-]|$)     # Assert next char is non-ID or EOS.
        (?!               # Assert URL is not pre-linked.
          [?=&+%\w]*      # Allow URL (query) remainder.
          (?:             # Group pre-linked alternatives.
            [\'"][^<>]*>  # Either inside a start tag,
          | </a>          # or inside <a> element text contents.
          )               # End recognized pre-linked alts.
        )                 # End negative lookahead assertion.
        [?=&+%\w-]*       # Consume any URL (query) remainder.
        ~ix', 
        '<a href="http://www.youtube.com/watch?v=$1">YouTube link: $1</a>',
        $text);
    return $text;

tho*_*son 6

Forget the regex and use parse_url, which splits a URL into its components:

Array
(
    [scheme] => http
    [host] => hostname
    [user] => username
    [pass] => password
    [path] => /path
    [query] => arg=value
    [fragment] => anchor
)
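
An array of that shape comes from calling parse_url on a URL that contains every component. A minimal sketch follows; the sample URL is an assumption chosen to match the keys listed above, not something taken from the question:

```php
<?php
// Sample URL (an assumption for illustration) containing every component.
$url = 'http://username:password@hostname/path?arg=value#anchor';

// parse_url returns an associative array of the components it finds.
$parts = parse_url($url);
print_r($parts);

// A single component can also be requested directly with a PHP_URL_* constant.
$query = parse_url($url, PHP_URL_QUERY);
echo $query, "\n"; // prints "arg=value"
```

Components that are absent from the URL are simply missing from the array, so it is worth checking with isset before reading a key such as 'query'.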

Then use parse_str on the query part of the URL to extract the variables.

Edit

Here is a better demonstration:

$url = "http://www.youtube.com/watch?feature&v=317a815FLWQ";

$parsed_url = parse_url($url);
$query = $parsed_url['query'];

$parsed_query = array();
parse_str($query, $parsed_query);

var_dump($parsed_query);

Output:

array(2) {
  ["feature"]=>
  string(0) ""
  ["v"]=>
  string(11) "317a815FLWQ"
}

Edit 2

Another example, extracting the ID from the second link given in the comments:

$url = "http://www.youtube.com/sandalsResorts#p/c/54B8C800269D7C1B/2/PPS-8DMrAn4";

$parsed_url = parse_url($url);
$fragment = $parsed_url['fragment'];
$fragment_parts = explode('/', $fragment);
$video_id = array_pop($fragment_parts);

print($video_id);

Output:

PPS-8DMrAn4

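However, if you are asking users to supply a link, you need to be very specific about what you accept. The link in the second example is not a video link, but if you want to be lenient with user input, you can run the link through both snippets and check whether either one yields an ID.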
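
Running a link through both snippets could look roughly like the sketch below. The function name extract_video_id is hypothetical, and the check that an ID is exactly 11 characters from letters, digits, underscore, and hyphen is an assumption borrowed from the regex in the question rather than a documented rule:

```php
<?php
// Hypothetical helper: try the query string first, then the fragment.
function extract_video_id(string $url): ?string
{
    $parts = parse_url($url);
    if ($parts === false) {
        return null; // parse_url gave up on a seriously malformed URL
    }

    $candidates = [];

    // Snippet 1: the "v" parameter of the query string.
    if (isset($parts['query'])) {
        parse_str($parts['query'], $query);
        if (isset($query['v']) && is_string($query['v'])) {
            $candidates[] = $query['v'];
        }
    }

    // Snippet 2: the last "/"-separated piece of the fragment.
    if (isset($parts['fragment'])) {
        $pieces = explode('/', $parts['fragment']);
        $candidates[] = array_pop($pieces);
    }

    // Assumption: an ID is exactly 11 characters of [A-Za-z0-9_-].
    foreach ($candidates as $candidate) {
        if (preg_match('/^[\w-]{11}$/D', $candidate) === 1) {
            return $candidate;
        }
    }
    return null; // neither snippet produced something that looks like an ID
}

var_dump(extract_video_id('http://www.youtube.com/watch?feature&v=317a815FLWQ'));
var_dump(extract_video_id('http://www.youtube.com/sandalsResorts#p/c/54B8C800269D7C1B/2/PPS-8DMrAn4'));
var_dump(extract_video_id('http://www.youtube.com/watch?feature'));
```

The first two calls return the IDs from the two examples above, and the third returns null because the URL has no v parameter and no fragment. Returning null rather than an empty string makes the "no ID found" case easy to tell apart from a real result.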