MongoDB docker 副本集连接错误“找不到主机”

Chu*_* Lu 8 mongodb docker replicaset docker-compose

我按照这个 SO 答案创建了一个本地 MongoDB 副本集。

docker-compose 文件:

services:
  mongo1:
    container_name: mongo1
    image: mongo:4.2
    ports:
      - 27017:27017
    restart: always
    command: ["--bind_ip_all", "--replSet", "rs" ]
  mongo2:
    container_name: mongo2
    image: mongo:4.2
    ports:
      - 27018:27017
    restart: always
    command: ["--bind_ip_all", "--replSet", "rs" ]
  mongo3:
    container_name: mongo3
    image: mongo:4.2
    ports:
      - 27019:27017
    restart: always
    command: ["--bind_ip_all", "--replSet", "rs" ]
  replica_set:
    image: mongo:4.2
    container_name: replica_set
    depends_on:
      - mongo1
      - mongo2
      - mongo3
    volume:
      - ./initiate_replica_set.sh:/initiate_replica_set.sh
    entrypoint: 
      - /initiate_replica_set.sh
Run Code Online (Sandbox Code Playgroud)

initiate_replica_set.sh 文件:

#!/bin/bash

echo "Starting replica set initialize"
until mongo --host mongo1 --eval "print(\"waited for connection\")"
do
    sleep 2
done
echo "Connection finished"
echo "Creating replica set"
mongo --host mongo1 <<EOF
rs.initiate(
  {
    _id : 'rs0',
    members: [
      { _id : 0, host : "mongo1:27017" },
      { _id : 1, host : "mongo2:27017" },
      { _id : 2, host : "mongo3:27017" }
    ]
  }
)
EOF
echo "replica set created"
Run Code Online (Sandbox Code Playgroud)

副本集已成功启动并运行良好,但当我尝试连接到副本集时出现错误:

$ mongo "mongodb://localhost:27017,localhost:27018,localhost:27019/?replicaSet=rs"
MongoDB shell version v5.0.2
connecting to: mongodb://localhost:27017,localhost:27018,localhost:27019/?compressors=disabled&gssapiServiceName=mongodb&replicaSet=rs
{"t":{"$date":"2021-08-05T21:35:40.667Z"},"s":"I",  "c":"NETWORK",  "id":4333208, "ctx":"ReplicaSetMonitor-TaskExecutor","msg":"RSM host selection timeout","attr":{"replicaSet":"rs","error":"FailedToSatisfyReadPreference: Could not find host matching read preference { mode: \"nearest\" } for set rs"}}
Error: Could not find host matching read preference { mode: "nearest" } for set rs, rs/localhost:27017,localhost:27018,localhost:27019 :
connect@src/mongo/shell/mongo.js:372:17
@(connect):2:6
exception: connect failed
exiting with code 1
Run Code Online (Sandbox Code Playgroud)

更详细的日志:

{
  "t": {
    "$date": "2021-08-05T21:35:54.531Z"
  },
  "s": "I",
  "c": "-",
  "id": 4333222,
  "ctx": "ReplicaSetMonitor-TaskExecutor",
  "msg": "RSM received error response",
  "attr": {
    "host": "mongo1:27017",
    "error": "HostUnreachable: Error connecting to mongo1:27017 :: caused by :: Could not find address for mongo1:27017: SocketException: Host not found (authoritative)",
    "replicaSet": "rs",
    "response": "{}"
  }
}
Run Code Online (Sandbox Code Playgroud)

问题的原因是什么以及如何解决?

Chu*_* Lu 13

关于这个问题,各地都有一些部分答案,以下是我认为完整的答案。

原因

  • Mongo 客户端使用副本集配置中列出的主机名,而不是种子列表

    尽管连接字符串是"mongodb://localhost:27017,localhost:27018,localhost:27019/?replicaSet=rs",mongo 客户端不会连接到具有种子地址等的副本集成员localhost:27017,而是连接到从种子主机返回的副本配置集中的成员,即调用中的成员rs.initiate。这就是错误消息而Error connecting to mongo1:27017不是 的原因Error connecting to localhost:27017

  • 容器主机名在容器网络之外不可寻址

    与 mongo 服务器容器位于同一容器网络内的 mongo 客户端可以通过以下地址连接到服务器mongo1:27017:但是,位于容器网络外部的主机上的客户端无法解析mongo1为 IP。此问题的典型解决方案是代理,详细信息请参阅使用容器名称从主机访问 docker 容器。

修复方法

因为问题涉及到 docker 网络,而 Linux 和 Mac 之间的 docker 网络有所不同。两个平台上的修复有所不同。

Linux

代理修复(通过第 3 方软件或修改/etc/hosts文件)工作正常,但有时不可行,例如在远程 CI 主机上运行。一个简单的独立可移植解决方案是更新intiate_replia_set.sh脚本以使用成员 IP 而不是主机名来启动副本集。

intiate_replia_set.sh

echo "Starting replica set initialization"
until mongo --host mongo1 --eval "print(\"waited for connection\")"
do
   sleep 2
done
echo "Connection finished"
echo "Creating replica set"

MONGO1IP=$(getent hosts mongo1 | awk '{ print $1 }')
MONGO2IP=$(getent hosts mongo2 | awk '{ print $1 }')
MONGO3IP=$(getent hosts mongo3 | awk '{ print $1 }')

read -r -d '' CMD <<EOF
rs.initiate(
  {
    _id : 'rs',
    members: [
      { _id : 0, host : '${MONGO1IP}:27017' },
      { _id : 1, host : '${MONGO2IP}:27017' },
      { _id : 2, host : '${MONGO3IP}:27017' }
    ]
  }
)
EOF

echo $CMD | mongo --host mongo1
echo "replica set created"
Run Code Online (Sandbox Code Playgroud)

这样,mongo 副本集成员的地址中就有容器 IP,而不是主机名。并且容器 IP 可从主机访问。

或者,我们可以在 docker-compose 文件中显式为每个容器分配静态 IP,并在启动副本集时使用静态 IP。这是一个类似的修复,但需要更多工作。

苹果

遗憾的是,上述解决方案不适用于 Mac,因为 Mac 上的 docker 容器 IP 未在主机网络接口上公开。https://docs.docker.com/docker-for-mac/networking/#per-container-ip-addressing-is-not-possible

使其工作的最简单方法是在/etc/hosts文件中添加以下映射:

127.0.0.1   mongo1 mongo2 mongo3
Run Code Online (Sandbox Code Playgroud)