如何在基于Docker的微服务中管理每个环境数据？

Question

如何在基于Docker的微服务中管理每个环境数据？

Ste*_*ins 25 architecture deployment configuration docker microservices

在微服务架构中,我很难掌握如何管理特定于环境的配置(例如,数据库或消息代理的IP地址和凭据).

假设您有三个微服务("A","B"和"C"),每个都由不同的团队拥有和维护.每个团队都需要一个团队集成环境......他们使用微服务的最新快照,以及所有依赖微服务的稳定版本.当然,您还需要QA /登台/制作环境.大图的简化视图如下所示:

"微服务A"团队环境

微服务A(SNAPSHOT)
微服务B(STABLE)
微服务C(STABLE)

"微服务B"团队环境

微服务A(稳定)
微服务B(SNAPSHOT)
微服务C(STABLE)

"微服务C"团队环境

微服务A(稳定)
微服务B(STABLE)
微服务C(SNAPSHOT)

质量保证/分期/生产

微服务A(稳定,释放等)
微服务B(稳定,释放等)
微服务C(稳定,释放等)

这是很多部署,但这个问题可以通过持续集成服务器解决,也许像Chef/Puppet /等.在真正困难的部分是每个微服务需要特别在它部署的每个地方某些环境中的数据.

例如,在"A"团队环境中,"A"需要一个地址和一组凭据才能与"B"交互.然而,随着在"B"团队环境,即 "A"的使用,必须有不同的地址和凭据进行交互的是 "B"的部署.

此外,随着您越来越接近生产,像这样的环境配置信息可能需要安全限制(即只有某些人能够修改甚至查看它).

那么,使用微服务架构,如何维护特定于环境的配置信息并使其可供应用程序使用？我想到了一些方法,尽管它们都有问题:

让构建服务器在构建时将它们烘焙到应用程序中 - 我想您可以创建每个环境属性文件或脚本的存储库,并让每个微服务的构建过程伸出并拉入相应的脚本(您也可以有一个单独的,有限的访问回购生产的东西).但是,你需要大量的脚本.对于可以部署微服务的每个地方的每个微服务,基本上都是单独的.
将它们烘焙到每个环境的基础Docker镜像中 - 如果构建服务器将您的微服务应用程序放入Docker容器中作为构建过程的最后一步,那么您可以为每个环境创建自定义基本映像.基本映像将包含一个shell脚本,用于设置所需的所有环境变量.您的Dockerfile将设置为在启动应用程序之前调用此脚本.这与上一个要点有类似的挑战,因为现在你正在管理大量的Docker镜像.
在运行时从某种注册表中提取环境信息 - 最后,您可以将每个环境的配置存储在Apache ZooKeeper(甚至只是一个普通的数据库)之类的内容中,并让您的应用程序代码在运行时将其拉入它启动了.每个微服务应用程序都需要一种方法来告诉它所处的环境(例如启动参数),以便它知道从注册表中获取哪组变量.这种方法的优点是,现在您可以从团队环境到生产一直使用完全相同的构建工件(即应用程序或Docker容器).另一方面,您现在将拥有另一个运行时依赖项,并且您仍然必须管理注册表中的所有数据.

人们如何在微服务架构中解决这个问题？听起来这似乎是常见的事情.

Answer 1

dri*_*her 29

Overview

Long post!

ENTRYPOINT is your friend
Building Microservices by Sam Newman is great
Inter-service security tip: 2-way TLS may work, but may present latency issues
I will get into a real example from my team. We could not use a configuration server, and things have gotten ... interesting. Manageable for now. But may not scale as the company has more services.
Configuration servers seem like a better idea

Update: Almost two years later, we might move to Kubernetes, and start using the etcd-powered ConfigMaps feature that ships with it. I'll mention this again in the configuration servers section. The post could still be worthwhile reading if you are interested in these subjects. We'll still be using ENTRYPOINT and some of the same concepts, just some different tools.

`ENTRYPOINT`

I suggest that ENTRYPOINT is the key to managing environment-specific configuration for your Docker containers.

In short: create a script to bootstrap your service before starting, and use ENTRYPOINT to execute this script.

I will go into detail contextualizing this, and also explain how how we do this without a configuration server. It gets a bit deep, but it's not unmanageable. Then, I end with details on configuration servers, a better solution for many teams.

Building Microservices

You're right that these are common concerns, but there just aren't one-size-fits-all solutions. The most general solution is a configuration server. (The most general but still not one-size-fits-all.) But perhaps you cannot use one of these: we were barred from using a configuration server by the Security team.

I strongly recommend reading Building Microservices by Sam Newman, if you haven't yet. It examines all the common challenges and discusses many possible solutions, while also giving helpful perspective from a seasoned architect. (Side note: don't worry about a perfect solution to your configuration management; start with a "good enough" solution for your current set of microservices and environments. You can iterate and improve, so you should try to get useful software to your customers ASAP, then improve in subsequent releases.)

Cautionary tale?

Rereading this again ... I cringe a little at how much it takes to explain this fully. From the Zen of Python:

If the implementation is hard to explain, it's a bad idea.
If the implementation is easy to explain, it may be a good idea.

Run Code Online (Sandbox Code Playgroud)

我对我们的解决方案并不感到兴奋.然而,这是一个可行的解决方案,因为我们无法使用配置服务器.这也是一个现实世界的例子.

如果你读它并思考,"哦,上帝不,我为什么要这么做!" 那么你知道,你需要仔细研究配置服务器.

服务间安全性

您似乎也关注不同的微服务如何相互验证.

对于与此身份验证相关的工件和配置......将它们视为任何其他配置工件.

What are your requirements around inter-service security? In your post, it sounds like you're describing app-tier, username/password authentication. Maybe that makes sense for the services you have in mind. But you should also consider Two-Way TLS: "this configuration requires the client to provide their certificate to the server, in addition to the server providing their's to the client." Generating and managing these certificates can get complicated ... but however you choose to do it, you'll shuffle around the config/artifacts like any other config/artifacts.

Note that 2-way TLS may introduce latency issues at high volumes. We're not there yet. We are using other measures besides 2-way TLS and we may ditch 2-way TLS once those are proven out, over time.

Real-world example from my team

My current team is doing something that combines two of the approaches you mentioned (paraphrased):

Bake configuration at build-time
Pull configuration at run-time

My team is using Spring Boot. Spring Boot has really complex Externalized Configuration with a "profiles" system. Spring Boot's configuration handling is complex and powerful, with all the pros/cons that go with that (won't get into that here).

While this is out-of-the-box with Spring Boot, the ideas are general. I prefer Dropwizard for Java microservices, or Flask in Python; in both of those cases, you could do similar thing to what Spring Boot has going on ... You'll just have to do more things yourself. Good and bad: These nimble little frameworks are more flexible than Spring, but when you're writing more code and doing more integrations, there's more responsibility on YOU to QA and test your complex/flexible config support.

I'll continue with the Spring Boot example because of first-hand experience, but not because I'm recommending it! Use what is right for your team.

In the case of Spring Boot, you can activate multiple profiles at a time. That means you can have a base configuration, then override with more specific configuration. We keep a base configuration, application.yml in src/main/resources. So, this config is packaged with the shippable JAR, and when the JAR is executed this config is always picked up. Therefore we include all default settings (common to all environments) in this file. Example: the configuration block that says, "Embedded Tomcat, always use TLS with these cipher suites enabled." (server.ssl.ciphers)

When just one or two variables needs to be overwritten for a certain environment, we leverage Spring Boot's support for getting configuration from environment variables. Example: we set the URL to our Service Discovery using an environment variable. This overrides any default in the shipped/pulled configuration files. Another example: we use an environment variable SPRING_PROFILES_ACTIVE to specify which Configuration Profiles are active.

We also want to make sure master contains a tested, working config for development environments. src/main/resources/application.yml has sane defaults. In addition we put dev-only config in config/application-dev.yml, and check that in. The config directory is picked up easily, but not shipped in the JAR. Nice feature. Developers know (from the README and other documentation) that in a dev environment, all of our Spring Boot microservices require the dev profile to be activated.

For environments besides dev, you can probably already see some options... Any one of these options could do (almost) everything you need. You can mix and match as you need. These options have overlap with some ideas you mention in your original post.

Maintain environment-specific profiles like application-stage.yml, application-prod.yml, and so on, that override settings with deviations from defaults (in a very heavily-locked-down git repository)
Maintain modular, vendor-specific profiles like application-aws.yml, application-mycloudvendor.yml (where you store this will depend on whether it contains secrets). These may contain values that cut across stage, prod, etc.
Use environment variables to override any relevant settings, at runtime; including picking profile(s) from 1 and 2
Use automation to bake in hardcoded values (templates) at build or deployment time (output into a heavily-locked down repository of some sort, possibly distinct from (1)'s repository)

(1), (2), and (3) work well together. We are happily doing all three and it's actually pretty easy to document, reason about, and maintain (after getting the initial hang of it).

You said ...

I suppose you could create a repo of per-environment properties files or script [...] You would need a ton of scripts, though.

It can be manageable. The scripts that pull or bake-in config: these can be uniform across all services. Maybe the script is copied when somebody clones your microservice template (btw: you should have an official microservice template!). Or maybe it's a Python script on an internal PyPI server. More on this after we talk about Docker.

Since Spring Boot has such good support for (3), and support for using defaults/templating in YML files, you may not need need (4). But here's where things get very specific to your organization. The Security Engineer on our team wanted us to use (4) to bake in some specific values for environments beyond dev: passwords. This Engineer didn't want the passwords "floating around" in environment variables, mainly because then -- who would set them? The Docker caller? AWS ECS Task Definition (viewable through AWS web UI)? In those cases, the passwords could be exposed to automation engineers, who wouldn't necessarily have access to the "locked-down git repository" containing application-prod.yml. (4) might not be needed if you do (1); you could just keep the passwords, hardcoded, in the tightly-controlled repository. But maybe there are secrets to generate at deployment-automation time, that you don't want in the same repository as (1). This is our case.

More on (2): we use an aws profile and Spring Boot's "configuration as code" to make a startup-time call to get AWS metadata, and override some config based on that. Our AWS ECS Task Definitions activate the aws profile. The Spring Cloud Netflix documentation gives an example like this:

@Bean
@Profile("aws")
public EurekaInstanceConfigBean eurekaInstanceConfig() {
  EurekaInstanceConfigBean b = new EurekaInstanceConfigBean();
  AmazonInfo info = AmazonInfo.Builder.newBuilder().autoBuild("eureka");
  b.setDataCenterInfo(info);
  return b;
}

Run Code Online (Sandbox Code Playgroud)

Next, Docker. Environment Variables are a very good way to pass in configuration arguments in Docker. We don't use any command-line or positional arguments because of some gotchas we encountered with ENTRYPOINT. It's easy to pass --env SPRING_PROFILES_ACTIVE=dev or --env SPRING_PROFILES_ACTIVE=aws,prod ... whether from command-line, or from a supervisor/scheduler such as AWS ECS or Apache Mesosphere/Marathon. Our entrypoint.sh also facilitates passing JVM flags that have nothing to do with Spring: we use the common JAVA_OPTS convention for this.

(Oh, I should mention ... we also use Gradle for our builds. At the moment ... We wrap docker build, docker run, and docker push with Gradle tasks. Our Dockerfile is templated, so again, option #4 from above. We have variables like @agentJar@ that get overrwritten at build time. I really don't like this, and I think this could be better handled with plain old configuration (-Dagent.jar.property.whatever). This will probably go way. But I'm just mentioning it for completeness. Something I am happy about with this: nothing is done in the build, Dockerfile, or entrypoint.sh script, that is coupled tightly to a certain deployment context (such as AWS). All of it works in dev environments as well as deployed environments. So we don't have to deploy the Docker image to test it: it's portable, as it should be.)

We have a folder src/main/docker containing the Dockerfile and entrypoint.sh (the script called by ENTRYPOINT; this is baked into the Dockerfile). Our Dockerfile and entrypoint.sh are nearly completely uniform across all microservices. These are duplicated when you clone our microservice template. Unfortunately, sometimes you have to copy/paste updates. We haven't found a good way around this yet, but it's not terribly painful.

The Dockerfile does the following (build-time):

Derives from our "golden" base Dockerfile for Java applications
Grabs our tool for pulling configuration. (Grabs from an internal server available to any dev or Jenkins machine doing a build.) (You could also just use Linux tools like wget as well as DNS/convention-based naming for where to get it. You could also use AWS S3 and convention-based naming.)
Copy some things into the Dockerfile, like the JAR, entrypoint.sh...
ENTRYPOINT exec /app/entrypoint.sh

The entrypoint.sh does the following (run-time):

Uses our tool to pull configuration. (Some logic to understand that if aws profile is not active, the aws config file is not expected.) Dies immediately and loudly if there are any issues.
exec java $JAVA_OPTS -jar /app/app.jar (picks up all the properties files, environment variables, etc.)

So we've covered that at application startup time, configuration is pulled from somewhere ... but where? To points from earlier, they could be in a git repository. You could pull down all profiles then use SPRING_PROFILES_ACTIVE to say which are active; but then you might pull down application-prod.yml onto a stage machine (not good). So instead, you could look at SPRING_PROFILES_ACTIVE (in your configuration-puller logic), and pull only what is needed.

If you are using AWS, you could use S3 repository/ies instead of a git repository. This may allow for better access control. Instead of an application-prod.yml and application-stage.yml living in the same repo/bucket, you could make it so that application-envspecific.yml always has the required configuration, in the S3 bucket by some conventional name in the given AWS account. i.e. "Get the config from s3://ecs_config/$ENV_NAME/application-envspecific.yml" (where $ENV_NAME comes from entrypoint.sh script or ECS Task Definition).

I mentioned that the Dockerfile works portably, and isn't coupled to certain deployment contexts. That is because entrypoint.sh is defined to check for config files in a flexible way; it just wants the config files. So if you use Docker's --volume option to mount a folder with config, the script will be happy, and it won't try to pull anything from an external server.

I won't get into the deployment automation much ... but just mention quickly that we use terraform, boto3, and some custom Python wrapping code. jinja2 for templating (baking in those couple values that need to be baked in).

Here's serious limitation of this approach: the microservice process has to be killed/restarted to re-download and reload config. Now, with a cluster of stateless services, this does not necessarily represent downtime (given some things, like client-side load-balancing, Ribbon configured for retries, and horizontal scale so some instances are always running in the pool). So far it is working out, but the microservices still have pretty low load. Growth is coming. We shall see.

There are many more ways to solve these challenges. Hopefully this exercise has got you thinking about what will work for your team. Just try to get some things going. Prototype rapidly and you'll shake out the details as you go.

Perhaps better: configuration servers

I think this is a more common solution: Configuration Servers. You mentioned ZooKeeper. There's also Consul. Both ZooKeeper and Consul offer both Configuration Management and Service Discovery. There's also etcd.

In our case, the Security team wasn't comfortable with a centralized Configuration Management server. We decided to use NetflixOSS's Eureka for Service Discovery, but hold off on a Configuration Server. If we wind up disliking the methods above, we may switch to Archaius for Configuration Management. Spring Cloud Netflix aims to make these integrations easy for Spring Boot users. Though I think it wants you to use Spring Cloud Config (Server/Client) instead of Archaius. Haven't tried it yet.

Configuration servers seem much easier to explain and think about. If you can, you should start off with a configuration server.

If the implementation is hard to explain, it's a bad idea.
If the implementation is easy to explain, it may be a good idea.

Run Code Online (Sandbox Code Playgroud)

Comparisons of configuration servers

If you decide to try a config server, you'll need to do a research spike. Here are some good resources to start you off:

Service Discovery: Zookeeper vs etcd vs Consul
Aphyr's write-up on consistency in etcd and Consul (the particular problems identified have now been fixed; but it's worth reading and thinking about)

If you try Consul, you should watch this talk, "Operating Consul as an Early Adopter". Even if you try something else besides Consul, the talk has nuggets of advice and insight for you.

16/05/11 EDIT: The ThoughtWorks Technology Radar has now moved Consul into the "Adopt" category (history of their evaluation is here).

17/06/01 EDIT: We are considering moving to Kubernetes for multiple reasons. If we do we will leverage the etcd-powered ConfigMaps feature that ships with K8S. That's all for now on this subject :-)

More resources

My favorite one-stop shop on microservices: Martin Fowler's "Microservices Resource Guide"
If you don't want to buy/read all of Building Microservices yet, there's

Answer 2

Jen*_*ens 2

Docker compose 支持扩展 compose 文件，这对于覆盖配置的特定部分非常有用。

这至少对于开发环境非常有用，并且在小型部署中也可能很有用。

这个想法是拥有一个基本的共享撰写文件，您可以为不同的团队或环境覆盖。

您可以将其与具有不同设置的环境变量结合起来。

如果您想替换简单的值，环境变量是很好的选择，如果您需要进行更复杂的更改，则可以使用扩展文件。

例如，您可以拥有一个如下所示的基本撰写文件：

# docker-compose.yml
version: '3.3'
services:
  service-a:
    image: "image-name-a"
    ports:
     - "${PORT_A}"
  service-b:
    image: "image-name-b"
    ports:
     - "${PORT_B}"
  service-c:
    image: "image-name-c"
    ports:
     - "${PORT_C}"

Run Code Online (Sandbox Code Playgroud)

如果您想更改端口，您只需为变量传递不同的值即可PORT_X。

对于复杂的更改，您可以使用单独的文件来覆盖撰写文件的特定部分。您可以覆盖特定服务的特定参数，任何参数都可以被覆盖。

例如，您可以使用不同的映像为服务 A 提供一个覆盖文件，并添加一个用于开发的卷：

# docker-compose.override.yml
services:
  service-a:
    image: "image-alternative-a"
    volumes:
      - /my-dev-data:/var/lib/service-a/data

Run Code Online (Sandbox Code Playgroud)

docker-compose.ymlDocker compose 会默认选择docker-compose.override.yml，如果您有更多文件或具有不同名称的文件，则需要按顺序指定它们：

docker-compose -f docker-compose.yml -f docker-compose.dev.yml -f docker-compose.dev-service-a.yml up -d

Run Code Online (Sandbox Code Playgroud)

对于更复杂的环境，解决方案将取决于您使用的内容，我知道这是一个 docker 问题，但现在很难找到纯粹的 docker 系统，因为大多数人都使用 Kubernetes。在任何情况下，您总是会拥有环境提供的某种秘密管理并在外部进行管理，然后从 docker 方面来看，您只拥有该环境提供的变量。

归档时间：	10 年，4 月前
查看次数：	7852 次
最近记录：	8 年，4 月前