GCE - 没有节点的stackdriver内存指标

wir*_*tsi 5 google-compute-engine kubernetes stackdriver

我在GCE上设置了我的Kubernetes 1.3.4集群

export KUBE_ENABLE_CLUSTER_MONITORING=google

这很好用,我得到了应用程序日志(出于某种原因,在容器引擎部分,但很好)以及pod和节点指标.

唯一缺少的是节点内存指标,只显示CPU(见截图)

没有内存指标

在heapster日志中,我看到很多这样的线条

{
 metadata: {
  severity: "ERROR"    
  projectId: "<project-id>"    
  serviceName: "container.googleapis.com"    
  zone: "europe-west1-d"    
  labels: {
   container.googleapis.com/cluster_name: "production"     
   compute.googleapis.com/resource_type: "instance"     
   compute.googleapis.com/resource_name: "fluentd-cloud-logging-production-minion-group-p0w8"     
   container.googleapis.com/instance_id: "6772154497331326454"     
   container.googleapis.com/pod_name: "heapster-v1.1.0-2102007506-23b3e"     
   compute.googleapis.com/resource_id: "6772154497331326454"     
   container.googleapis.com/stream: "stderr"     
   container.googleapis.com/namespace_name: "kube-system"     
   container.googleapis.com/container_name: "heapster"     
  }
  timestamp: "2016-09-13T14:40:08.000Z"    
  projectNumber: "930564692351"    
 }
 textPayload: "E0913 14:40:08.665035       1 gcm.go:179] Error while sending request to GCM googleapi: Error 400: Timeseries 76, point: start is not older than end, for a cumulative metric, invalidParameter
"   
 insertId: "pt5bo7g132r266"   
 log: "heapster"   
}
Run Code Online (Sandbox Code Playgroud)

不确定这是否相关.

有任何想法吗?

mig*_*o85 1

如果您使用 GCE 而不是 GKE 运行集群,您应该安装stackdriver 代理并验证代理用于与 stackdriver链接通信的凭据

如果您使用的是 Linux,则可以通过执行以下命令来安装代理:

curl -sSO https://dl.google.com/cloudagents/install-monitoring-agent.sh
sudo bash install-monitoring-agent.sh
Run Code Online (Sandbox Code Playgroud)

您可以运行以下命令检查您的凭据:

sudo cat $GOOGLE_APPLICATION_CREDENTIALS
sudo cat /etc/google/auth/application_default_credentials.json
Run Code Online (Sandbox Code Playgroud)