容器狂占cpu怎么办？

简介

简介
如何感知某个项目占用了过多的cpu
cpu 隔离在不同系统的不同概念
- CPU Resources in Docker, Mesos and Marathon
如何避免系统被应用拖垮

笔者曾经碰到一个现象，物理机load average 达到120，第一号进程的%CPU 达到了 1091，比第二名大了50倍，导致整个服务器非常卡，本文尝试分析和解决这个问题

如何感知某个项目占用了过多的cpu

进程cpu耗费如何计算

The bandwidth allowed for a group（进程所属的组） is specified using a quota and period. Within each given “period” (microseconds), a group is allowed to consume only up to “quota” microseconds of CPU time. When the CPU bandwidth consumption of a group exceeds this limit (for that period), the tasks belonging to its hierarchy will be throttled and are not allowed to run again until the next period. 有几个点

cpu 不像内存一样有明确的大小单位，单个cpu 是独占的，只能以cpu 时间片来衡量。
进程耗费的限制方式：在period（毫秒/微秒）内该进程只能占用 quota （毫秒/微秒）。PS：内存隔离是申请内存的时候判断判断已申请内存有没有超过阈值。cpu 隔离则是判断period周期内，已耗费时间有没有超过 quota。PS：频控、限流等很多系统也是类似思想
period 指的是一个判断周期，quota 表示一个周期内可用的多个cpu的时间和。所以quota 可以超过period ，比如period=100 and quota=200，表示在100单位时间里，进程要使用cpu 200单位，需要两个cpu 各自执行100单位
每次拿cpu 说事儿得提两个值（period 和 quota）有点麻烦，可以通过进程消耗的 CPU 时间片quota来统计出进程占用 CPU 的百分比。这也是我们看到的各种工具中都使用百分比来说明 CPU 使用率的原因（下文多出有体现）。

linux

top 命令输出

top - 18:31:39 up 158 days,  4:45,  2 users,  load average: 2.63, 3.48, 3.53
Tasks: 260 total,   2 running, 258 sleeping,   0 stopped,   0 zombie
%Cpu(s): 38.1 us,  4.2 sy,  0.0 ni, 53.5 id,  2.3 wa,  0.0 hi,  1.9 si,  0.0 st
KiB Mem : 16255048 total,   238808 free,  7608872 used,  8407368 buff/cache
KiB Swap: 33554428 total, 31798304 free,  1756124 used.  7313144 avail Mem

PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND
32080 root      20   0 8300552 4.125g  11524 S  86.4 26.6   1157:05 java
995 root      20   0  641260  41312  39196 S  28.6  0.3   7420:54 rsyslogd

top 命令找到%CPU 排位最高的进程id=32080，根据docker ps -q | xargs docker inspect -f ' '| grep 32080 找到对应的容器

docker

docker stats returns a live data stream for running containers.

docker stats 命令输出

CONTAINER ID        NAME                                         CPU %               MEM USAGE / LIMIT     MEM %               NET I/O             BLOCK I/O           PIDS
4aeb15578094        mesos-19ba2ecd-7a98-4e92-beed-c132b063578d   0.21%               376.4MiB / 1GiB       36.75%              3.65MB / 1.68MB     119kB / 90.1kB      174
77747f26dff4        mesos-e3d34892-8af6-4ab7-a649-0a4b424ccd04   0.31%               752.5MiB / 800MiB     94.06%              11.9MB / 3.06MB     86.1MB / 47MB       132
d64e482d2843        mesos-705b5dc6-7169-42e8-a143-6a7dc2e32600   0.18%               680.5MiB / 800MiB     85.06%              43.1MB / 17.1MB     194MB / 228MB       196
808a4bd888fb        mesos-65c9d5a6-3967-4a4a-9834-b83ae8c033be   1.81%               1.45GiB / 2GiB        72.50%              1.32GB / 1.83GB     8.36MB / 19.8MB     2392

CPU % 体现了 quota/period 值
MEM USAGE / LIMIT 反映了内存占用
NET I/O 反映了进出带宽
BLOCK I/O 反映了磁盘带宽，The amount of data the container has read to and written from block devices on the host ，貌似是一个累计值，但可以部分反映项目对磁盘的写程度，有助于解决容器狂打日志怎么办？
PID 反映了对应的进程号，也列出了进程id 与容器id的关系。根据pid 查询容器 id docker stats --no-stream | grep 1169

cpu 隔离在不同系统的不同概念

	示例	备注
–cpu-period	和cpu-quota结合使用
–cpu-quota	和cpu-period结合使用
–cpus	1	表示quota/period=1	貌似是docker 提出的概念
–cpuset-cpus	`--cpuset-cpus="1,3"`
–cpu-share	默认为1024	绝对值没有意义，只有和另一个进程对比起来才有意义

现在的多核系统中每个核心都有自己的缓存，如果频繁的调度进程在不同的核心上执行势必会带来缓存失效等开销。–cpuset-cpus 可以让容器始终在一个或某几个 CPU 上运行。–cpuset-cpus 选项的一个缺点是必须指定 CPU 在操作系统中的编号，这对于动态调度的环境(无法预测容器会在哪些主机上运行，只能通过程序动态的检测系统中的 CPU 编号，并生成 docker run 命令)会带来一些不便。

CPU Resources in Docker, Mesos and Marathon

CPU Resources in Docker, Mesos and Marathon 几个要点（假设物理机有8个核心）

即便marathon cpus 配置为0.1 ，容器依然可以访问物理机所有的8个核心，即cpus和物理机核心没有对应关系
mesos convert the cpus value into a value for Docker’s --cpu-shares setting,which according to the Docker documentation is just a priority weight for that process relative to all others on the machine. An application run with cpus=2 should receive twice the priority as one using cpus=1.
This is another effect that the cpus parameter has: it specifies the CPU capacity used up by the application.Maybe cpu-capacity or cpu-weight would be more descriptive
因为cpus 还作为CPU capacity 的描述载体，它以cpu 核心数作为上限。你如何配置marathon app 的cpus，完全取决于你打算让一个机器跑多少app，以及它相对其它app的重要性。

如何避免系统被应用拖垮

监控报警，先感知到再说
当发现cpu-share 不足以约束应用时，改用 docker 本身的cpus 概念进一步限制

技术

生活

标签

Docker 30

WEB 10

Data 21

Java 53

Linux 14

Tool 9

Algorithm 9

Go 6

CoreOS 4

Kubernetes 20

Other 13

Network 19

Python 1

Netty 16

Spring 19

Mesos 6

Distribute 8

Scala 6

Basic 2

Life 7

Ops 4

MQ 2

eureka 2

Apollo 2

OAuth2 1

MySql 1

简介