错误:未安装 cgroup 命名空间“freezer”。中止

8 hpc slurm

尝试运行 slurmd:

\n
sudo systemctl start slurmd\n
Run Code Online (Sandbox Code Playgroud)\n

我显示守护进程的状态,屏幕上显示错误:

\n
>>sudo systemctl status slurmd\n\xe2\x97\x8f slurmd.service - Slurm node daemon\n   Loaded: loaded (/lib/systemd/system/slurmd.service; enabled; vendor preset: enabled)\n   Active: failed (Result: exit-code) since Mon 2020-06-29 18:13:06 MSK; 2s ago\n     Docs: man:slurmd(8)\n  Process: 13402 ExecStart=/usr/sbin/slurmd $SLURMD_OPTIONS (code=exited, status=1/FAILURE)\n\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm systemd[1]: Starting Slurm node daemon...\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm slurmd-ecm[13402]: Message aggregation disabled\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm slurmd-ecm[13402]: error: cgroup namespace 'freezer' not mounted. aborting\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm slurmd-ecm[13402]: error: unable to create freezer cgroup namespace\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm slurmd-ecm[13402]: error: Couldn't load specified plugin name for proctrack/cgroup: Plugin init() callback failed\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm slurmd-ecm[13402]: error: cannot create proctrack context for proctrack/cgroup\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm systemd[1]: slurmd.service: Control process exited, code=exited, status=1/FAILURE\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm slurmd-ecm[13402]: error: slurmd initialization failed\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm systemd[1]: slurmd.service: Failed with result 'exit-code'.\n\xd0\xb8\xd1\x8e\xd0\xbd 29 18:13:06 ecm systemd[1]: Failed to start Slurm node daemon.\n
Run Code Online (Sandbox Code Playgroud)\n

我不知道如何解决它。我希望得到你的帮助。我使用 slurm 版本 18.08.05 和 debian 10。

\n

UPD。\n我将 slurm.config 中的 ProctrackType 值更改为 proctrack/linuxproc:

\n
ProctrackType=proctrack/linuxproc\n
Run Code Online (Sandbox Code Playgroud)\n

一切都是工作。

\n

小智 5

与文档(man cgroup.conf)不同,参数CgroupMountpoint的默认值不好。

echo CgroupMountpoint=/sys/fs/cgroup >> /etc/slurm-llnl/cgroup.conf

并且您可以重置 ProctrackType 的值。在Debian10.7 slurmd版本上测试:slurm-wlm 18.08.5-2