Rmpi无法以非root用户身份加载共享库

Rwo*_*ems 8 r shared-libraries mpi

我遇到了Rmpi的问题,我尝试加载它,我收到此错误消息:

> library('Rmpi')
Error in dyn.load(file, DLLpath = DLLpath, ...) :
  unable to load shared library '/usr/lib64/R/library/Rmpi/libs/Rmpi.so':
  libmpi.so.0: cannot open shared object file: No such file or directory
In addition: Warning message:
.Last.lib failed in detach() for 'Rmpi', details:
  call: dyn.unload(file.path(libpath, "libs", paste("Rmpi", .Platform$dynlib.ext,
  error: dynamic/shared library '/usr/lib64/R/library/Rmpi/libs/Rmpi.so' was not loaded
Error in library("Rmpi") : .First.lib failed for 'Rmpi'
Run Code Online (Sandbox Code Playgroud)

但是,当我以root用户身份登录时,不会发生此错误.

它似乎不是权限问题.我检查了libmpi.so.0的权限:

[meehan@cnl10 /]$ ll /usr/lib64/lam/lib/
total 7.4M
-rw-r--r-- 1 root root  207 May 25  2008 lam.module
-rw-r--r-- 1 root root 885K May 25  2008 liblam.a
-rw-r--r-- 1 root root 361K May 25  2008 liblamf77mpi.a
lrwxrwxrwx 1 root root   21 Apr 12  2010 liblamf77mpi.so -> liblamf77mpi.so.0.0.0
lrwxrwxrwx 1 root root   21 Apr 12  2010 liblamf77mpi.so.0 -> liblamf77mpi.so.0.0.0
-rwxr-xr-x 1 root root  73K May 25  2008 liblamf77mpi.so.0.0.0
-rw-r--r-- 1 root root 2.2M May 25  2008 liblammpi++.a
-rw-r--r-- 1 root root 509K May 25  2008 liblammpio.a
lrwxrwxrwx 1 root root   20 Apr 12  2010 liblammpi++.so -> liblammpi++.so.0.0.0
lrwxrwxrwx 1 root root   20 Apr 12  2010 liblammpi++.so.0 -> liblammpi++.so.0.0.0
-rwxr-xr-x 1 root root 167K May 25  2008 liblammpi++.so.0.0.0
lrwxrwxrwx 1 root root   15 Apr 12  2010 liblam.so -> liblam.so.0.0.0
lrwxrwxrwx 1 root root   15 Apr 12  2010 liblam.so.0 -> liblam.so.0.0.0
-rwxr-xr-x 1 root root 332K May 25  2008 liblam.so.0.0.0
-rw-r--r-- 1 root root 2.2M May 25  2008 libmpi.a
lrwxrwxrwx 1 root root   15 Apr 12  2010 libmpi.so -> libmpi.so.0.0.0
lrwxrwxrwx 1 root root   15 Apr 12  2010 libmpi.so.0 -> libmpi.so.0.0.0
-rwxr-xr-x 1 root root 655K May 25  2008 libmpi.so.0.0.0
Run Code Online (Sandbox Code Playgroud)

和Rmpi.so:

[meehan@cnl10 /]$ ll /usr/lib64/R/library/Rmpi/libs/
total 108K
-rwxr-xr-x 1 root root 104K Jan 20  2011 Rmpi.so
Run Code Online (Sandbox Code Playgroud)

无论如何,我正在运行R作为sudo.

相关系统信息:-Linux发行版:CentOS 5.5 -R版本:2.11.1(2010-05-31)-Rmpi版本:0.5-8 -MPI实现是openmpi

[meehan@cnl10 /]$  echo $LD_LIBRARY_PATH
/opt/lib:/opt/open-mpi/tcp-`gnu41/lib:/opt/intel/mkl/10.2/lib/em64t:/opt/intel/fce/11.1/lib:/opt/intel/cce/11.1/lib:`
Run Code Online (Sandbox Code Playgroud)

非常感激任何的帮助!

小智 3

这里的问题是 OpenMPI 默认情况下不会向系统链接器注册其库目录。这就是为什么一些安装指南建议您将其目录放入LD_LIBRARY_PATH变量中,以便可以在运行时找到这些库。但是,每次加载新 shell 时都必须执行“将目录添加到 LD_LIBRARY_PATH”,这就是为什么这些指南建议将其放入~/.bashrc或类似操作,以便在每次登录时恢复设置。

\n\n

但是,该~/.bashrc文件(或~/.profile或任何此类文件)是特定于用户的设置。假设在安装 openmpi 和 Rmpi​​ 等时以 root 身份登录(这似乎很可能),这意味着添加到这些特定于用户的文件只会在以 root 身份运行时设置库路径,而不是以通常的运行时用户身份运行。

\n\n

一般来说,修复方法是告诉链接器在哪里可以找到这些文件。在我自己的系统上,运行 CentOS 7、OpenMPI 1.10.0(使用 Scientific Linux RPM)、R 3.2.3 和 Rmpi​​ 0.6-5,当我无法设置库路径时会发生以下情况:

\n\n
[dchurch@workstation ~]$ R -q -e "library(\'Rmpi\')"\n> library(\'Rmpi\')\nError : .onLoad failed in loadNamespace() for \'Rmpi\', details:\n  call: dyn.load(file, DLLpath = DLLpath, ...)\n  error: unable to load shared object \'/usr/lib64/R/library/Rmpi/libs/Rmpi.so\':\n  libmpi.so.12: cannot open shared object file: No such file or directory\nError: package or namespace load failed for \xe2\x80\x98Rmpi\xe2\x80\x99\nExecution halted        \n
Run Code Online (Sandbox Code Playgroud)\n\n

如果我使用临时变量临时设置链接器路径,则它适用于此调用:

\n\n
[dchurch@workstation ~]$ LD_LIBRARY_PATH=/usr/lib64/openmpi/lib R -q -e "library(\'Rmpi\')"\n> library(\'Rmpi\')       \n>\n>\n
Run Code Online (Sandbox Code Playgroud)\n\n

但是,要使此更改永久生效,最好的方法是通过在 中创建一个新文件/etc/ld.so.conf.d并运行,将 openmpi 库目录注册到系统链接器本身ldconfig,如下所示:

\n\n
[dchurch@workstation ~]$ sudo sh -c \'echo /usr/lib64/openmpi/lib > /etc/ld.so.conf.d/openmpi.conf; ldconfig\'\n[dchurch@workstation ~]$ R -q -e "library(\'Rmpi\')"\n> library(\'Rmpi\')\n>\n>\n
Run Code Online (Sandbox Code Playgroud)\n\n

完成此操作后,无论环境变量如何,任何用户都应该能够加载 Rmpi​​。

\n