How do I statically link Arrow when building parquet-cpp?

Mar*_*cok 2 c++ makefile cmake parquet

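The parquet-cpp home page says:

By default, Parquet links to Arrow's shared libraries. If you wish to statically link the Arrow symbols instead, pass -DPARQUET_ARROW_LINKAGE=static.

I do want to link Arrow statically, because I want to use my program on other servers where Arrow is not installed. I tried -DPARQUET_ARROW_LINKAGE=static, but I get an error about "missing transitive dependencies":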

# cmake -DPARQUET_BUILD_TESTS=Off -DCMAKE_BUILD_TYPE=Release -DPARQUET_MINIMAL_DEPENDENCY=ON -DPARQUET_ARROW_LINKAGE=static .
-- The C compiler identification is GNU 4.8.5
...
-- [ /usr/local/share/cmake-3.9/Modules/FindBoost.cmake:1717 ] Boost_FOUND = 1
-- Boost version: 1.55.0
...
-- THRIFT_HOME:
-- Thrift compiler/libraries NOT found:  (THRIFT_INCLUDE_DIR-NOTFOUND, THRIFT_STATIC_LIB-NOTFOUND). Looked in system search paths.
-- Thrift include dir: /root/tmp/parquet-cpp-master/thrift_ep/src/thrift_ep-install/include
-- Thrift static library: /root/tmp/parquet-cpp-master/thrift_ep/src/thrift_ep-install/lib/libthrift.a
-- Thrift compiler: /root/tmp/parquet-cpp-master/thrift_ep/src/thrift_ep-install/bin/thrift
-- Checking for module 'arrow'
--   No package 'arrow' found
-- Could not find the Arrow library. Looked for headers in , and for libs in
-- Building Apache Arrow from commit: 501d60e918bd4d10c429ab34e0b8e8a87dffb732
-- CMAKE_CXX_FLAGS:  -O3 -DNDEBUG  -Wall -std=c++11
-- Found cpplint executable at /root/tmp/parquet-cpp-master/build-support/cpplint.py
CMake Error at CMakeLists.txt:515 (message):
  Missing transitive dependencies for Arrow static linking

So I tracked down the code in CMakeLists.txt that raises the error:

  if (NOT DEFINED ENV{BROTLI_STATIC_LIB_ENC} OR
      NOT DEFINED ENV{BROTLI_STATIC_LIB_DEC} OR
      NOT DEFINED ENV{BROTLI_STATIC_LIB_COMMON} OR
      NOT DEFINED ENV{SNAPPY_STATIC_LIB} OR
      NOT DEFINED ENV{ZLIB_STATIC_LIB} OR
      NOT DEFINED ENV{LZ4_STATIC_LIB} OR
      NOT DEFINED ENV{ZSTD_STATIC_LIB})
    message(FATAL_ERROR "Missing transitive dependencies for Arrow static linking")

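But this does not really help me, because I do not know what values these environment variables should have.

Do I need to compile and install Arrow myself first? (I was hoping parquet-cpp would do that for me.)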

p-a*_*l-o 5

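I put together a script that downloads the dependency sources, builds them, sets the environment variables, and runs your cmake line at the end. Just change the value of the DEPDIR variable to a directory of your choice.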

#!/bin/bash

CMKDIR=$PWD
DEPDIR=/tmp

cd $DEPDIR

#snappy
git clone https://github.com/google/snappy.git
cd snappy
mkdir build 
cd build 
cmake ..
make

export SNAPPY_STATIC_LIB=$DEPDIR/snappy/build/libsnappy.a

cd $DEPDIR

#brotli
git clone https://github.com/google/brotli.git
cd brotli
mkdir out
cd out
../configure-cmake
make

export BROTLI_STATIC_LIB_ENC=$DEPDIR/brotli/out/libbrotlienc-static.a
export BROTLI_STATIC_LIB_DEC=$DEPDIR/brotli/out/libbrotlidec-static.a
export BROTLI_STATIC_LIB_COMMON=$DEPDIR/brotli/out/libbrotlicommon-static.a

cd $DEPDIR

#zlib
git clone https://github.com/madler/zlib.git
cd zlib
./configure
make

export ZLIB_STATIC_LIB=$DEPDIR/zlib/libz.a

cd $DEPDIR

#lz4
git clone https://github.com/lz4/lz4.git
cd lz4
make

export LZ4_STATIC_LIB=$DEPDIR/lz4/lib/liblz4.a

cd $DEPDIR

#zstd
git clone https://github.com/facebook/zstd.git
cd zstd
make

export ZSTD_STATIC_LIB=$DEPDIR/zstd/lib/libzstd.a

cd $CMKDIR

cmake -DPARQUET_BUILD_TESTS=Off -DCMAKE_BUILD_TYPE=Release -DPARQUET_MINIMAL_DEPENDENCY=ON -DPARQUET_ARROW_LINKAGE=static .

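The script is very simple, but it should work. Copy it into a new file in the same directory as CMakeLists.txt, give the file execute permission (e.g. chmod +x filename.sh), and run it like this: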

./filename.sh 
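Since the CMakeLists.txt check quoted in the question only passes when all seven variables are present in cmake's environment, it can help to verify them before the final cmake line runs. The following is a minimal sketch, not part of parquet-cpp: the function name check_static_libs and the REQUIRED_VARS list are hypothetical helpers, and the variable names are copied from the quoted check.

```shell
#!/bin/bash
# Variable names taken from the CMakeLists.txt check quoted in the question.
REQUIRED_VARS="BROTLI_STATIC_LIB_ENC BROTLI_STATIC_LIB_DEC BROTLI_STATIC_LIB_COMMON SNAPPY_STATIC_LIB ZLIB_STATIC_LIB LZ4_STATIC_LIB ZSTD_STATIC_LIB"

# Hypothetical helper: returns 0 only if every variable is exported
# and names an existing file; otherwise reports each problem on stderr.
check_static_libs() {
    local name path missing=0
    for name in $REQUIRED_VARS; do
        # printenv only sees exported variables, which is what cmake will see.
        path=$(printenv "$name")
        if [ -z "$path" ]; then
            echo "$name is not exported" >&2
            missing=1
        elif [ ! -f "$path" ]; then
            echo "$name points to a missing file: $path" >&2
            missing=1
        fi
    done
    return $missing
}

# Possible use at the end of the build script, just before cmake:
#   check_static_libs || exit 1
```

A failed build step in the script above leaves the matching .a file missing, so this check reports the real cause instead of the generic "Missing transitive dependencies" message.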

Regarding the -fPIC issue, you have to edit a few files:

snappy: add this line near the top of CMakeLists.txt, right after the first two lines:

set(CMAKE_POSITION_INDEPENDENT_CODE ON)

lz4 and zstd: edit the Makefile in the lib subdirectory. After this line:

CFLAGS  += $(DEBUGFLAGS) $(MOREFLAGS)

add this line:

CFLAGS += -fPIC

zlib: edit the Makefile. After this line:

CFLAGS=-O3 -D_LARGEFILE64_SOURCE=1 -DHAVE_HIDDEN

add this line:

CFLAGS += -fPIC

brotli: as far as I can see from the make output, the option is already set.

Before running make again, execute this script:
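The Makefile edits described above can also be scripted so they are repeatable. This is only a sketch: add_fpic_after is a hypothetical helper, and it assumes the anchor line in your checkout matches the quoted line exactly, including spacing, which may differ between library versions.

```shell
#!/bin/bash
# Hypothetical helper: insert "CFLAGS += -fPIC" right after the first line
# that is exactly equal to the given anchor. Does nothing if the file
# already contains that line, so it is safe to run twice.
add_fpic_after() {
    local makefile=$1 anchor=$2 tmp
    if grep -qxF 'CFLAGS += -fPIC' "$makefile"; then
        return 0   # already patched
    fi
    if ! grep -qxF -- "$anchor" "$makefile"; then
        echo "anchor line not found in $makefile" >&2
        return 1
    fi
    tmp=$(mktemp) || return 1
    # Copy every line; emit the extra line once, after the first match.
    awk -v anchor="$anchor" '
        { print }
        $0 == anchor && !done { print "CFLAGS += -fPIC"; done = 1 }
    ' "$makefile" > "$tmp" && cat "$tmp" > "$makefile"
    rm -f "$tmp"
}

# Possible use, with DEPDIR as in the build script (note the single quotes,
# which keep the shell from expanding the $(...) parts of the anchor):
#   add_fpic_after "$DEPDIR/lz4/lib/Makefile"  'CFLAGS  += $(DEBUGFLAGS) $(MOREFLAGS)'
#   add_fpic_after "$DEPDIR/zstd/lib/Makefile" 'CFLAGS  += $(DEBUGFLAGS) $(MOREFLAGS)'
#   add_fpic_after "$DEPDIR/zlib/Makefile"     'CFLAGS=-O3 -D_LARGEFILE64_SOURCE=1 -DHAVE_HIDDEN'
```

If the helper reports that the anchor was not found, open the Makefile and make the edit by hand as described above.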

#!/bin/bash

DEPDIR=/tmp

cd $DEPDIR/snappy/build
cmake ..
make clean
make

cd $DEPDIR/lz4
make clean
make

cd $DEPDIR/zstd
make clean
make

# zlib's Makefile was edited as well, so rebuild it too
cd $DEPDIR/zlib
make clean
make