Libraries How-To: Linking Scientific Libraries
This page is a tutorial explaining how to include scientific libraries when compiling software.
There are a few common scientific libraries: LAPACK, BLAS, BLACS, scaLAPACK, FFTW, HDF5, and others. These libraries are generally highly optimized, and many scientific programs favor use of these libraries over in-house implementations of similar functionality. See our libraries overview page for more information.
Scientific libraries can be packaged together, like in the Intel Math Kernel Library (MKL), or Cray’s LibSci. They can also be built completely separately and act as standalone libraries. These libraries can be built with different MPI implementations and compiler choices.
If you’re building a code that relies on one or more of these libraries, you can choose how to include these libraries. By the end of this tutorial, how to include these libraries should be clearer. If you need help building a particular package on an NREL machine, please contact HPC help.
Makefiles, autoconf, and cmake
Build tools like make, autoconf, and cmake are convenient ways to automate the compilation of a code. If you’re building a package, you may need to modify/customize how the code compiles, e.g., so it finds and includes the libraries you want. This may involve directly modifying the makefile, modifying the make.include (or make.inc, makefile.include, etc.) file, or using tools like autoconf or CMake to configure the makefile.
Modifying a makefile (or make.include, etc.) so it compiles using the scientific libraries you want can be a daunting process. We’ll go through a prototypical example and show how different libraries can be included in the build of a program. To do this, we’ll use a makefile.include file for the electronic structure program VASP.
Note
We provide a walkthrough of linking scientific libraries using the VASP code as an example. This walkthrough tries to demonstrate key features of the general process of including scientific libraries in a build. We note that the exact build and modification process will vary between codes. Consulting the documentation of the code you’re trying to build is always the best place to start.
Walkthrough
Overview
We’ll use the VASP makefile.include file as our walkthrough example. We can find a number of VASP makefile.include files here. We’ll be looking specifically at this file.
We’ll take a look at building with Intel MKL and the HDF5 package.
Building with MKL and HDF5
We want to build with MKL and HDF5. If we look at the VASP documentation, we see that LAPACK, scaLAPACK, BLAS, and FFTW are required. MKL covers all of these needs. Thus, we need to tell the makefile where to look for MKL.
Environment Preparation
We need our MKL to be built with the same compilers and MPI implementation as we’re building VASP with. Let’s see what sorts of MKL builds are available to us. Using the following command to show what builds of mkl are available as a module:
module avail 2>&1 | grep mkl
Yields the output:
intel-oneapi-mkl/2023.0.0-intel ucx/1.13.0
Thus, if we want to use the toolchains managed by NREL, we must use the Intel oneapi toolchain in our VASP build, since intel-oneapi-mkl/2023.0.0-intel
is the only available mkl module. If you want to use a different toolchain, you could build MKL yourself, but that’s outside the scope of this article.
To “use the Intel oneapi toolchain” means to use Intel compilers and Intel’s implementation of MPI to compile VASP. We’re doing this because mkl was built with this toolchain, and we want our toolchains to match as best as possible to minimize build errors and bugs.
Let’s prepare our environment to use this toolchain. First,
module purge
To clear your environment. Now, we want the Intel oneapi mkl module, the Intel fortran compiler (ifort), and the Intel MPI fortran compiler (mpiifort). Type:
module avail 2>&1 | grep oneapi
to see which modules are related to the intel-oneapi toolchain. We can locate the three we want:
module load intel-oneapi-mkl/2023.0.0-intel
module load intel-oneapi-mpi/2021.8.0-intel
module load intel-oneapi/2022.1.0
How do we know these are the ones we want? The first line loads the mkl module. The second line gives us mpiifort, the Intel MPI fortran compiler, and the third line gives us ifort, the Intel Fortran compiler. (test the latter two with which mpiifort
and which ifort
-- you’ll see that they’re now in your path. If you module purge
and try which mpiifort
again, you’ll see you’re not able to find mpiifort anymore.)
Modifying the Makefile for MKL
Now that we have the toolchain loaded into our environment, let’s take a look at the actual makefile.include file (link to file here). There are two important sections for the purpose of getting the code to build. The first:
CPP = fpp -f_com=no -free -w0 $*$(FUFFIX) $*$(SUFFIX) $(CPP_OPTIONS)
FC = mpiifort -qopenmp
FCL = mpiifort
The first line says that the compiler pre-processor will be fpp (try which fpp
and you should get an output /sfs/nopt/nrel/apps/compilers/01-23/spack/opt/spack/linux-rhel8-icelake/gcc-8.4.0/intel-oneapi-compilers-2022.1.0-wosfexnwo5ag3gyfoco2w6upcew5yj6f/compiler/2022.1.0/linux/bin/intel64/fpp
, confirming that we’re pulling fpp from intel-oneapi).
The second and third lines say that we’ll be using Intel’s MPI (Try which mpiifort
to confirm that it is in your path). FC is the “Fortran Compiler” and FCL is the corresponding linker. Line 14 additionally says we’ll be compiling with openmp. Different compilers have different executable names (e.g. mpiifort for Intel MPI fortran compiler, mpifort for GNU). See the Fortran documentation page for a complete list.
The next important section is given below:
# Intel MKL (FFTW, BLAS, LAPACK, and scaLAPACK)
# (Note: for Intel Parallel Studio's MKL use -mkl instead of -qmkl)
FCL += -qmkl
MKLROOT ?= /path/to/your/mkl/installation
LLIBS += -L$(MKLROOT)/lib/intel64 -lmkl_scalapack_lp64 -lmkl_blacs_intelmpi_lp64
INCS =-I$(MKLROOT)/include/fftw
This makefile.include file has been provided to us by VASP. Our job here is two-fold:
- To ensure that we tell make (via the makefile.include file) the correct place to find MKL, I.e., to ensure that
MKLROOT
in the makefile.include file is set correctly.
- To ensure that we tell make the correct libraries to reference within
MKLROOT
.
To do step 1, first type:
module list
To see the modules you’ve loaded into your environment. You should have intel-oneapi-mkl/2023.0.0-intel
in the list. If not, review the environment preparation section. Now, we use the module show
command to find the root directory of mkl:
module show intel-oneapi-mkl/2023.0.0-intel
We see in the output of this command the following line:
setenv MKLROOT /sfs/nopt/nrel/apps/libraries/01-23/spack/opt/spack/linux-rhel8-icelake/intel-2021.6.0/intel-oneapi-mkl-2023.0.0-gnkrgwyxskxitvptyoubqaxlhh2v2re2/mkl/2023.0.0
If we type echo $MKLROOT
, we can confirm that this environment variable is properly set from when we ran the command module load intel-oneapi-mkl/2023.0.0-intel
. In the VASP makefile, we have MKLROOT ?= /path/to/your/mkl/installation
. The ?= means that this variable will not be set if MKLROOT
has already been set. So, we can ignore this line if we’d like. However, to be safe, we should simply copy the path of the MKL root directory to this line in makefile.include, so that this line now reads:
MKLROOT ?= /sfs/nopt/nrel/apps/libraries/01-23/spack/opt/spack/linux-rhel8-icelake/intel-2021.6.0/intel-oneapi-mkl-2023.0.0-gnkrgwyxskxitvptyoubqaxlhh2v2re2/mkl/2023.0.0
Tip
The name of the environment variable for mkl’s root directory set by its module (MKLROOT
, set when we module load intel-oneapi-mkl/2023.0.0-intel
) is not necessarily going to match the corresponding root directory variable in a given makefile. It did in this instance, but that’s not guaranteed. The VASP makefile.include could have just as easily used MKL_ROOT
, instead of MKLROOT
. This is one reason why it’s safer to use module show
to find the path of the root directory, then copy this path into the makefile, rather than rely on environment variables.
To do step 2, we should first look at the contents of $MKLROOT
. To show the contents of the MKL directory, type
ls /sfs/nopt/nrel/apps/libraries/01-23/spack/opt/spack/linux-rhel8-icelake/intel-2021.6.0/intel-oneapi-mkl-2023.0.0-gnkrgwyxskxitvptyoubqaxlhh2v2re2/mkl/2023.0.0
We should obtain the following output:
benchmarks bin env examples include interfaces lib licensing modulefiles tools
If we look closely at the makefile, we see beneath the MKLROOT
line the following:
MKLROOT ?= /sfs/nopt/nrel/apps/libraries/01-23/spack/opt/spack/linux-rhel8-icelake/intel-2021.6.0/intel-oneapi-mkl-2023.0.0-gnkrgwyxskxitvptyoubqaxlhh2v2re2/mkl/2023.0.0
LLIBS += -L$(MKLROOT)/lib/intel64 -lmkl_scalapack_lp64 -lmkl_blacs_intelmpi_lp64
the LLIBS
line is telling make which libraries in particular to pick out.
So, we want to go into the lib directory, and then the intel64 directory (since LLIBS is pointing to $MKLROOT/lib/intel64
). Let's see what's inside with the ls
command:
ls /sfs/nopt/nrel/apps/libraries/01-23/spack/opt/spack/linux-rhel8-icelake/intel-2021.6.0/intel-oneapi-mkl-2023.0.0-gnkrgwyxskxitvptyoubqaxlhh2v2re2/mkl/2023.0.0/lib/intel64
There's a lot of stuff in this directory! VASP helps us by telling us we need the mkl_scalapack_lp64
and mkl_blacs_openmpi_lp64
builds specifically. You won't always be told exactly which libraries, and figuring this out, if the information is not provided to you in the package documentation, can require some tinkering.
In general, the .a
extension is for static linking, and the .so
extension is for dynamic linking. For MKL in particular, the part ilp64
vs lp64
refer to two different interfaces to the MKL library.
Tip
Notice that, inside $MKLROOT/lib/intel64
, the filenames all start with libmkl
, but in our makefile, we reference lmkl_scalapack_lp64
. That's not a file in $MKLROOT/lib/intel64
, but libmkl_scalapack_lp64.so
is. The notation is that "big L" references the directories that the libraries are in, and the "little l" references the particular libraries. For example:
LLIBS += -L$(MKLROOT)/lib/intel64
-lmkl_scalapack_lp64
This is just a convention, but is important to get right because your compile will fail otherwise.
Now that we have the correct MKLROOT
set in the makefile.include, and we have an idea about how it's referencing the libraries within, we can move on to linking the HDF5 library.
Modifying the Makefile for HDF5
Because HDF5 is an optional library, we could compile the code now if we wanted to. However, for the sake of practice, let’s uncomment the block in the makefile.include file related to HDF5 and repeat the exercise of linking a library:
# HDF5-support (optional but strongly recommended)
CPP_OPTIONS+= -DVASP_HDF5
HDF5_ROOT ?= /path/to/your/hdf5/installation
LLIBS += -L$(HDF5_ROOT)/lib -lhdf5_fortran
INCS += -I$(HDF5_ROOT)/include
Our job, again, is to give the makefile the correct directions to our library. In this case, it’s HDF5. Let’s see which HDF5 modules are available:
module avail hdf5
Returns
hdf5/1.12.2-intel-oneapi-mpi-intel hdf5/1.12.2-openmpi-gcc
So, we see that HDF5 has been built with the intel-oneapi-mpi toolchain, and also with the GCC/openmpi toolchain. Since we’re building vasp using the intel-oneapi toolchain, we need to load the corresponding module:
module load hdf5/1.12.2-intel-oneapi-mpi-intel
Again, we must locate the root directory:
module show hdf5/1.12.2-intel-oneapi-mpi-intel
We see the line for setting the HDF5 root directory environment variable:
setenv HDF5_ROOT_DIR /sfs/nopt/nrel/apps/libraries/01-23/spack/opt/spack/linux-rhel8-icelake/intel-2021.6.0/hdf5-1.12.2-dzgeixsm2cd3mupx4ti77ozeh7rh6zdo
Like before, we copy this path into our makefile.include:
# HDF5-support (optional but strongly recommended)
CPP_OPTIONS+= -DVASP_HDF5
HDF5_ROOT ?= /sfs/nopt/nrel/apps/libraries/01-23/spack/opt/spack/linux-rhel8-icelake/intel-2021.6.0/hdf5-1.12.2-dzgeixsm2cd3mupx4ti77ozeh7rh6zdo
LLIBS += -L$(HDF5_ROOT)/lib -lhdf5_fortran
INCS += -I$(HDF5_ROOT)/include
We’re ready to compile! In the case of VASP, the compile command is make DEPS=1 std
but in general, the command may be make all
or similar (consult the documentation of the code you’re trying to build).
If you’re working with a code that has a testsuite, now is a good time to run the testsuite to make sure that your compile was successful.
Summary of Steps
- Download the source code of the package you’re trying to build. This will generally be found on the website of the package.
- Consult the documentation of the package to find out what scientific libraries are needed, and if the package developers provide guidance on what toolchains/libraries are best
- Determine the availability of the needed scientific libraries.
- Can a “library-of-libraries” like MKL or LibSci be used?
- Does NREL support the library as a module?
- If so, determine the toolchain it was built with (usually given in the name of the module). If the toolchain is not clear from the name of the module, try the
ldd
command (e.g., ldd path/to/executable/executable
), which will show you the dynamically linked libraries of the executable.
- Prepare your environment
module load
the necessary modules to prepare your environment. (See environment preparation step of VASP example)
- Prepare your makefile
- Make sure that the compilers and (optional) MPI used in the makefile match what is used to build your scientific libraries as best as possible
- Make sure that the paths to the scientific libraries in the makefile match the path given by the
module show
command
- Make sure the proper “little L” libraries are referenced in the makefile
- Compile!
Questions?
If you’re still stuck and unable to successfully link the scientific libraries you need, get in contact with HPC help.