Transferring Files with Globus
For large data transfers between NREL’s high-performance computing (HPC)
systems and another data center, or even a laptop off-site, we recommend using
Globus.
A supporting set of instructions for requesting a Globus account and data
transfer using Globus is available on the HPC NREL
Website
What Is Globus?
Globus provides services for research data management, including file transfer.
It enables you to quickly, securely and reliably move your data to and from
locations you have access to.
Globus transfers files using GridFTP. GridFTP is a high-performance data
transfer protocol which is optimized for high-bandwidth wide-area networks. It
provides more reliable high performance file transfer and synchronization than
scp or rsync. It automatically tunes parameters to maximize bandwidth while
providing automatic fault recovery and notification of completion or problems.
Get a Globus Account
To get a Globus account, sign up on the Globus account website.
Globus NREL Endpoints
The current NREL Globus Endpoints are:
- nrel#eglobus - This endpoint allows access to any files on Eagle's Lustre file system: /scratch and /projects.
- nrel#kglobus_projects - This endpoint will give you access to any files you have on the Kestrel Project File System: /datasets, /projects, and /shared-projects.
- nrel#kglobus_scratch - This endpoint will give you access to any files you have on the Kestrel Scratch File System: /scratch.
- nrel#vast - This endpoint will give you access to any files you have on our VAST file system: /campaign, /datasets (Eagle), /MSS and /shared-projects (Eagle).
Globus Personal Endpoints
You can set up a "Globus Connect Personal EndPoint", which turns your personal
computer into an endpoint, by downloading and installing the Globus Connect
Personal application on your system.
Set Up a Personal EndPoint
- Download Globus Connect Personal
- Once installed, you will be able to start the Globus Connect Personal
application locally, and login using your previously created Globus
account credentials.
- Within the application, you will need to grant consent for Globus to access
and link your identity before creating a collection that will be visible from
the Globus Transfer website.
- Additional tutorials and information on this process is located at the Globus
Website for both
Mac and
Windows.
Transferring Files
You can transfer files with Globus through the Globus
Online website or via the CLI
(command line interface).
Important
It is strongly recommended to compress multiple files into a single archive (tar.gz, zip) before transferring data with Globus.
To compress a directory:
tar -czvf filename.tar.gz /path/to/dir
To extract an archive:
tar -xzvf filename.tar.gz
Globus Online
Globus Online is a hosted service that allows you to use a browser to transfer
files between trusted sites called "endpoints". To use it, the Globus software
must be installed on the systems at both ends of the data transfer. The NREL
endpoint is nrel#eglobus.
- Click Login on the Globus web site. On the login
page select "Globus ID" as the login method and click continue. Use the Globus
credentials you used to register your Globus.org account.
- The ribbon on the left side of the screen acts as a Navigator, select File Manager
if not already selected. In addition, select the 'middle' option for Panels in the upper
right, which will display space for two Globus endpoints.
- The collection tab will be searchable (e.g. nrel), or nrel#eglobus can be
entered in the left collection tab. In the box asking for authentication, enter
your NREL HPC username and password. Do not use your globus.org username
or password when authenticating with the nrel#eglobus endpoint.
- Select another Globus endpoint, such as a personal endpoint or
an endpoint at another institution that you have access to.
To use your personal endpoint, first start the Globus Connect Personal application.
Then search for either the endpoint name or your username in the collections tab,
and select your endpoint. After the first use, you should see your endpoints in
the recent tab when searching. You may also setup an endpoint/directory as a bookmark.
- To transfer files:
- select the files you want to transfer from one of the endpoints
- select the destination location in the other endpoint (a folder or directory)
- click the 'start' button on the source collection, and it will transfer files
to the target collection
- For additional information, the Globus Webpage has
tutorials and documentation under the Resources tab.
When your transfer is complete, you will be notified by email.
Globus CLI (command line interface)
Globus supports a command line interface (CLI), which can be used for scripting
and automating some transfer tasks. For more information,
it is suggested that the user refer to the Globus CLI
documentation located on the Globus Webpage.
For installing globus-cli, the recommendation is to use a Conda environment. In this
case, it is advised to follow the instructions about mixing Conda and Pip,
and only use Pip after establishing a base environment using Conda. For more information about mixing Conda and Pip, refer to our internal documentation at: Conda