Data Transfer Tools
Data Transfer Tools
This article lists the data transfer tools available on JASMIN and provides links to articles that describe them in more detail.
The are many tools that you can use for transferring data to/from JASMIN. See the articles linked below for more details.
Tool | Info |
---|---|
Globus | An efficient, secure data transfer service available to all JASMIN users (recommended). Includes capability to schedule, repeat and orchestrate transfers between third-party hosts and receive notifications of job status. Has web and command-line interfaces. Efficient for moving large volumes and/or numbers of files, especially over long distances. |
scp | A basic transfer tool that works over the SSH protocol. Similar to “cp” but copies between remote servers. |
rsync (over SSH) | Like scp but more sophisticated. Allows synchronisation between remote directory trees. |
sftp | SSH FTP - works over SSH. |
bbcp | A command-line tool that allows the user to specify parallel transfer over multiple streams, using SSH authentication. |
GridFTP (over SSH) | An old but comprehensive data transfer tool. Highly configurable and able to transfer over multiple parallel streams. Used over SSH in this case. Superseded by Globus |
GridFTP (certificate-based) | Legacy Gridftp using certificate-based authentication instead of SSH credentials. Efficient for moving large volumes and/or numbers of files, especially over long distances. Superseded by Globus |
FTP | File Transfer Protocol. An aged transfer protocol suitable for small file transfers but limited. |
LFTP | Parallel-capable FTP client. |
wget, curl | Download tools for accessing resources over HTTP primarily. (see 3rd party documentation) |
sharing GWS data via http | for exposing part(s) of a Group Workspace via HTTP to users without JASMIN accounts. |
Python transfer tools | Methods of managing/scripting data transfer tasks using Python. You can use libraries such as requests in a Python 3 virtual environment on the transfer servers. |
MASS client (Met Office) | A specific command-line tool installed on the mass-cli1.ceda.ac.uk server on JASMIN. Enables extractions from the Met Office MASS Archive. |
OPeNDAP | A transfer protocol for extracting subsets of files from a remote server (over HTTP) |
rclone | A 3rd party, open-source command-line utility which can interface to, and synchronise data between, a wide variety of cloud and other storage backends, such as Google Drive and AWS S3 compatible object stores. It can also sync data over SSH. This utility is not installed on JASMIN but is well-documented and trivial for users to download and configure themselves on one of the data transfer servers Please note install instructions, and dos and don’tsfor rclone on JASMIN. |