This article describes the CEDA Archive via the JASMIN system. The local mounting of the CEDA Archives on JASMIN provides direct access to thousands of datasets available to users logged in to JASMIN on the local file system, though not all as explained below.
Accessing the CEDA archive on JASMIN servers
The CEDA Archive covers datasets from the atmospheric, climate change and earth observation science domains, previously known as the BADC and NEODC archives. This is directly accessible as a file system from the shared science machines on JASMIN.
Many data sets have restricted access, requiring you to first apply for access to the data directly from the data centres (see this article for more details). Once you have been given authorisation from the data centre you may be given the required UNIX groups that will allow read access to the archive data on JASMIN where this is possible, though not all datasets are available as direct access via JASMIN.
Use the CEDA Data Catalogue to find and apply for access to the datasets you need. You can use the standard FTP and web based access methods to transfer data from the CEDA Archive to a JASMIN group workspace if direct access is not possible. As the data centres use the same JASMIN infrastructure the transfer rates are high. See the sites for details of datasets.
The contents of the CEDA Archive are available on the file system under /badc and /neodc.
Archive access groups
As a JASMIN user, you are already a member of the Linux group
open. This gives you file system access to the CEDA Archive unrestricted datasets. For restricted data sets file system access on JASMIN, you have to apply via the CEDA Data Catalogue. Upon approval, you will be added to the relevant access control groups for the requested data set where direct access on JASMIN is possible. The available archive access control groups are:
esacat1 ecmwf ukmo open eurosat byacl cmip5_research ukmo_wx ukmo_clim gws_specs. Access is controlled by Unix groups.
All use of data accessed directly from the CEDA Archive must be used in line with the relevant data licence in place for the relevant dataset for the purposes stated in the access application. Data licence information can be found on the relevant CEDA Data Catalogue page, a link to which can be found in the
00README_catalogue_and_licence.txt files found in the archive. For specific data licences granted for restricted datasets users should log into their MyCEDA page to view their granted licence and the associated usage purpose under which access was granted. Any required alternative use of the data beyond the original purpose stated in the original licence application can only be made with a freshly granted new licence application.
Accessing data in the archive
In the example below, the logged-in user is listing the contents of the CRU data sets within the BADC archive. These are "open" so all logged-in users can access them:
$ ls -l /badc/cru/data total 320 -rw-r----- 1 badc open 396 Feb 18 2015 00README drwxr-x--- 8 badc open 4096 Mar 22 10:32 cru_cy drwxr-x--- 4 badc open 4096 Dec 6 2014 crutem drwxr-x--- 12 badc open 4096 May 9 14:11 cru_ts drwxr-x--- 3 badc open 4096 Feb 18 2015 PDSI