CyVerse Data Store¶
The Data Store is more than a place to save your files: it is a way to manage the life cycle of your data. From creation to publication and beyond, there are a number of practices that help maintain the integrity and value of your data.
We have already covered accessing the Data Store using the Discovery Environment in the previous lesson. In this lesson we will practice other ways of accessing the Data Store, plus how to make your data publicly available.
iCommands is a collection of tools developed by the iRODS project, which is the technology that supports the CyVerse Data Store. Using iCommands is the most flexible way to interact with the Data Store.
iCommands provides command-line access to the Data Store, so it can be included in scripts to automate data upload and download. Unfortunately, the latest iCommands cannot be installed on most Windows operating systems, but participants with Windows computers can do this exercise using Atmosphere (which will be covered in tomorrow’s lessons). If you are running Windows 10, you can run iCommands on the Windows Subsystem for Linux.
Follow along with the Using iCommands quick start to:
- Install and configure iCommands
- Upload a file to your home folder (iput)
- Download a file to your desktop (iget)
In addition, we will use iCommands to:
- Create a new folder in your home directory (imkdir)
- Move a file from your home directory to the new folder (imv)
- Navigate to a public folder (icd)
- Copy a public file to the newly created folder (icp)
$ imkdir newdir
$ imv file_name newdir/file_name
$ icd /iplant/home/shared/imicrobe/camera
$ ils
$ icp camera_projects/CAM_PROJ_AcidMine.csv /iplant/home/$username/newdir/CAM_PROJ_AcidMine.csv
$ icd /iplant/home/$username/
$ ils newdir
$ ils -A newdir
Here is the full documentation of iCommands.
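Because iCommands runs from the command line, the upload step can be wrapped in a small script. The following is a minimal sketch, not an official CyVerse tool: it assumes iCommands is installed and that you have already authenticated with iinit. The function name upload_dir and the DRY_RUN variable are hypothetical names introduced here for illustration.

```shell
#!/usr/bin/env bash
# Sketch: upload every regular file in a local directory to a Data Store folder.
# Assumes iCommands is installed and `iinit` has already been run.
# upload_dir and DRY_RUN are hypothetical names, not part of iCommands.

upload_dir() {
    local src_dir="$1"    # local directory holding the files to upload
    local dest_coll="$2"  # Data Store folder, e.g. /iplant/home/$username/newdir
    local f

    for f in "$src_dir"/*; do
        # Skip subdirectories and the unexpanded pattern of an empty directory
        [ -f "$f" ] || continue

        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Dry run: print the command instead of executing it
            echo "iput -K $f $dest_coll/$(basename "$f")"
        else
            # -K asks iput to calculate and verify a checksum for the upload
            iput -K "$f" "$dest_coll/$(basename "$f")" || return 1
        fi
    done
}
```

Setting DRY_RUN=1 before calling upload_dir prints the iput commands without transferring anything, which is a cheap way to check paths before a large upload.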
Cyberduck is a free third-party software tool that allows you to drag and drop files between your local computer (or a remote server) and the Data Store. Cyberduck can also be used to rename files and to browse other shared or public Data Store locations.
Follow along with the Cyberduck quick start to:
- Install and configure Cyberduck
- Upload a file to your CyVerse home directory
- Navigate to a public folder
CyVerse Data Commons¶
The Data Commons provides services throughout CyVerse to manage, organize, preserve, publish, discover, and reuse data.
For data that are not stable or permanent, you can request a Community Released Folder.
For an overview see Publishing data on the CyVerse Data Commons.
Advanced Metadata Usage¶
The Data Commons provides advanced metadata features in the Discovery Environment, including:
Exercise:
- Open the DE
- Apply the DOI request template to a folder.
- Apply an ontology term to a file or folder.
Exercise:
- Copy the B123 folder to your home directory
$ icd /iplant/home/$username
$ icp -r /iplant/home/rwalls/B123 B123
$ ils
- In the DE, apply metadata to the contents of B123
- Browse to B123
- View the metadata for one of the files using the Metadata menu or the three dots (it should be black)
- View the contents of the file
- Browse back to your home directory
- Check the box next to any directory
- Select Metadata > Apply Bulk Metadata
- Select the file B123/Rice_metadata.csv
- Browse back to B123
- View the metadata of the different files in the directory
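If you repeat this exercise, the icp -r step at the start will run against a B123 folder that already exists in your home directory. The sketch below guards the copy by checking the destination first. It assumes that ils exits with a non-zero status when a path does not exist; copy_if_absent is a hypothetical helper name, not an iCommands command.

```shell
# Sketch: copy a Data Store folder only when the destination is not there yet.
# Assumes `ils` returns a non-zero exit status for a missing path.
# copy_if_absent is a hypothetical helper, not part of iCommands.

copy_if_absent() {
    local src="$1"   # source folder, e.g. /iplant/home/rwalls/B123
    local dest="$2"  # destination folder, e.g. B123

    if ils "$dest" >/dev/null 2>&1; then
        # Destination already listed: leave it untouched
        echo "skip: $dest already exists"
    else
        # -r copies the folder and everything inside it
        icp -r "$src" "$dest"
    fi
}
```

From your home directory, copy_if_absent /iplant/home/rwalls/B123 B123 would then stand in for the plain icp -r call.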