Parallel Tar With Validation

Compute Resources

Overview

The purpose of this document is to demonstrate steps required to parallel tar using RIS-hosted image gcr.io/ris-registry-shared/parallel-tar. Input variable and process execution validation included.

Interactive Command-Line Session

bsub -n 2 -Is -q general-interactive -a 'docker(gcr.io/ris-registry-shared/parallel-tar)' /bin/bash
parallel-tar.py -s path/to/source_dir/ -d path/to/dest_dir/

Non-Interactive Session

bsub -n 2 -q general -a 'docker(gcr.io/ris-registry-shared/parallel-tar)' "parallel-tar.py -s path/to/source_dir/ -d path/to/dest_dir/"

Expected Output

Running tar with parameters
source directory: path/to/source_dir/
destination directory: path/to/dest_dir/
threads: 4
tar file name: path/to/dest_dir/source_dir.tar.gz
remove source directory: False

Validating tar contents

Parallel-tar completed successfully

Script Variables

  • -s ā€“source-dir source directory to tar and compress

  • -d ā€“dest-dir destination directory to place tar compressed file

  • -t ā€“threads number of threads for the compress algorithm, default 4

  • -n ā€“tar-file-name name of tar file with out the .tar.gz extension

  • -r ā€“remove-source-dir Remove the source directory when tar process completes, default False