dbGaP Job Submission

All dbGaP data is stored on Isilon share mounted automatically when a user requests an interactive dbGaP session.

Step 1: Request an interactive session on Oscar on the dbGaP.

interact -q dbgap -n 20 -m 20g -t 01:00:00

Group membership fordbgap, dbg_PiLastName, dbg_import, dbg_export along with SLURM associations are required for working with dbGaP.

Step 2: Understand the dbGaP Data Hierarchy

The native GPFS file system (Home, Scratch, Data, Runtime) will be read-only access. All dbGaP related work must be done in the/dbGaProot directory. The /dbGaP directory has two sub-directories data & results . The downloaded datasets from the xfer server will be written to/dbgap/import/userand all output files will be written to the/dbgap/results/usernamedirectory. The hierarchy structure is:

psaluja@node1030:/dbGaP$ tree /dbGaP/
/dbGaP/
β”œβ”€β”€ data
β”‚Β Β  └── import
β”‚Β Β      β”œβ”€β”€ group_1
β”‚Β Β      β”œβ”€β”€ group_2
β”‚Β Β      β”œβ”€β”€ user_1
|       |   β”œβ”€β”€ SRR10859003_1.fastq.gz
β”‚Β Β      └── user_2
β”‚Β Β          β”œβ”€β”€ SRR10859003_1.fastq.gz
β”‚Β Β          β”œβ”€β”€ SRR10859003_2.fastq.gz
β”‚Β Β          └── SRR10859003_3.fastq.gz
└── results
    β”œβ”€β”€ user_1
    β”‚Β Β  β”œβ”€β”€ job_script.sh
    β”‚Β Β  β”œβ”€β”€ slurm-145960.out
    β”‚Β Β  └── slurm-1445969.out
    └── user_2

Step 3: Submitting a dbGaP batch jobs

Home, Scratch & Data will be in read-only mode only users must write their code files, batch scripts in their designated. Example batch script for dbGaP jobs

Any output files including SLURM out and err files must be written to /dbGaP/results directory.

Last updated

Was this helpful?