Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

Environment setup

Directories and links needed in your home directory.

Code Block
languagebash
cd 
ln -s -f $SCRATCH scratch
ln -s -f $WORK2 work2
ln -s -f /work2/projects/BioITeam
ln -s -f /work2/projects/BioITeam/projects/courses/Core_NGS_Tools CoreNGS

mkdir -p ~/local/bin
cd ~/local/bin
ln -s -f /work2/projects/BioITeam/common/bin/launcher_creator.py
ln -s -f /work2/projects/BioITeam/common/bin/launcher_maker.py

Anchor
Catchup_bashrc
Catchup_bashrc
.bashrc setup

If you already have a .bashrc set up, make a backup copy first. You can restore your original login script after class is over.

Code Block
languagebash
cd
cp .bashrc .bashrc.beforeNGSTools

Copy and configure the login profile for this class

Code Block
languagebash
cd
cp /work2/projects/BioITeam/projects/courses/Core_NGS_Tools/tacc/bashrc.corengs.stampede2  .bashrc
chmod 600 .bashrc

# or, using your symlink
cd
cp ~/CoreNGS/tacc/bashrc.corengs.stampede2  .bashrc
chmod 600 .bashrc

Source it to make it active (if this doesn't work, log off then log back in):

Code Block
languagebash
titleCopy a pre-configured login script
source ~/.bashrc

Anchor
Catchup_env_vars
Catchup_env_vars
Environment variables

General

Code Block
languagebash
export ALLOCATION=UT-2015-05-18
export BIWORK=/work2/projects/BioITeam
export CORENGS=$BIWORK/projects/courses/Core_NGS_Tools

export PATH=.:$HOME/local/bin:$PATH

Turn on coloring by file type in the shell:

Code Block
languagebash
# For better colors using a dark background terminal, un-comment this line:
export LS_COLORS=$LS_COLORS:'di=1;33:fi=01:ln=01;36:'

# For better colors using a white background terminal:
export LS_COLORS=$LS_COLORS:'di=1;34:fi=01:ln=01;36:'

# May or may not be needed
export LS_OPTIONS='-N --color=auto -T 0

TACC intro

Anchor
Catchup_cmds
Catchup_cmds
Commands files

Simple commands

Code Block
languagebash
mkdir -p $SCRATCH/core_ngs/slurm/simple
cd $SCRATCH/core_ngs/slurm/simple
cp $CORENGS/tacc/simple.cmds .

Wayness commands

Code Block
languagebash
mkdir -p $SCRATCH/core_ngs/slurm/wayness
cd $SCRATCH/core_ngs/slurm/wayness
cp $CORENGS/tacc/wayness.cmds .

Anchor
Catchup_idev
Catchup_idev
Start an idev session

To start an a 3-hour idev (interactive development) session:

Code Block
languagebash
titleStart an idev session
idev -p normal -m 120 -N 1 -n 2468 -A UT-2015-05-18 --reservation=CCBBBIO_DATA_week_1
Tip

You can tell you're in a idev session because the hostname command will return a compute node name (e.g. nid00438 c401-041.stampede2.tacc.utexas.edu) instead of a login node name (e.g. login5. login2.stampede2.tacc.utexas.edu).

The n idev session will terminate when the requested time has expired, or you use the exit command.

Working with FASTQ

Anchor
Catchup_yeast_data
Catchup_yeast_data
Yeast data

Working with some yeast ChIP-seq FASTQ data:

Code Block
languagebash
# Create a $SCRATCH area to work on data for this course,
# with a sub-direct[1ory for pre-processing raw fastq files
mkdir -p $SCRATCH/core_ngs/fastq_prep

# Make symbolic links to the original yeast data:
cd $SCRATCH/core_ngs/fastq_prep
ln -s -f $CORENGS/yeast_stuff/Sample_Yeast_L005_R1.cat.fastq.gz
ln -s -f $CORENGS/yeast_stuff/Sample_Yeast_L005_R2.cat.fastq.gz

# Copy over a small FASTQ file
cd $SCRATCH/core_ngs/fastq_prep
cp $CORENGS/misc/small.fq .

Anchor
Catchup_multiqc
Catchup_multiqc
ATACseq data for MultiQC

Get some FastQC reports for MultiQC:

Code Block
languagebash
mkdir -p $SCRATCH/core_ngs/multiqc/fqc.atacseq
cd $SCRATCH/core_ngs/multiqc/fqc.atacseq
cp $CORENGS/multiqc/fqc.atacseq/*.html .

Anchor
Catchup_cutadapt
Catchup_cutadapt
FASTQ files for cutadapt

For command-line cutadapt exploration:

Code Block
languagebash
cd $SCRATCH/core_ngs/fastq_prep
cp $CORENGS/human_stuff/Sample_H54_miRNA_L004_R1.cat.fastq.gz .
cp $CORENGS/human_stuff/Sample_H54_miRNA_L005_R1.cat.fastq.gz .
zcat Sample_H54_miRNA_L004_R1.cat.fastq.gz | head -2000 > miRNA_test.fq

For batch cutadapt processing:

Code Block
languagebash
mkdir -p $SCRATCH/core_ngs/cutadapt
cd $SCRATCH/core_ngs/cutadapt
cp $CORENGS/human_stuff/Sample_H54_miRNA_L004_R1.cat.fastq.gz .
cp $CORENGS/human_stuff/Sample_H54_miRNA_L005_R1.cat.fastq.gz .
cp $CORENGS/yeast_stuff/Yeast_RNAseq_L002_R1.fastq.gz .
cp $CORENGS/yeast_stuff/Yeast_RNAseq_L002_R2.fastq.gz .
cp $CORENGS/tacc/cuta.cmds .

Alignment workflow

Anchor
Catchup_align_setup
Catchup_align_setup
Alignment workflow setup

Starting files:

Code Block
languagebash
# FASTA (for building references)
mkdir -p $SCRATCH/core_ngs/references/fasta
cp $CORENGS/references/*.* $SCRATCH/core_ngs/references/fasta/

# FASTQ (to align)
mkdir -p $SCRATCH/core_ngs/alignment/fastq
cp $CORENGS/alignment/*fastq.gz $SCRATCH/core_ngs/alignment/fastq/

Anchor
Catchup_references
Catchup_references
References

Get a copy of all references we build in the exercises (including FASTA):

Code Block
languagebash
mkdir -p $SCRATCH/core_ngs/references
rsync -ptlvrP $CORENGS/references/ $SCRATCH/core_ngs/references/

Anchor
Catchup_bwa_pe_yeast
Catchup_bwa_pe_yeast
BWA PE alignment of yeast data

To jump into aligning PE yeast data with BWA

Code Block
languagebash
# Pre-built references
mkdir -p $SCRATCH/core_ngs/references
rsync -avrP $CORENGS/references/ $SCRATCH/core_ngs/references/

# FASTQ (to align)
mkdir -p $SCRATCH/core_ngs/alignment/fastq
cp $CORENGS/alignment/*fastq.gz $SCRATCH/core_ngs/alignment/fastq/

# Alignment directory
mkdir -p $SCRATCH/core_ngs/alignment/yeast_bwa
cd $SCRATCH/core_ngs/alignment/yeast_bwa
ln -s -f ../fastq
ln -s -f ../../references/bwa/sacCer3

module load biocontainers  # takes a while
module load bwa
module load samtools

Anchor
Catchup_bwa_pe_yeast
Catchup_bwa_pe_yeast
samtools manipulation of aligned yeast data

To jump into post-alignment manipulation of the yeast_pairedend.bam with samtools:

Code Block
languagebash
mkdir -p $SCRATCH/core_ngs/alignment/yeast_bwa
cd $SCRATCH/core_ngs/alignment/yeast_bwa
cp $CORENGS/catchup/yeast_bwa/yeast_pairedend.bam .

module load biocontainers  # takes a while
module load samtools

# If the sorted, indexed BAM is needed:
cp $CORENGS/catchup/yeast_bwa/yeast_pairedend.sort* .

SAMTools and BEDTools

Anchor
Catchup_samtools
Catchup_samtools
Setup for samtools

Code Block
languagebash
titleSetup for samtools exercises
mkdir -p $SCRATCH/core_ngs/samtools
cd $SCRATCH/core_ngs/samtools
cp $CORENGS/catchup/for_samtools/* .

module load biocontainers  # takes a while
module load samtools