Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.

This is the home of the Core NGS Tools course, May 2014June 2022, at 

This workshop provides an introduction to common analysis tools and file formats currently used in NGS, with emphasis on quality assessment and manipulation of raw NGS sequences (FastQC, cutadapt), read mapping (bwa, bowtie2), the Sequence Alignment Map (SAM) format, and tools for manipulating BAM files (samtools, bedtools). Participants will gain hands-on experience using these and other NGS tools in the Linux command line environment at TACC, as well as exposure to the many bioinformatics resources TACC makes available.


We will meet in Room 101B of the Flawn Academic Center (FAC) building.  We STRONGLY encourage you to use the computers provided in the classroom, but you may also bring your personal laptops.


Day 1: Linux/TACC Introduction and Raw Sequence structure

Part 1: Linux/TACC Introduction

  • Overview of NGS data analysis
  • The FASTA sequencing data format
  • The FASTQ sequencing data format, with Illumina/GSAF-specific details
  • Compression, Linux manipulation of fastq files

Day 2: Raw Sequencing Quality Evaluation

Part 1: FASTQ manipulation tools

  • Overview of sequence quality checking
  • FASTQC - a good place to start
  • FASTX toolkit manipulation of FASTQ data
  • Adapter trimming with cutadapt

Part 2: FASTQ manipulation at TACC

  • Running batch jobs at TACC
  • Batch manipulation of FASTQ files

Day 3: Alignment and BAM file manipulation

Part 1: Alignment and aligners

  • Overview of read alignment, references and alignment tools
  • BWA overview, relevant options
  • Bowtie2 overview, relevant options

Part 2: SAM/BAM format and manipulation

  • The SAM file format specification
  • Manipulating BAM files with samtools
  • Alignment filtering examples

Day 4: Post-Alignment Visualization and Analysis

Part 1: Visualization tools and formats

  • Data formats for visualization (BED, GTF/GFF)
  • The Integrative Genomics Viewer (IGV)
  • UCSC Genome Browser

Part 2: SAM/BAM format and manipulation

  • Analysis with bedtools – intersect, coverage, merge
  • Obtaining public datasets from GEO
  • Other NGS tools and resources

Link to Etherpad:

Use this to post any questions you have about the lessons and tutorials.

Your Instructors

  • Anna Battenhouse, Associate Research Scientist, Iyer Lab,
  • Dr. Daechan Park, Post-doctoral fellow, Georgio Lab
  • Nathan Abell, Research Assistant, Iyer Lab
  • Amelia Weber Hall, Graduate Student, Iyer Lab



Instructors: meet 8am Monday

Each Part 1/Part 2 section needs to be standardized with:
*Learning Objectives
*Workflow diagram (data, toolbox/recipe, exercises)
*Tutorial (bulk of time here)
*Recap learning objectives
*Next steps...


For online attendees, the Zoom URL is:

There will be a short break each day around 10:30am.

Your TACC account will remain on our class  TACC project allocation through June 30, 2022

We will provide access to recordings of each day's materials after the course is over.



Macros This page includes some basic macros. As you create pages, add news items and comments, you'll see the macros below fill up with all the activity in your space. Macros are your friends: look for the Macro icon in the Rich Text editor options when you're editing a page.

Recently Updated

Navigate space
Page Tree Search
Page Tree