You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

Old Etherpad link:  https://etherpad.mozilla.org/g2NxIEAFWL

Day 1: Linux/TACC Introduction

Day 2: Raw Sequencing data manipulation

  • Pre-processing FASTQ sequences
  • Overview of NGS data formats and analyses
  • The FASTA sequencing data format
  • The FASTQ sequencing data format, with Illumina/GSAF-specific details
  • Compression, Linux manipulation of fastq files
  • Overview of sequence quality checking
  • FASTQC - a good place to start
  • FASTX toolkit manipulation of FASTQ data
  • Adapter trimming with cutadapt
  • Batch manipulation of FASTQ files
  • No labels