Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

File NameDescriptionSample
Sample_Yeast_L005_R1.cat.fastq.gzPaired-end Illumina, First of pair, FASTQYeast ChIP-seq
Sample_Yeast_L005_R2.cat.fastq.gzPaired-end Illumina, Second of pair, FASTQYeast ChIP-seq
human_rnaseq.fastq.gzSinglePaired-end Illumina, First of pair only, FASTQHuman RNA-seq
human_mirnaseq.fastq.gzSingle-end Illumina, FASTQHuman microRNA-seq

...

Searching genomes, however, is hard work and takes a long time if done on an un-indexed, linear genomic sequence.  So, most aligners require that references be indexed for quick access  The aligners we are using each require a different index, but use the same method (the Burrows-Wheeler Transform) to get the job done.  This requires taking a FASTA file as input, with each chromosome (or contig) as a separate entry, and producing some aligner-specific set of files as output.  Then, those output files are used by the aligner when executing a given alignment command. Here are some details of where you can find the references we need now (and here are many more):

ReferenceSpeciesBase LengthContig NumberSourceDownload Link
Hg19Human 3,137,161,26425 (really 93) UCSC 
MirbaseV20Human  1908Mirbase 
SacCer3Yeast  17UCSC 
Mm9Mm10Mouse  UCSC22 (really 66)UCSC 

The yeast and mirbase fasta files (with just reference sequence information) are located on the path:

Code Block
/

 

Hg19 is way too big for us to index here, so we've already done it.  


BWA - Yeast ChIP-seq

 C

Bowtie2 and Local Alignment - Human microRNA-seq

...