Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

You have already worked with a paired-end yeast ChIP-seq dataset, which we will continue to use here.  We will also use two additional RNA-seq datasets.  The additional data are located in the path:

Code Block
/corral-repl/utexas/BioITeam$SCRATCH/core_ngs_tools/human_stuff 

So, the following are the data files you will need:

File NameDescriptionSample
Sample_Yeast_L005_R1.cat.fastq.gzPaired-end Illumina, First of pair, FASTQYeast ChIP-seq
Sample_Yeast_L005_R2.cat.fastq.gzPaired-end Illumina, Second of pair, FASTQYeast ChIP-seq
human_rnaseq.fastq.gzPaired-end Illumina, First of pair only, FASTQHuman RNA-seq
human_mirnaseq.fastq.gzSingle-end Illumina, FASTQHuman microRNA-seq

Now we need to set up the raw data for processing. Stage these files on Stampede from Corral in the fewest possible commands in a directly called "

Expand
titleHint:
Code Block
cds
mkdir raw_data
cp /corral-repl/utexas/BioITeam/core_ngs_tools/*/*.fastq.gz ./raw_data/
cd raw_data/

Do you believe that I gave you files of any reasonable quality?  I wouldn't, so you should check it outDo a fast quality check on the two new data files like you did earlier on the yeast files.

Expand
titleHint:
Code Block
cds; cd raw_data
module load fastqc
fastqc human_rnaseq.fastq.gz
fastqc human_mirnaseq.fastq.gz

...