...
You have already worked with a paired-end yeast ChIP-seq dataset, which we will continue to use here. We will also use two additional RNA-seq datasets. The additional data are located in the path:
Code Block |
---|
/corral-repl/utexas/BioITeam$SCRATCH/core_ngs_tools/human_stuff |
So, the following are the data files you will need:
File Name | Description | Sample |
---|
Sample_Yeast_L005_R1.cat.fastq.gz | Paired-end Illumina, First of pair, FASTQ | Yeast ChIP-seq |
Sample_Yeast_L005_R2.cat.fastq.gz | Paired-end Illumina, Second of pair, FASTQ | Yeast ChIP-seq |
human_rnaseq.fastq.gz | Paired-end Illumina, First of pair only, FASTQ | Human RNA-seq |
human_mirnaseq.fastq.gz | Single-end Illumina, FASTQ | Human microRNA-seq |
Now we need to set up the raw data for processing. Stage these files on Stampede from Corral in the fewest possible commands in a directly called "
Expand |
---|
|
Code Block |
---|
cds
mkdir raw_data
cp /corral-repl/utexas/BioITeam/core_ngs_tools/*/*.fastq.gz ./raw_data/
cd raw_data/ |
|
Do you believe that I gave you files of any reasonable quality? I wouldn't, so you should check it outDo a fast quality check on the two new data files like you did earlier on the yeast files.
Expand |
---|
|
Code Block |
---|
cds; cd raw_data
module load fastqc
fastqc human_rnaseq.fastq.gz
fastqc human_mirnaseq.fastq.gz |
|
...