The University Wiki Service will undergo an upgrade on May 26th, 2022 from 8:00 PM to 10:00 PM.
During this time the service will be unavailable, please save all changes before maintenance begins.

Please refer to the Confluence upgrade release notes for a list of changes.
If you have any questions, please email help@wikis.utexas.edu or call the UT Service Desk at 512-475-9400, thank you!
Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Code Block
languagebash
titleGet the alignment exercises files
mkdir -p $SCRATCH/core_ngs/alignment/fastq
mkdir -p $SCRATCH/core_ngs/references/fasta
cp $CORENGS/alignment/*fastq.gz $SCRATCH/core_ngs/alignment/fastq/
cp $CORENGS/references/*.*fa $SCRATCH/core_ngs/references/fasta/

...

We can quickly index the references for the yeast genome, the human miRNAs, and the V. cholerae genome, because they are all small, so we'll build each index from the appropriate FASTA files right before we use them. We will also obtain the special GenBank file that contains both the V. cholerae genome sequence and annotations (a .gbk file). These FASTA files, which you staged above, are: 

Code Block
languagebash
titleReference FASTA locations
/work/projects/BioITeam/projects/courses/Core_NGS_Tools/references/sacCer3.fa
/work/projects/BioITeam/projects/courses/Core_NGS_Tools/references/hairpin_cDNA_hsa.fa
/work/projects/BioITeam/projects/courses/Core_NGS_Tools/references/vibCho.O395.gbk

hg19 is way too big for us to index here so we will use an existing set of BWA hg19 index files located at:

...