Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • H54_miRNA_L004.cuta.log, H54_miRNA_L005.cuta.log, yeast_rnaseq.cuta.log
    • these are the main execution log files, one for each trim_adapters.sh command
  • H54_miRNA_L004.acut.pass0.log, H54_miRNA_L005.acut.pass0.log
    • these are cutadapt statistics files for the single-end adapter trimming
    • their contents will look like our small example above
  • yeast_rnaseq.acut.pass1.log, yeast_rnaseq.acut.pass2.log
    • these are cutadapt statistics files from trimming the R1 and R2 adapters, respectively.

Take a look at the first 13 17 lines of the yeast_rnaseq.acut.pass1.log log file:

Expand
titleHint
Code Block
languagebash
head -1317 yeast_rnaseq.acut.pass1.log


...

Code Block
titlecutadapt log file
This is cutadapt 1.1018 with Python 23.7.121
Command line parameters: -m 32 -a GATCGGAAGAGCACACGTCTGAACTCCAGTCAC --trim-n --paired-output yeast_rnaseq_R2.tmp.cuta.fastq -o yeast_rnaseq_R1.tmp.cuta.fastq Yeast_RNAseq_L002_R1.fastq.gz Yeast_RNAseq_L002_R2.fastq.gz
Trimming 1 adapter with at most 10.0% errors in paired-end legacy mode ...Processing reads on 1 core in paired-end legacy mode ...
WARNING: Legacy mode is enabled. Read modification and filtering options
*ignore* the second read. To switch to regular paired-end mode,
provide the --pair-filter=any option or use any of the
-A/-B/-G/-U/--interleaved options.
Finished in 126151.0554 s (2024 us/read; 32.0755 M reads/minute).

=== Summary ===

Total read pairs processed:          6,440,847
  Read 1 with adapter:               3,875,741 (60.2%)
  Read 2 with adapter:                       0 (0.0%)
Pairs that were too short:             112,845847 (1.8%)
Pairs written (passing filters):     6,328,002000 (98.2%)

  • cutadapt started with 6,440,847 read pairs
    • 112,845 847 reads were discarded as too short after the R1 adapter was removed
      • the same 112,845 847 reads were discarded from both the R1 and R2 files
    • the remaining 6,328,002 000 were then subjected to pass2 processing.

...

Code Block
titlecutadapt log file
ls5:/scratch/01063/abattenh/core_ngs/cutadapt$ head -13 yeast_rnaseq.acut.pass2.log
This is cutadapt 1.1018 with Python 23.7.121
Command line parameters: -m 32 -a TGATCGTCGGACTGTAGAACTCTGAACGTGTAGA --paired-output yeast_rnaseq_R1.cuta.fastq -o yeast_rnaseq_R2.cuta.fastq yeast_rnaseq_R2.tmp.cuta.fastq yeast_rnaseq_R1.tmp.cuta.fastq
TrimmingProcessing 1reads adapteron with at most 10.0% errors1 core in paired-end legacy mode ...
Finished in 9783.4064 s (1513 us/read; 34.9054 M reads/minute).

=== Summary ===

Total read pairs processed:          6,328,002000
  Read 1 with adapter:                  90,848 (1.4%)
  Read 2 with adapter:                       0 (0.0%)
Pairs that were too short:                   0 (0.0%)
Pairs written (passing filters):     6,328,002,000 (100.0%)

Total basepairs processed: 1,198,172,994 bp
  Read 1:   639,128,000 bp
  Read 2:   559,044,994 bp
Total written (filtered):  1,197,894,462 bp (100.0%)
  • cutadapt started with 6,328,002 000 pass1 reads
    • no additional reads were discarded as too short after the R2 adapter was removed
    • so new R1 and R2 files, both with 6,328,002 000 reads, were produced
      • yeast_rnaseq_R1.cuta.fastq.gz and yeast_rnaseq_R2.cuta.fastq.gz

Exercise: Verify that both adapter-trimmed yeast_rnaseq fastq files have 6,328,002 000 reads

Expand
titleHint
Code Block
languagebash
echo "$((`zcat yeast_rnaseq_R1.cuta.fastq.gz | wc -l` / 4))"
zcat yeast_rnaseq_R2.cuta.fastq.gz | wc -l | awk '{print $1/4}'


...