These samples were run by seq2science v0.3.0, a tool for easy preprocessing of NGS data.
Take a look at our docs for info about how to use this report to the fullest.
- Contact E-mail
- yourmail@here.com
- Workflow
- hendrik
- Date
- November 03, 2020
Report generated on 2020-11-04, 11:35 based on data in:
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241.fastp.json
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238.tsv
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243.fastp.json
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/plotCorrelation/hg38.tsv
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239.strandedness.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243.strandedness.txt
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238.fastp.json
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242.strandedness.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239.fastp.json
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241.strandedness.txt
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245.tsv
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243.tsv
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/qc/samplesconfig_mqc.html
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241.tsv
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/plotPCA/hg38.tsv
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244.tsv
/scratch/siebrenf/hendrik/results2/qc/markdup/hg38-White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243.samtools-coordinate.metrics.txt
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238.strandedness.txt
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240.strandedness.txt
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244.strandedness.txt
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245.fastp.json
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242.tsv
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240.tsv
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/InsertSizeMetrics/hg38-White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239.tsv
/scratch/siebrenf/hendrik/results2/star/hg38-White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241.samtools-coordinate-unsieved.bam.mtnucratiomtnuc.json
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/star/hg38-White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/samtools_stats/final_bam/hg38-White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239.samtools-coordinate.samtools_stats.txt
/scratch/siebrenf/hendrik/results2/qc/strandedness/hg38-White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245.strandedness.txt
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244.fastp.json
/scratch/siebrenf/hendrik/results2/qc/clustering/hg38-Sample_clustering_mqc.png
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242.fastp.json
/scratch/siebrenf/hendrik/results2/qc/trimming/White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240.fastp.json
Change sample names:
General Statistics
Showing 8/8 rows and 13/28 columns.Sample Name | % Duplication | GC content | % PF | % Adapter | Insert Size | % Dups | % Mapped | M Total seqs | % Proper Pairs | M Total seqs | Genome coverage | M Genome reads | M MT genome reads |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238 | 11.2% | 51.7% | 98.2% | 0.1% | 203 bp | 17.0% | 100.0% | 39.3 | 100.0% | 28.2 | 0.8 X | 58.4 | 0.5 |
White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239 | 7.9% | 52.5% | 98.0% | 0.1% | 189 bp | 14.1% | 100.0% | 39.8 | 100.0% | 29.1 | 0.8 X | 60.7 | 0.6 |
White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240 | 17.3% | 52.4% | 97.8% | 0.1% | 190 bp | 23.3% | 100.0% | 43.0 | 100.0% | 28.0 | 0.9 X | 67.2 | 0.4 |
White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241 | 14.8% | 52.0% | 98.1% | 0.1% | 184 bp | 21.5% | 100.0% | 41.4 | 100.0% | 27.8 | 0.9 X | 62.5 | 0.5 |
White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242 | 22.2% | 51.0% | 98.0% | 0.1% | 190 bp | 27.1% | 100.0% | 45.3 | 100.0% | 29.0 | 0.9 X | 65.6 | 0.4 |
White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243 | 16.9% | 50.5% | 98.1% | 0.1% | 191 bp | 21.1% | 100.0% | 36.5 | 100.0% | 25.5 | 0.7 X | 51.7 | 0.3 |
White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244 | 6.0% | 50.4% | 97.6% | 0.1% | 194 bp | 10.9% | 100.0% | 32.1 | 100.0% | 25.1 | 0.6 X | 45.8 | 0.4 |
White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245 | 5.5% | 52.5% | 98.0% | 0.1% | 191 bp | 9.8% | 100.0% | 38.4 | 100.0% | 23.7 | 1.5 X | 105.6 | 0.4 |
Sample clustering
fastp
fastp An ultra-fast all-in-one FASTQ preprocessor (QC, adapters, trimming, filtering, splitting...)
Filtered Reads
Filtering statistics of sampled reads.
Duplication Rates
Duplication rates of sampled reads.
Insert Sizes
Insert size estimation of sampled reads.
Sequence Quality
Average sequencing quality over each base of all reads.
GC Content
Average GC content over each base of all reads.
N content
Average N content over each base of all reads.
Picard
Picard is a set of Java command line tools for manipulating high-throughput sequencing data.
Insert Size
Plot shows the number of reads at a given insert size. Reads with different orientations are summed.
Mark Duplicates
Number of reads, categorised by duplication state. Pair counts are doubled - see help text for details.
The table in the Picard metrics file contains some columns referring read pairs and some referring to single reads.
To make the numbers in this plot sum correctly, values referring to pairs are doubled according to the scheme below:
READS_IN_DUPLICATE_PAIRS = 2 * READ_PAIR_DUPLICATES
READS_IN_UNIQUE_PAIRS = 2 * (READ_PAIRS_EXAMINED - READ_PAIR_DUPLICATES)
READS_IN_UNIQUE_UNPAIRED = UNPAIRED_READS_EXAMINED - UNPAIRED_READ_DUPLICATES
READS_IN_DUPLICATE_PAIRS_OPTICAL = 2 * READ_PAIR_OPTICAL_DUPLICATES
READS_IN_DUPLICATE_PAIRS_NONOPTICAL = READS_IN_DUPLICATE_PAIRS - READS_IN_DUPLICATE_PAIRS_OPTICAL
READS_IN_DUPLICATE_UNPAIRED = UNPAIRED_READ_DUPLICATES
READS_UNMAPPED = UNMAPPED_READS
SamTools pre-sieve
Samtools is a suite of programs for interacting with high-throughput sequencing data.
The pre-sieve statistics are quality metrics measured before applying (optional) minimum mapping quality, blacklist removal, mitochondrial read removal, and tn5 shift.Percent Mapped
Alignment metrics from samtools stats
; mapped vs. unmapped reads.
For a set of samples that have come from the same multiplexed library, similar numbers of reads for each sample are expected. Large differences in numbers might indicate issues during the library preparation process. Whilst large differences in read numbers may be controlled for in downstream processings (e.g. read count normalisation), you may wish to consider whether the read depths achieved have fallen below recommended levels depending on the applications.
Low alignment rates could indicate contamination of samples (e.g. adapter sequences), low sequencing quality or other artefacts. These can be further investigated in the sequence level QC (e.g. from FastQC).
Alignment metrics
This module parses the output from samtools stats
. All numbers in millions.
SamTools post-sieve
Samtools is a suite of programs for interacting with high-throughput sequencing data.
The post-sieve statistics are quality metrics measured after applying (optional) minimum mapping quality, blacklist removal, mitochondrial read removal, and tn5 shift.Percent Mapped
Alignment metrics from samtools stats
; mapped vs. unmapped reads.
For a set of samples that have come from the same multiplexed library, similar numbers of reads for each sample are expected. Large differences in numbers might indicate issues during the library preparation process. Whilst large differences in read numbers may be controlled for in downstream processings (e.g. read count normalisation), you may wish to consider whether the read depths achieved have fallen below recommended levels depending on the applications.
Low alignment rates could indicate contamination of samples (e.g. adapter sequences), low sequencing quality or other artefacts. These can be further investigated in the sequence level QC (e.g. from FastQC).
Alignment metrics
This module parses the output from samtools stats
. All numbers in millions.
deepTools
deepTools is a suite of tools to process and analyze deep sequencing data.
Correlation heatmap
Pairwise correlations of samples based on distribution of sequence reads
PCA plot
PCA plot with the top two principal components calculated based on genome-wide distribution of sequence reads
Strandedness
Strandedness package provides a number of useful modules that can comprehensively evaluate high throughput RNA-seq data.
Sequencing strandedness was inferred for the following samples, and was called if 60% of the sampled reads were explained by either sense (forward) or antisense (reverse).Infer experiment
Infer experiment counts the percentage of reads and read pairs that match the strandedness of overlapping transcripts. It can be used to infer whether RNA-seq library preps are stranded (sense or antisense).
Samples & Config
sample | assembly | descriptive_name | letter | stimulation | number |
---|---|---|---|---|---|
White-Bloodcells-Morsche-MDL1-C-nonstimulated-0-16238 | hg38 | MDL1-CU0 | C | no | zero |
White-Bloodcells-Morsche-MDL2-C-nonstimulated-1-16239 | hg38 | MDL2-CU1 | C | no | one |
White-Bloodcells-Morsche-MDL3-C-stimulated-0-16240 | hg38 | MDL3-CS0 | C | yes | zero |
White-Bloodcells-Morsche-MDL4-C-stimulated-1-16241 | hg38 | MDL4-CS1 | C | yes | one |
White-Bloodcells-Morsche-MDL5-L-nonstimulated-0-16242 | hg38 | MDL5-LU0 | L | no | zero |
White-Bloodcells-Morsche-MDL6-L-nonstimulated-1-16243 | hg38 | MDL6-LU1 | L | no | one |
White-Bloodcells-Morsche-MDL7-L-stimulated-0-16244 | hg38 | MDL7-LS0 | L | yes | zero |
White-Bloodcells-Morsche-MDL8-L-stimulated-1-16245 | hg38 | MDL8-LS1 | L | yes | one |
# tab-separated file of the samples
samples: samples.tsv
# pipeline file locations
result_dir: /scratch/siebrenf/hendrik/results2 # where to store results
genome_dir: /scratch/siebrenf/genomes # where to look for or download the genomes
fastq_dir: /scratch/siebrenf/hendrik/fastq # where to look for or download the fastqs
# contact info for multiqc report and trackhub
email: yourmail@here.com
# produce a UCSC trackhub?
create_trackhub: True
# how to handle replicates
technical_replicates: merge # change to "keep" to not combine them
# which trimmer to use
trimmer: fastp
# which quantifier to use
quantifier: htseq # or salmon or featurecounts
##### aligner and filter options are not used for the gene counts matrix if the quantifier is Salmon
# which aligner to use
aligner: star
# filtering after alignment
remove_blacklist: True
min_mapping_quality: 255 # (only keep uniquely mapped reads from STAR alignments)
only_primary_align: True
##### differential gene expression analysis (optional) #####
#deseq2:
# multiple_testing_procedure: BH
# alpha_value: 0.1
# shrinkage_estimator: apeglm
#contrasts:
# - 'stage_2_1'
# - 'stage_all_1'