Loading report..

Highlight Samples

Regex mode off

    Rename Samples

    Click here for bulk input.

    Paste two columns of a tab-delimited table here (eg. from Excel).

    First column should be the old name, second column the new name.

    Regex mode off

      Show / Hide Samples

      Regex mode off

        Explain with AI

        Configure AI settings to get explanations of plots and data in this report.

        Keys entered here will be stored in your browser's local storage. See the docs.


        Anonymize samples off

        Export Plots

        px
        px
        X

        Download the raw data used to create the plots in this report below:

        Note that additional data was saved in multiqc_data when this report was generated.


        Choose Plots

        If you use plots from MultiQC in a publication or presentation, please cite:

        MultiQC: Summarize analysis results for multiple tools and samples in a single report
        Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
        Bioinformatics (2016)
        doi: 10.1093/bioinformatics/btw354
        PMID: 27312411
        Settings are automatically saved. You can also save named configurations below.

        Save Settings

        You can save the toolbox settings for this report to the browser or as a file.


        Load Settings

        Choose a saved report profile from the browser or load from a file:

          Load from File

        Tool Citations

        Please remember to cite the tools that you use in your analysis.

        To help with this, you can download publication details of the tools mentioned in this report:

        About MultiQC

        This report was generated using MultiQC, version 1.27.1

        You can see a YouTube video describing how to use MultiQC reports here: https://youtu.be/qPbIlO_KWN0

        For more information about MultiQC, including other videos and extensive documentation, please visit http://multiqc.info

        You can report bugs, suggest improvements and find the source code for MultiQC on GitHub: https://github.com/MultiQC/MultiQC

        MultiQC is published in Bioinformatics:

        MultiQC: Summarize analysis results for multiple tools and samples in a single report
        Philip Ewels, Måns Magnusson, Sverker Lundin and Max Käller
        Bioinformatics (2016)
        doi: 10.1093/bioinformatics/btw354
        PMID: 27312411

        A modular tool to aggregate results from bioinformatics analyses across many samples into a single report.

        Report generated on 2025-03-26, 09:23 CET based on data in:
        • /bank/experiments/2025-03-lore/qc/general_stats.tsv
        • /bank/experiments/2025-03-lore/qc/reads_per_step.tsv
        • /bank/experiments/2025-03-lore/qc/skera.tsv
        • /bank/experiments/2025-03-lore/qc/lima.tsv
        • /bank/experiments/2025-03-lore/qc/isoseq_correct_barcodes.tsv
        • /bank/experiments/2025-03-lore/qc/isoseq_correct.tsv
        • /bank/experiments/2025-03-lore/qc/isoseq_bcstats.tsv
        • /bank/experiments/2025-03-lore/qc/isoseq_collapse.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_classifications_by_read.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_classifications.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_rt_switching.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_genes.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_filter_reasons.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_classifications_by_cell.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_classifications_by_transcript.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_classifications_by_isoform.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_classifications_by_mapping.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_classify_junctions.tsv
        • /bank/experiments/2025-03-lore/qc/pigeon_report.tsv
        • /bank/experiments/2025-03-lore/qc/segmented.knee_mqc.png
        • /bank/experiments/2025-03-lore/pbmm2/segmented.stats.tsv
        • /bank/experiments/2025-03-lore/pbmm2/segmented.coverage.tsv
        • /bank/experiments/2025-03-lore/pbmm2/segmented.bam.mtnucratiomtnuc.json
        • /bank/experiments/2025-03-lore/qc

        General Statistics

        Showing 1/1 rows and 7/21 columns.
        Sample NameReads in cells% Reads in cellsMT genome coverageGenome coverageMT to Nuclear RatioM Genome readsM MT genome readsError rateNon-primaryReads mapped% Mapped% Proper pairs% MapQ 0 readsTotal seqsMean insertReadsBasesCoverageMean depthMean BQMean MQ
        segmented
        14.6M
        16.0%
        11896.8X
        2.9X
        4168.5
        9.2M
        0.2M
        0.37%
        0.0M
        9.3M
        99.8%
        0.0%
        8.8%
        9.3M
        0.0bp
        9.4M
        461.2Mb
        14.9%
        2.9x
        38.8
        55.3

        Reads per processing step

        The number of reads remaining after each processing step. The final reads are deduplicated and aligned to the genome/transcriptome.

        Legend:
        ccs_reads: Circular consensus sequencing (HiFi) reads (Skera)
        s_reads: Segmented reads (Skera)
        primed_reads: Primed reads (Lima)
        fl_reads: Full Length reads (Iso-seq correct)
        flnc_reads: Full Length Non-Chimeric reads (Iso-seq correct)
        polya_reads: PolyA-tailed reads (Iso-seq correct)
        non-missing: Barcoded reads (Iso-seq correct)
        yield_reads: Estimated reads in cells (Iso-seq correct).

        Created with MultiQC

        Skera

        Deconcatenates Kinnex HiFi reads to produce S-reads that represent the original cDNA molecules.

        Showing 1/1 rows and 5/5 columns.
        samplereadss_readsmean_len_s_readspercent_full_arraymean_array_size
        segmented
        6104086.0
        91323074.0
        981.0
        87.5
        15.0

        Lima

        Removes and spurious false positives.

        Showing 1/1 rows and 15/15 columns.
        sampleReads inputReads above all thresholds (A)Reads below any threshold (B) (B) Below min length (B) Below min score (B) Below min end score (B) Below min passes (B) Below min score lead (B) Below min ref span (B) Without SMRTbell adapter (B) Wrong different pair (B) Undesired 5p--5p pairs (B) Undesired 3p--3p pairs (A) With same pair (A) With different pair
        segmented
        91323074
        90692203
        630871
        12350
        0
        228466
        6
        0
        490179
        0
        0
        192404
        231319
        0
        90692203

        Iso-seq correct

        Identifies cell barcode errors and corrects them. Additionally, Iso-seq correct estimates which reads are likely to originate from real cells vs ambrient RNA.

        Stats

        Showing 1/1 rows and 10/10 columns.
        sampleprocessed_readsfiltered_readsprocessed_basesedit_read_countunchanged_read_countmissing_read_countfound_but_failing_read_countfailing_read_countyield_countyield_fraction
        segmented
        89851332.0
        0.0
        76851365691.0
        89358088.0
        3050.0
        490194.0
        74756392.0
        75246586.0
        14604746.0
        0.2

        Barcode corrections

        Created with MultiQC

        Barcode stats

        Showing 1/1 rows and 10/10 columns.
        samplenumber_unique_groupbarcodesnumber_unique_molbarcodestotal_number_readsmean_groupbarcode_depthcutoff_thresholdnumber_of_cellsmedian_umis_per_cellreads_in_cellsfraction_reads_in_cellsmean_reads_per_cell
        segmented
        1303203.0
        67371906.0
        89851332.0
        68.0
        1318.0
        13791.0
        2505.0
        57425785.0
        0.6
        4164.0

        Knee plots


        Samtools

        Version: 1.21

        Toolkit for interacting with BAM/CRAM files.URL: http://www.htslib.orgDOI: 10.1093/bioinformatics/btp352

        Percent mapped

        Alignment metrics from samtools stats; mapped vs. unmapped reads vs. reads mapped with MQ0.

        For a set of samples that have come from the same multiplexed library, similar numbers of reads for each sample are expected. Large differences in numbers might indicate issues during the library preparation process. Whilst large differences in read numbers may be controlled for in downstream processings (e.g. read count normalisation), you may wish to consider whether the read depths achieved have fallen below recommended levels depending on the applications.

        Low alignment rates could indicate contamination of samples (e.g. adapter sequences), low sequencing quality or other artefacts. These can be further investigated in the sequence level QC (e.g. from FastQC).

        Reads mapped with MQ0 often indicate that the reads are ambiguously mapped to multiple locations in the reference sequence. This can be due to repetitive regions in the genome, the presence of alternative contigs in the reference, or due to reads that are too short to be uniquely mapped. These reads are often filtered out in downstream analyses.

        Created with MultiQC

        Alignment stats

        This module parses the output from samtools stats. All numbers in millions.

        Created with MultiQC

        Coverage: global stats

        Stats parsed from samtools coverage output, and summarized (added up or weighted-averaged) across all regions.

        Showing 1/1 rows and 6/6 columns.
        Sample NameReadsBasesCoverageMean depthMean BQMean MQ
        segmented
        9.4M
        461.2Mb
        14.9%
        2.9x
        38.8
        55.3

        Coverage: stats per region

        Per-region stats parsed from samtools coverage output.

        Created with MultiQC

        Iso-seq collapse

        Collapses redundant transcripts into unique isoforms based on exonic structures.

        Showing 1/1 rows and 2/2 columns.
        samplenumber_mapped_unique_isoformsnumber_of_mapped_unique_loci
        segmented
        980929
        84131

        Pigeon Classify

        Pigeon is used to classify isoforms into categories, filter this output, and to report on the gene and isoform- level saturation.

        Classifications overview

        Showing 2/2 rows and 5/5 columns.
        SampleInputPassedRemovedUnique genesUnique transcripts
        segmented
        980929
        None
        None
        478784
        833527
        segmented (filtered)
        980929
        197700
        783229
        21757
        133186

        Classifications by read

        Showing 2/2 rows and 9/9 columns.
        SampleFull splice matchIncomplete splice matchNovel in catalogNovel not in catalogAntisenseGenic intronGenic genomicIntergenicOther
        segmented
        4043350
        1824365
        121922
        267929
        330231
        1505
        464837
        1797957
        19680
        segmented (filtered)
        3795951
        571401
        91983
        103494
        1409
        0
        2165
        2927
        6392

        Classifications by isoform

        Showing 2/2 rows and 9/9 columns.
        SampleFull splice matchIncomplete splice matchNovel in catalogNovel not in catalogAntisenseGenic intronGenic genomicIntergenicOther
        segmented
        64158
        154179
        41888
        125335
        35853
        423
        116562
        438211
        4320
        segmented (filtered)
        50512
        67228
        30711
        43780
        912
        0
        887
        1927
        1743

        Classifications by cell

        Showing 2/2 rows and 4/4 columns.
        Samplemedian_genes_per_cellmedian_transcripts_per_cellmedian_genes_per_cell_knownmedian_transcripts_per_cell_known
        segmented
        4.0
        4.0
        3.0
        2.0
        segmented (filtered)
        4.0
        4.0
        4.0
        4.0

        Classifications by transcript

        Showing 2/2 rows and 4/4 columns.
        Sampletranscripts_fsmtranscripts_ismtranscripts_nictranscripts_nnc
        segmented
        64158.0
        154179.0
        41888.0
        125335.0
        segmented (filtered)
        50512.0
        67228.0
        30711.0
        43780.0

        Classifications by mapping

        Showing 2/2 rows and 3/3 columns.
        Sampleflnc_mapped_genomeflnc_mapped_transcriptomeflnc_mapped_transcriptome_excluding_ribomito
        segmented
        8871776.0
        6257566.0
        4586174.0
        segmented (filtered)
        4575722.0
        4562829.0
        2965486.0

        Junctions

        Showing 2/2 rows and 4/4 columns.
        SampleKnown canonicalKnown non-canonicalNovel canonicalNovel non-canonical
        segmented
        1330326
        1460
        122771
        104366
        segmented (filtered)
        830116
        45
        72129
        0

        Genes

        Showing 2/2 rows and 2/2 columns.
        SampleKnownNovel
        segmented
        27198
        451586
        segmented (filtered)
        19371
        2386

        RT Switching

        Showing 2/2 rows and 4/4 columns.
        SampleAll transcriptsUnique transcriptsAll junctionsUnique junctions
        segmented
        24780
        1283
        25606
        25606
        segmented (filtered)
        None
        None
        None
        None

        Filter reasons

        Showing 2/2 rows and 4/4 columns.
        SampleIntraprimingMonoexonicRT switchingLow coverage/non-canonical
        segmented
        None
        None
        None
        None
        segmented (filtered)
        431098
        0
        13045
        339086

        Saturation reports

        Gene and isoform- level saturation. The tables show the number of unique genes found in a subsampled number of reads.

        Created with MultiQC

        Software Versions

        Software Versions lists versions of software tools extracted from file contents.

        SoftwareVersion
        Samtools1.21
        mtnucratio0.7.1