MultiQC Report

A modular tool to aggregate results from bioinformatics analyses across many samples into a single report.

JavaScript Disabled

MultiQC reports use JavaScript for plots and toolbox functions. It looks like you have JavaScript disabled in your web browser. Please note that many of the report functions will not work as intended.

Loading report..

Report generated on 2019-05-22, 17:06 based on data in: /nfs/users2/bi/projects/training/RNAseq/2019/challenge/QC

General Statistics

Showing ¹⁸/₁₈ rows and ⁹/₁₁ columns.

Sample Name	5'-3' bias	M Aligned	% Aligned	M Aligned	% Aligned	M Aligned	% Dups	% GC	Length	% Failed	M Seqs
SRR7939021	1.13	40.3	83.8%	34.3	92.5%	37.8
SRR7939021_1							59.7%	48%	101 bp	25%	40.9
SRR7939021_2							56.6%	49%	101 bp	25%	40.9
SRR7939022	1.14	33.3	84.6%	28.5	92.3%	31.1
SRR7939022_1							57.9%	48%	101 bp	25%	33.7
SRR7939022_2							54.6%	49%	101 bp	25%	33.7
SRR7939023	1.12	46.4	84.2%	39.7	92.5%	43.6
SRR7939023_1							61.7%	48%	101 bp	25%	47.1
SRR7939023_2							59.0%	49%	101 bp	25%	47.1
SRR7939024	1.12	44.9	84.0%	38.2	92.5%	42.1
SRR7939024_1							61.7%	48%	101 bp	25%	45.5
SRR7939024_2							58.6%	49%	101 bp	25%	45.5
SRR7939025	1.13	42.0	84.5%	36.0	92.4%	39.3
SRR7939025_1							61.1%	48%	101 bp	25%	42.6
SRR7939025_2							57.8%	49%	101 bp	25%	42.6
SRR7939026	1.14	50.0	84.1%	42.7	92.2%	46.8
SRR7939026_1							63.1%	48%	101 bp	25%	50.8
SRR7939026_2							58.2%	49%	101 bp	25%	50.8

Uncheck the tick box to hide columns. Click and drag the handle on the left to change order.

Sort	Group	Column	Description	ID	Scale
\|\|	QualiMap	5'-3' bias	5'-3' bias	`5_3_bias`	None
\|\|	QualiMap	M Aligned	Reads Aligned (millions)	`reads_aligned`	read_count
\|\|	Salmon	% Aligned	% Mapped reads	`percent_mapped`	None
\|\|	Salmon	M Aligned	Mapped reads (millions)	`num_mapped`	read_count
\|\|	STAR	% Aligned	% Uniquely mapped reads	`uniquely_mapped_percent`	None
\|\|	STAR	M Aligned	Uniquely mapped reads (millions)	`uniquely_mapped`	read_count
\|\|	FastQC	% Dups	% Duplicate Reads	`percent_duplicates`	None
\|\|	FastQC	% GC	Average % GC Content	`percent_gc`	None
\|\|	FastQC	Length	Average Sequence Length (bp)	`avg_sequence_length`	None
\|\|	FastQC	% Failed	Percentage of modules failed in FastQC report (includes those not plotted here)	`percent_fails`	None
\|\|	FastQC	M Seqs	Total Sequences (millions)	`total_sequences`	read_count

QualiMap

QualiMap is a platform-independent application to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.

Genomic origin of reads

Classification of mapped reads as originating in exonic, intronic or intergenic regions. These can be displayed as either the number or percentage of mapped reads.

There are currently three main approaches to map reads to transcripts in an RNA-seq experiment: mapping reads to a reference genome to identify expressed transcripts that are annotated (and discover those that are unknown), mapping reads to a reference transcriptome, and de novo assembly of transcript sequences (Conesa et al. 2016).

For RNA-seq QC analysis, QualiMap can be used to assess alignments produced by the first of these approaches. For input, it requires a GTF annotation file along with a reference genome, which can be used to reconstruct the exon structure of known transcripts. This allows mapped reads to be grouped by whether they originate in an exonic region (for QualiMap, this may include 5′ and 3′ UTR regions as well as protein-coding exons), an intron, or an intergenic region (see the Qualimap 2 documentation).

The inferred genomic origins of RNA-seq reads are presented here as a bar graph showing either the number or percentage of mapped reads in each read dataset that have been assigned to each type of genomic region. This graph can be used to assess the proportion of useful reads in an RNA-seq experiment. That proportion can be reduced by the presence of intron sequences, especially if depletion of ribosomal RNA was used during sample preparation (Sims et al. 2014). It can also be reduced by off-target transcripts, which are detected in greater numbers at the sequencing depths needed to detect poorly-expressed transcripts (Tarazona et al. 2011).

Gene Coverage Profile

Mean distribution of coverage depth across the length of all mapped transcripts.

For RNA-seq QC analysis, QualiMap can be used to assess alignments produced by the first of these approaches. For input, it requires a GTF annotation file along with a reference genome, which can be used to reconstruct the exon structure of known transcripts. QualiMap uses this information to calculate the depth of coverage along the length of each annotated transcript. For a set of reads mapped to a transcript, the depth of coverage at a given base position is the number of high-quality reads that map to the transcript at that position (Sims et al. 2014).

QualiMap calculates coverage depth at every base position of each annotated transcript. To enable meaningful comparison between transcripts, base positions are rescaled to relative positions expressed as percentage distance along each transcript (0%, 1%, …, 99%). For the set of transcripts with at least one mapped read, QualiMap plots the cumulative mapped-read depth (y-axis) at each relative transcript position (x-axis). This plot shows the gene coverage profile across all mapped transcripts for each read dataset. It provides a visual way to assess positional biases, such as an accumulation of mapped reads at the 3′ end of transcripts, which may indicate poor RNA quality in the original sample (Conesa et al. 2016).

Salmon

Salmon is a tool for quantifying the expression of transcripts using RNA-seq data.

STAR

STAR is an ultrafast universal RNA-seq aligner.

Alignment Scores

Gene Counts

Statistics from results generated using --quantMode GeneCounts. The three tabs show counts for unstranded RNA-seq, counts for the 1st read strand aligned with RNA and counts for the 2nd read strand aligned with RNA.

FastQ Screen

FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect.

FastQC

FastQC is a quality control tool for high throughput sequence data, written by Simon Andrews at the Babraham Institute in Cambridge.

Sequence Quality Histograms

12

0

0

The mean quality value across each base position in the read. See the FastQC help.

Per Sequence Quality Scores

12

0

0

The number of reads with average quality scores. Shows if a subset of reads has poor quality. See the FastQC help.

Per Base Sequence Content

0

0

12

The proportion of each base position for which each of the four normal DNA bases has been called. See the FastQC help.

Click a sample row to see a line plot for that dataset.

Rollover for sample name

Position: -

%T: -

%C: -

%A: -

%G: -

Per Sequence GC Content

5

7

0

The average GC content of reads. Normal random library typically have a roughly normal distribution of GC content. See the FastQC help.

Per Base N Content

12

0

0

The percentage of base calls at each position for which an N was called. See the FastQC help.

Sequence Length Distribution

12

0

0

All samples have sequences of a single length (101bp).

Sequence Duplication Levels

0

0

12

The relative level of duplication found for every sequence. See the FastQC help.

Overrepresented sequences

12

0

0

The total amount of overrepresented sequences found in each library. See the FastQC help for further information.

12 samples had less than 1% of reads made up of overrepresented sequences

Adapter Content

12

0

0

The cumulative percentage count of the proportion of your library which has seen each of the adapter sequences at each position. See the FastQC help. Only samples with ≥ 0.1% adapter contamination are shown.

v1.3 (785738d)

MultiQC Toolbox

Highlight Samples

Rename Samples

Show / Hide Samples

Export Plots

Choose Plots

Save Settings

Load Settings

About MultiQC

General Statistics

QualiMap

Genomic origin of reads

Gene Coverage Profile

Salmon

STAR

Alignment Scores

Gene Counts

FastQ Screen

FastQC

Sequence Quality Histograms

12

0

0

Per Sequence Quality Scores

12

0

0

Per Base Sequence Content

0

0

12

Rollover for sample name

Per Sequence GC Content

5

7

0

Per Base N Content

12

0

0

Sequence Length Distribution

12

0

0

Sequence Duplication Levels

0

0

12

Overrepresented sequences

12

0

0

Adapter Content

12

0

0

Toggle navigation v1.3 (785738d)

MultiQC Toolbox

Apply Highlight Samples

Apply Rename Samples

Apply Show / Hide Samples

Export Plots

Choose Plots

Save Settings

Load Settings

About MultiQC

General Statistics

General Statistics: Columns

QualiMap

Genomic origin of reads Help

Gene Coverage Profile Help

Salmon

STAR

Alignment Scores

Gene Counts

FastQ Screen

FastQC

Sequence Quality Histograms 12 0 0

Per Sequence Quality Scores 12 0 0

Per Base Sequence Content 0 0 12

Rollover for sample name

Per Sequence GC Content 5 7 0

Per Base N Content 12 0 0

Sequence Length Distribution 12 0 0

Sequence Duplication Levels 0 0 12

Overrepresented sequences 12 0 0

Adapter Content 12 0 0

v1.3 (785738d)

Highlight Samples

Rename Samples

Show / Hide Samples

Genomic origin of reads

Gene Coverage Profile

Sequence Quality Histograms

12

0

0

Per Sequence Quality Scores

12

0

0

Per Base Sequence Content

0

0

12

Per Sequence GC Content

5

7

0

Per Base N Content

12

0

0

Sequence Length Distribution

12

0

0

Sequence Duplication Levels

0

0

12

Overrepresented sequences

12

0

0

Adapter Content

12

0

0