Introduction to RNA-seqWhere are we heading towards in this workshop?


Figure 1

Illustration of part of the central dogma of molecular biology, where DNA is transcribed to RNA, and intronic sequences are spliced out

Figure 2

Illustration of the major experimental steps of an RNA-seq experiment

Figure 3

A classification of many different factors affecting measurements obtained from an experiment into treatment, biological, technical and error effects

Figure 4

Illustration of a set of reads generated by a sequencer, and genomic and transcriptomic reference sequences

Figure 5

An example MA plotAn example heatmap


Downloading and organizing files


Figure 1

Basic gene annotation in GTF format
Basic gene annotation in GTF format

Figure 2

Reference genome file in FASTA format
Reference genome file in FASTA format

Figure 3

Transcript sequences file in FASTA format
Transcript sequences file in FASTA format

Figure 4

FASTQ files from GEO
FASTQ files from GEO

Figure 5

FASTQ files from SRA Run Selector
FASTQ files from SRA Run Selector

Quality control of RNA-seq reads


Figure 1

Per base sequence quality for a typical RNA-seq dataset
Per base sequence quality for a typical RNA-seq dataset

Figure 2

Per tile sequence quality plot
Per tile sequence quality plot

Figure 3

Distribution of per sequence quality scores
Distribution of per sequence quality scores

Figure 4

Per base sequence content for RNA-seq reads
Per base sequence content for RNA-seq reads

Figure 5

Per sequence GC content distribution
Per sequence GC content distribution

Figure 6

Per base N content plot
Per base N content plot

Figure 7

Sequence length distribution for RNA-seq data
Sequence length distribution for RNA-seq data

Figure 8

Sequence duplication levels
Sequence duplication levels

Figure 9

Adapter content across the read
Adapter content across the read

Figure 10

Example MultiQC summary across all samples
Example MultiQC summary across all samples

A. Genome-based quantification (STAR + featureCounts)


B. Transcript-based quantification (Salmon)


Gene-level QC and differential expression (DESeq2)


Figure 1

Open OnDemand interface
Open OnDemand interface

Figure 2

Distribution of gene biotypes in annotation
Distribution of gene biotypes in annotation

Figure 3

Total assigned reads per sample
Total assigned reads per sample

Figure 4

Size factors vs library size
Size factors vs library size

Figure 5

average counts vs variance
average counts vs variance

Figure 6

average counts vs variance after transformation
average counts vs variance after transformation

Figure 7

Eucledean distance heatmap
Eucledean distance heatmap

Figure 8

PCA plot showing sample clustering
PCA plot showing sample clustering

Figure 9

Dispersion estimates from DESeq2
Dispersion estimates from DESeq2

Figure 10

Volcano plot of differential expression results
Volcano plot of differential expression results

Gene set enrichment analysis